Selected Publications
For a full list, please see Google Scholar or Semantic Scholar.
* Denotes equal contribution from indicated authors.
Creative Text-to-Audio Generation via Synthesizer Programming
Manuel Cherep*, Nikhil Singh*, Jessica Shand
FigurA11y: AI Assistance for Writing Scientific Alt Text
Nikhil Singh, Lucy Lu Wang, Jonathan Bragg
Articulatory Synthesis of Speech and Diverse Vocal Sounds via Optimization
Luke Mo*, Manuel Cherep*, Nikhil Singh*, Quinn Langford, Pattie Maes
Superficial Alignment and Subtle Divergence in LLM Decision-Making
Manuel Cherep*, Nikhil Singh*, Pattie Maes
Contrastive Learning from Synthetic Audio Doppelgangers
Manuel Cherep*, Nikhil Singh*
Looking similar sounding different: Leveraging counterfactual cross-modal pairs for audiovisual representation learning
Nikhil Singh, Chih-Wei Wu, Iroro Orife, Mahdi Kalayeh
Human detection of political speech deepfakes across transcripts, audio, and video
Matthew Groh*, Aruna Sankaranarayanan*, Nikhil Singh, Dong Young Kim, Andrew Lippman, Rosalind Picard
Consent in Crisis: The Rapid Decline of the AI Data Commons
Shayne Longpre, Robert Mahari, Ariel Lee, Campbell Lund, Hamidah Oderinwale, William Brannon, Nayan Saxena, Naana Obeng-Marnu, Tobin South, Cole Hunter, Kevin Klyman, Christopher Klamm, Hailey Schoelkopf, Nikhil Singh, Manuel Cherep, Ahmad Anis, An Dinh, Caroline Chitongo, Da Yin, Damien Sileo, Deividas Mataciunas, Diganta Misra, Emad Alghamdi, Enrico Shippole, Jianguo Zhang, Joanna Materzynska, Kun Qian, Kush Tiwary, Lester Miranda, Manan Dey, Minnie Liang, Mohammed Hamdy, Niklas Muennighoff, Seonghyeon Ye, Seungone Kim, Shrestha Mohanty, Vipul Gupta, Vivek Sharma, Vu Minh Chien, Xuhui Zhou, Yizhi Li, Caiming Xiong, Luis Villa, Stella Biderman, Hanlin Li, Daphne Ippolito, Sara Hooker, Jad Kabbara, Sandy Pentland
Purrfect Pitch: Exploring Musical Interval Learning through Multisensory Interfaces
Sam Chin*, Cathy Mengying Fang*, Nikhil Singh, Ibrahim Ibrahim, Joe Paradiso, Pattie Maes
Ellie Talks About the Weather: Toward Evaluating the Expressive and Enrichment Potential of a Tablet-Based Speech Board in a Single Goffin’s Cockatoo
Jennifer Cunha, Corinne C Renguette, Nikhil Singh, Lily Stella, Megan Mcmahon, Hao Jin, Rebecca Kleinberger
SynthAX: A Fast Modular Synthesizer in JAX
Manuel Cherep*, Nikhil Singh*
Investigating the Physiological and Psychological Effect of an Interactive Musical Interface for Stress and Anxiety Reduction
Kimaya Lecamwasam, Samantha Gutierrez Arango, Nikhil Singh, Neska Elhaouij, Max Addae, Rosalind Picard
Voice at NIME: a Taxonomy of New Interfaces for Vocal Musical Expression
Rébecca Kleinberger, Nikhil Singh, Xiao Xiao, Akito van Troyer
Where to hide a stolen elephant: Leaps in creative writing with multimodal machine intelligence
Nikhil Singh*, Guillermo Bernal*, Daria Savchenko*, Elena L Glassman
A neural network solves, explains, and generates university math problems by program synthesis and few-shot learning at human level
Iddo Drori, Sarah Zhang, Reece Shuttleworth, Leonard Tang, Albert Lu, Elizabeth Ke, Kevin Liu, Linda Chen, Sunny Tran, Newman Cheng, Roman Wang, Nikhil Singh, Taylor L Patti, Jayson Lynch, Avi Shporer, Nakul Verma, Eugene Wu, Gilbert Strang
Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis
Nikhil Singh, Jeff Mentch, Jerry Ng, Matthew Beveridge, Iddo Drori
Image2Lego: Customized LEGO Set Generation from Images
Kyle Lennon, Katharina Fransen, Alexander O'Brien, Yumeng Cao, Matthew Beveridge, Yamin Arefeen, Nikhil Singh, Iddo Drori
The Sound Sketchpad: Expressively Combining Large and Diverse Audio Collections
Nikhil Singh