Shrikanth Narayanan

Vice President of Presidential Initiatives and University Professor and Niki & Max Nikias Chair in Engineering, Research Director and Distinguished Principal Scientist of the Information Sci...

Websites

Signal Analysis and Interpretation Lab

Is this your profile? Click to edit

Phone +1 213 740 6432

Email shri@sipi.usc.edu

Orcid ID CTSI Profile

Overview

Shrikanth (Shri) Narayanan is University Professor and holder of the Niki and Max Nikias Chair in Engineering at the University of Southern California (USC) and serves as the inaugural Vice President for Presidential Initiatives on the Senior Leadership Team of USC’s President. He is a Professor in the Signal and Image Processing Institute of USC’s Ming Hsieh Electrical & Computer Engineering department with joint appointments as Professor in Computer Science, Linguistics, Psychology, Neuroscience, Pediatrics and Otolaryngology-Head and Neck Surgery. He is also the inaugural director of the Ming Hsieh Institute, a Research Director for the Information Sciences Institute at USC and a Visiting Faculty Researcher at Google. He held the inaugural Viterbi Professorship in Engineering at USC (2007-2016). He was also a Research Area Director of the Integrated Media Systems Center, an NSF Engineering Research Center at USC, and was the Research Principal for the USC Pratt and Whitney Institute for Collaborative Engineering, a unique partnership between academia and industry (2003-2007).

Prior to USC, from 1995-2000, he was with AT&T Labs-Research, Florham Park and AT&T Bell Labaratories, Murray Hill–first as a Senior Member and later as a Principal Member of its Technical Staff. Shri Narayanan received his M.S., Engineer, and Ph.D., all in electrical engineering, from UCLA in 1990, 1992, and 1995, respectively, and his bachelor of engineering in electrical engineering from the College of Engineering, Guindy (Chennai, India) in 1988.

Shri Narayanan is a Fellow of the National Academy of Inventors (NAI), the Acoustical Society of America (ASA), the Institute of Electrical and Electronics Engineers (IEEE), the Association for Computing Machinery (ACM), the International Speech Communication Association (ISCA), the Association for Psychological Science (APS), the American Association for the Advancement of Science (AAAS), American Institute for Medical and Biological Engineering (AIMBE) and the Association for the Advancement of Affective Computing (AAAC). Shri Narayanan is a member of the European Academy of Sciences and Arts and a 2022 Guggenheim Fellow. He is also a member of the professional honor societies Tau Beta Pi, Phi Kappa Phi and Eta Kappa Nu.

Shri Narayanan has received several honors and awards including the 2025 IEEE James L. Flanagan Speech and Audio Processing Award (IEEE Technical Field Award), 2024 Edward J. McCluskey Technical Achievement Award from the IEEE Computer Society, the 2023 ISCA Medal for Scientific Achievement, 2023 IEEE SPS Claude Shannon-Harry Nyquist Technical Achievement Award (for contributions to spoken language processing technologies and their societal applications) from the IEEE Signal Processing Society and the 2020 ACM ICMI Sustained Accomplishment Award. His research publications have received the 2023 Richard Deswarte Prize in Digital History (for paper published in Digital Humanities Quarterly with Gabor Toth, Tim Hempel, Krishna Somandepalli), a 2018 ISCA Best Journal Paper Award (for paper published in Computer Speech and Language Journal with Ming Li and Kyu Han), the Ten Year Technical Impact Award from ACM ICMI in 2014, a 2009 Best Transactions (Journal) Paper award (with Chul Min Lee) and a 2005 Best Transactions Paper Award (with Alexandros Potamianos) from the IEEE Signal Processing society for papers published in the IEEE Transactions on Speech and Audio Processing.

Papers co-authored with his students have won recognition at MediaEval 2020 (Emotions and Themes in Music), ACM-AVEC 2018 Emotion Gold-standard Subchallenge, 2018 Behavioral, Economic, and Socio-Cultural Computing Conference (Distinguished Research on Digital Humanities) Interspeech 2016, ICASSP 2016, Interspeech2015-Nativeness Challenge, Interspeech2014-Cognitive Load Challenge, Interspeech2013-Paralinguistics Challenge, Interspeech 2013, Interspeech2012-Speaker Trait Challenge, Interspeech2011-Speaker State Challenge, InterSpeech 2010, InterSpeech 2009-Emotion Challenge, IEEE DCOSS 2009, IEEE MMSP 2007, IEEE MMSP 2006, ICASSP 2005 and ICSLP 2002.

Shri Narayanan received the Engineer’s Council 2015 Distinguished Engineering Educator Award, and was selected as IEEE Signal Processing Society Distinguished Lecturer for 2010-2011, the International Speech Communication Association (ISCA) Distinguished Lecturer for 2015-16, and the 2017 Willard R. Zemlin Memorial Lecturer for American Speech and Hearing Association (ASHA). Shri Narayanan has also received an NSF CAREER award, a Okawa Research Award, IBM Faculty Awards (2008, 2010), Google Faculty Research Award (2016), Amazon Research Award (2020), the 2011 UCLA Engineering Alumni Professional Achievement Award, a 2019 Distinguished Alumnus Award from College of Engineering – Guindy (India). Shri Narayanan has also been recognized at USC for his research, service and mentoring including with a USC Associates Award for Creativity in Research and Scholarship, a faculty fellowship for Interdisciplinary research, USC Viterbi Engineering Junior and Senior Research Awards and Use-inspired research award, USC Electrical Engineering Northrop-Grumman Research award, a Mellon award for mentoring excellence, and a USC Distinguished Faculty Service Award from the Academic Senate.

Shri Narayanan served as the inaugural VP for Education for the IEEE Signal Processing Society (2020-22). He is an Editor for the Computer, Speech and Language Journal and a Senior Editorial Board member for the APSIPA Transactions on Signal and Information Processing, having previously served as Editor-in-Chief for the IEEE Journal of Selected Topics in Signal Processing (2016-2018) and as an Associate Editor for the IEEE Transactions of Speech and Audio Processing (2000-2004), the IEEE Signal Processing Magazine (2005-2008), the IEEE Transactions on Multimedia (2008-2012), IEEE Transactions on Signal and Information Processing over Networks (2014-2015), the IEEE Transactions on Affective Computing (2010-2016), the Journal of Acoustical Society of America (2009-2016) and the APSIPA Transactions on Signal and Information Processing (2011-2020). He holds or has held positions on the Speech Communication and Acoustic Standards committees of the Acoustical Society of America and the Advisory Council of the International Speech Communication Association, the BigData SIG (2014-2017) the Speech Processing Technical Committee (2003-2007) and on the Multimedia Signal Processing technical committee (2005-2008; 2014-2020), Nomination & Appointments Committee (2023-24) of the IEEE Signal Processing Societya He has also served in several leadership roles in organizing conferences and workshops of professional societies such as IEEE, ISCA, and ACM. At USC, he was Chair of the Joint Provost-Senate University Research Committee (2006-09) and, a Past President of the Phi Kappa Phi Academic Honor Society (2007-08).

Awards

John Simon Guggenheim Foundation: Guggenheim Fellowship, 2022
European Academy of Sciences and Arts: Member, 2022
Association for the Advancement of Affective Computing: Fellow, 2022
USC: University Professor, 2020
ACM ICMI: Sustained Accomplishment Award, 2020
American Institute for Medical and Biological Engineering (AIMBE): Fellow, 2019
Association for Psychological Science: Fellow, 2018
National Academy of Inventors: Fellow, 2017
International Speech Communication Association: Fellow, 2016
International Speech Communication Association: Distinguished Lecturer, 2015-2016
Engineers Council: Distinguished Engineering Educator, 2015
IEEE Signal Processing Society: Distinguished Lecturer, 2010-2011
American Association for the Advancement of Science : Fellow, 2010
IEEE: Fellow, 2009
Acoustical Society of America: Fellow, 2005

Publications

Vertical larynx actions and intergestural timing stability in Hausa ejectives and implosives Phonetica. 2024 Dec 17; 81(6):559-597. . View in PubMed
Direct articulatory observation reveals phoneme recognition performance characteristics of a self-supervised speech model JASA Express Lett. 2024 11 01; 4(11). . View in PubMed
Method for assessing visual saliency in children with cerebral/cortical visual impairment using generative artificial intelligence Front Hum Neurosci. 2024; 18:1506286. . View in PubMed
Multimodal neuroimaging data from a 5-week heart rate variability biofeedback randomized clinical trial Sci Data. 2023 07 29; 10(1):503. . View in PubMed
Wearable and Mobile Technologies for the Evaluation and Treatment of Obsessive-Compulsive Disorder: Scoping Review JMIR Ment Health. 2023 Jul 18; 10:e45572. . View in PubMed
Creating musical features using multi-faceted, multi-task encoders based on transformers Sci Rep. 2023 Jul 03; 13(1):10713. . View in PubMed
Mel frequency spectral domain defenses against adversarial attacks on speech recognition systems JASA Express Lett. 2023 03; 3(3):035208. . View in PubMed
Automatic Analysis of Asymmetry in Facial Paralysis Patients Using Landmark-Based Measures Facial Plast Surg Aesthet Med. 2022 Nov-Dec; 24(6):491-493. . View in PubMed
Phone duration modeling for speaker age estimation in children J Acoust Soc Am. 2022 11; 152(5):3000. . View in PubMed
An Automated Quality Evaluation Framework of Psychotherapy Conversations with Local Quality Estimates Comput Speech Lang. 2022 Sep; 75. . View in PubMed
Interpersonal synchrony across vocal and lexical modalities in interactions involving children with autism spectrum disorder JASA Express Lett. 2022 Sep; 2(9):095202. . View in PubMed
TILES-2019: A longitudinal physiologic and behavioral data set of medical residents in an intensive care unit Sci Data. 2022 09 01; 9(1):536. . View in PubMed
Automated evaluation of psychotherapy skills using speech and language technologies Behav Res Methods. 2022 04; 54(2):690-711. . View in PubMed
Representation of professions in entertainment media: Insights into frequency and sentiment trends through computational text analysis PLoS One. 2022; 17(5):e0267812. . View in PubMed
Confusion2Vec 20: Enriching ambiguous spoken language representations with subwords. PLoS One. 2022; 17(3):e0264488. . View in PubMed
Causal Indicators for Assessing the Truthfulness of Child Speech in Forensic Interviews Comput Speech Lang. 2022 Jan; 71. . View in PubMed
Multi-label Multi-task Deep Learning for Behavioral Coding IEEE Trans Affect Comput. 2022 Jan-Mar; 13(1):508-518. . View in PubMed
Boys don’t cry (or kiss or dance): A computational linguistic lens into gendered actions in film PLoS One. 2022; 17(12):e0278604. . View in PubMed
Aliasing artifact reduction in spiral real-time MRI Magn Reson Med. 2021 08; 86(2):916-925. . View in PubMed
A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images Sci Data. 2021 07 20; 8(1):187. . View in PubMed
Robust diagnostic classification via Q-learning Sci Rep. 2021 06 03; 11(1):11730. . View in PubMed
Improved 3D real-time MRI of speech production Magn Reson Med. 2021 06; 85(6):3182-3195. . View in PubMed
A multimodal analysis of physical activity, sleep, and work shift in nurses with wearable sensor data Sci Rep. 2021 04 22; 11(1):8693. . View in PubMed
Romantic partner presence and physiological responses in daily life: Attachment style as a moderator Biol Psychol. 2021 04; 161:108082. . View in PubMed
Deep multiple instance learning for foreground speech localization in ambient audio from wearable devices EURASIP J Audio Speech Music Process. 2021; 2021(1):7. . View in PubMed
Automated quality assessment of cognitive behavioral therapy sessions through highly contextualized language representations PLoS One. 2021; 16(10):e0258639. . View in PubMed
Meta-learning with Latent Space Clustering in Generative Adversarial Network for Speaker Diarization IEEE/ACM Trans Audio Speech Lang Process. 2021; 29:1204-1219. . View in PubMed
A computational lens into how music characterizes genre in film PLoS One. 2021; 16(4):e0249957. . View in PubMed
Deblurring for spiral real-time MRI using convolutional neural networks Magn Reson Med. 2020 12; 84(6):3438-3452. . View in PubMed
Vocal tract shaping of emotional speech Comput Speech Lang. 2020 Nov; 64. . View in PubMed
TILES-2018, a longitudinal physiologic and behavioral data set of hospital workers Sci Data. 2020 10 16; 7(1):354. . View in PubMed
Affect Estimation with Wearable Sensors J Healthc Inform Res. 2020 Sep; 4(3):261-294. . View in PubMed
Improved real-time tagged MRI using REALTAG Magn Reson Med. 2020 08; 84(2):838-846. . View in PubMed
Variability in individual constriction contributions to third formant values in American English /? /. J Acoust Soc Am. 2020 06; 147(6):3905.. View in PubMed
How an aglossic speaker produces an alveolar-like percept without a functional tongue tip J Acoust Soc Am. 2020 06; 147(6):EL460. . View in PubMed
Improving speaker diarization for naturalistic child-adult conversational interactions using contextual information J Acoust Soc Am. 2020 02; 147(2):EL196. . View in PubMed
Clinical state tracking in serious mental illness through computational analysis of speech PLoS One. 2020; 15(1):e0225695. . View in PubMed
Towards End-2-end Learning for Predicting Behavior Codes from Spoken Utterances in Psychotherapy Conversations Proc Conf Assoc Comput Linguist Meet. 2020 Jul; 2020:3797-3803. . View in PubMed
Lessons Learned: Recommendations For Implementing a Longitudinal Study Using Wearable and Environmental Sensors in a Health Care Organization JMIR Mhealth Uhealth. 2019 12 10; 7(12):e13305. . View in PubMed
Cross-Modal Coordination of Face-Directed Gaze and Emotional Speech Production in School-Aged Children and Adolescents with ASD Sci Rep. 2019 12 04; 9(1):18301. . View in PubMed
A modular architecture for articulatory synthesis from gestural specification J Acoust Soc Am. 2019 12; 146(6):4458. . View in PubMed
Multimodal Human and Environmental Sensing for Longitudinal Behavioral Studies in Naturalistic Settings: Framework for Sensor Selection, Deployment, and Management J Med Internet Res. 2019 08 20; 21(8):e12832. . View in PubMed
Intermittently tagged real-time MRI reveals internal tongue motion during speech production Magn Reson Med. 2019 08; 82(2):600-613. . View in PubMed
3D dynamic MRI of the vocal tract during natural speech Magn Reson Med. 2019 03; 81(3):1511-1520. . View in PubMed
Task-dependence of articulator synergies J Acoust Soc Am. 2019 03; 145(3):1504. . View in PubMed
IMPROVING THE PREDICTION OF THERAPIST BEHAVIORS IN ADDICTION COUNSELING BY EXPLOITING CLASS CONFUSIONS Proc IEEE Int Conf Acoust Speech Signal Process. 2019 May; 2019:6605-6609. . View in PubMed
Identifying Therapist and Client Personae for Therapeutic Alliance Estimation Interspeech. 2019 Sep; 2019:1901-1905. . View in PubMed
Dynamic off-resonance correction for spiral real-time MRI of speech Magn Reson Med. 2019 01; 81(1):234-246. . View in PubMed
Modeling Interpersonal Linguistic Coordination in Conversations using Word Mover’s Distance Interspeech. 2019 Sep; 2019:1423-1427. . View in PubMed
ROLE SPECIFIC LATTICE RESCORING FOR SPEAKER ROLE RECOGNITION FROM SPEECH RECOGNITION OUTPUTS Proc IEEE Int Conf Acoust Speech Signal Process. 2019 May; 2019:7330-7334. . View in PubMed
Participatory methods to support team science development for predictive analytics in health J Clin Transl Sci. 2018 Jun; 2(3):178-182. . View in PubMed
Acoustic Denoising using Dictionary Learning with Spectral and Temporal Regularization IEEE/ACM Trans Audio Speech Lang Process. 2018 May; 26(5):967-980. . View in PubMed
Explaining Coronal Reduction: Prosodic Structure and Articulatory Posture Phonetica. 2018; 75(2):151-181. . View in PubMed
Computational modeling of conversational humor in psychotherapy Interspeech. 2018 Sep; 2018:2344-2348. . View in PubMed
Using Prosodic and Lexical Information for Learning Utterance-level Behaviors in Psychotherapy Interspeech. 2018 Sep; 2018:3413-3417. . View in PubMed
Feasibility of through-time spiral generalized autocalibrating partial parallel acquisition for low latency accelerated real-time MRI of speech Magn Reson Med. 2017 Dec; 78(6):2275-2282. . View in PubMed
MUPET-Mouse Ultrasonic Profile ExTraction: A Signal Processing Tool for Rapid and Unsupervised Analysis of Ultrasonic VocalizationsNeuron. 2017 May 03; 94(3):465-485. e5. . View in PubMed
Test-retest repeatability of human speech biomarkers from static and real-time dynamic magnetic resonance imaging J Acoust Soc Am. 2017 05; 141(5):3323. . View in PubMed
Characterizing Articulation in Apraxic Speech Using Real-Time Magnetic Resonance Imaging J Speech Lang Hear Res. 2017 04 14; 60(4):877-891. . View in PubMed
Predicting couple therapy outcomes based on speech acoustic features PLoS One. 2017; 12(9):e0185123. . View in PubMed
A fast and flexible MRI system for the study of dynamic vocal tract shaping Magn Reson Med. 2017 01; 77(1):112-125. . View in PubMed
Use of machine learning to improve autism screening and diagnostic instruments: effectiveness, efficiency, and multi-instrument fusion J Child Psychol Psychiatry. 2016 08; 57(8):927-37. . View in PubMed
Markov Chain Monte Carlo Inference of Parametric Dictionaries for Sparse Bayesian Approximations IEEE Trans Signal Process. 2016 Jun 15; 64(12):3077-3092. . View in PubMed
Computational Analysis and Simulation of Empathic Behaviors: a Survey of Empathy Modeling with Behavioral Signal Processing Framework Curr Psychiatry Rep. 2016 May; 18(5):49. . View in PubMed
Analysis of engagement behavior in children during dyadic interactions using prosodic cues Comput Speech Lang. 2016 May; 37:47-66. . View in PubMed
A technology prototype system for rating therapist empathy from audio recordings in addiction counseling PeerJ Comput Sci. 2016 Apr; 2. . View in PubMed
Directly data-derived articulatory gesture-like representations retain discriminatory information about phone categories Comput Speech Lang. 2016 Mar 01; 36:330-346. . View in PubMed
Strategies for Disseminating Information on Biomedical Research on Autism to Hispanic Parents J Autism Dev Disord. 2016 Mar; 46(3):1038-50. . View in PubMed
Detecting paralinguistic events in audio stream using context in features and probabilistic decisions Comput Speech Lang. 2016 Mar; 36:72-92. . View in PubMed
Advances in real-time magnetic resonance imaging of the vocal tract for speech science and technology research APSIPA Trans Signal Inf Process. 2016; 5. . View in PubMed
Head Motion Modeling for Human Behavior Analysis in Dyadic Interaction IEEE Trans Multimedia. 2015 Jul 13; 17(7):1107-1119. . View in PubMed
Applying machine learning to facilitate autism diagnostics: pitfalls and promises J Autism Dev Disord. 2015 May; 45(5):1121-36. . View in PubMed
A kinematic study of critical and non-critical articulators in emotional speech production J Acoust Soc Am. 2015 Mar; 137(3):1411-29. . View in PubMed
“Rate My Therapist”: Automated Detection of Empathy in Drug and Alcohol Counseling via Speech and Language Processing PLoS One. 2015; 10(12):e0143055. . View in PubMed
Automatic intelligibility classification of sentence-level pathological speech Comput Speech Lang. 2015 Jan; 29(1):132-144. . View in PubMed
On Quantifying Facial Expression-Related Atypicality of Children with Autism Spectrum Disorder Proc IEEE Int Conf Acoust Speech Signal Process. 2015 Apr; 2015:803-807. . View in PubMed
Developmental acoustic study of American English diphthongs J Acoust Soc Am. 2014 Oct; 136(4):1880-94. . View in PubMed
Real-time magnetic resonance imaging and electromagnetic articulography database for speech production research (TC) J Acoust Soc Am. 2014 Sep; 136(3):1307. . View in PubMed
Robust Unsupervised Arousal Rating: A Rule-Based Framework with Knowledge-Inspired Vocal Features IEEE Trans Affect Comput. 2014 Apr-Jun; 5(2):201-213. . View in PubMed
Intoxicated Speech Detection: A Fusion Framework with Speaker-Normalized Hierarchical Functionals and GMM Supervectors Comput Speech Lang. 2014 Mar 01; 28(2). . View in PubMed
Co-registration of speech production datasets from electromagnetic articulography and real-time magnetic resonance imaging J Acoust Soc Am. 2014 Feb; 135(2):EL115-21. . View in PubMed
Gestural Control in the English Past-Tense Suffix: An Articulatory Study Using Real-Time MRI Phonetica. 2014; 71(4):229-48. . View in PubMed
Barista: A Framework for Concurrent Speech Processing by USC-SAIL Proc IEEE Int Conf Acoust Speech Signal Process. 2014 May; 2014:3306-3310. . View in PubMed
Are articulatory settings mechanically advantageous for speech motor control? PLoS One. 2014; 9(8):e104168.. View in PubMed
Evaluation of swallow function after tongue cancer treatment using real-time magnetic resonance imaging: a pilot study JAMA Otolaryngol Head Neck Surg. 2013 Dec; 139(12):1312-9. . View in PubMed
Spatio-temporal articulatory movement primitives during speech production: extraction, interpretation, and validation J Acoust Soc Am. 2013 Aug; 134(2):1378-94. . View in PubMed
An investigation of articulatory setting using real-time magnetic resonance imaging J Acoust Soc Am. 2013 Jul; 134(1):510-9. . View in PubMed
Dynamic 3-D visualization of vocal tract shaping during speech IEEE Trans Med Imaging. 2013 May; 32(5):838-48. . View in PubMed
A globally-variant locally-constant model for fusion of labels from multiple diverse experts without using reference labels IEEE Trans Pattern Anal Mach Intell. 2013 Apr; 35(4):769-83. . View in PubMed
Behavioral Signal Processing: Deriving Human Behavioral Informatics From Speech and Language: Computational techniques are presented to analyze and model expressed and perceived human behavior-variedly characterized as typical, atypical, distressed, and disordered-from speech and language cues and their applications in health, commerce, education, and beyond Proc IEEE Inst Electr Electron Eng. 2013 Feb 07; 101(5):1203-1233. . View in PubMed
Paralinguistic mechanisms of production in human “beatboxing”: a real-time magnetic resonance imaging study J Acoust Soc Am. 2013 Feb; 133(2):1043-54. . View in PubMed
Statistical Methods for Estimation of Direct and Differential Kinematics of the Vocal Tract Speech Commun. 2013 Jan; 55(1):147-161. . View in PubMed
QUANTIFYING ATYPICALITY IN AFFECTIVE FACIAL EXPRESSIONS OF CHILDREN WITH AUTISM SPECTRUM DISORDERS Proc (IEEE Int Conf Multimed Expo). 2013; 2013:1-6. . View in PubMed
Improved imaging of lingual articulation using real-time multislice MRI J Magn Reson Imaging. 2012 Apr; 35(4):943-8. . View in PubMed
Recognition of physical activities in overweight Hispanic youth using KNOWME Networks J Phys Act Health. 2012 Mar; 9(3):432-41. . View in PubMed
Analyzing the Language of Therapist Empathy in Motivational Interview based Psychotherapy Signal Inf Process Assoc Annu Summit Conf APSIPA Asia Pac. 2012 Dec; 2012. . View in PubMed
Automatic speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion J Acoust Soc Am. 2011 Oct; 130(4):EL251-7. . View in PubMed
Novel 16-channel receive coil array for accelerated upper airway MRI at 3 Tesla Magn Reson Med. 2011 Jun; 65(6):1711-7. . View in PubMed
Processing speech signal using auditory-like filterbank provides least uncertainty about articulatory gestures J Acoust Soc Am. 2011 Jun; 129(6):4014-22. . View in PubMed
Flexible retrospective selection of temporal resolution in real-time speech MRI using a golden-ratio spiral view order Magn Reson Med. 2011 May; 65(5):1365-71. . View in PubMed
Modeling high-level descriptions of real-life physical activities using latent topic modeling of multimodal sensor signals Annu Int Conf IEEE Eng Med Biol Soc. 2011; 2011:6033-6. . View in PubMed
Optimal Time-Resource Allocation for Energy-Efficient Physical Activity Detection IEEE Trans Signal Process. 2011; 59(4):1843-1857. . View in PubMed
Real-time magnetic resonance imaging investigation of resonance tuning in soprano singing J Acoust Soc Am. 2010 Nov; 128(5):EL335-41. . View in PubMed
A generalized smoothness criterion for acoustic-to-articulatory inversion J Acoust Soc Am. 2010 Oct; 128(4):2162-72. . View in PubMed
Multimodal physical activity recognition by fusing temporal and cepstral information IEEE Trans Neural Syst Rehabil Eng. 2010 Aug; 18(4):369-80. . View in PubMed
Bark frequency transform using an arbitrary order allpass filter IEEE Signal Process Lett. 2010 Mar; 17(6):543-546. . View in PubMed
Pitch contour stylization using an optimal piecewise polynomial approximation IEEE Signal Process Lett. 2009 Sep; 16(9):810-813. . View in PubMed
Closure duration analysis of incomplete stop consonants due to stop-stop interaction J Acoust Soc Am. 2009 Jul; 126(1):EL1-7. . View in PubMed
Prominence Detection Using Auditory Attention Cues and Task-Dependent High Level Information IEEE Trans Audio Speech Lang Process. 2009 Jul 01; 17(5):1009-1024. . View in PubMed
Accelerated three-dimensional upper airway MRI using compressed sensing Magn Reson Med. 2009 Jun; 61(6):1434-40. . View in PubMed
Region segmentation in the frequency domain applied to upper airway real-time magnetic resonance images IEEE Trans Med Imaging. 2009 Mar; 28(3):323-38. . View in PubMed
Effect of bandwidth extension to telephone speech recognition in cochlear implant users J Acoust Soc Am. 2009 Feb; 125(2):EL77-83. . View in PubMed
Unsupervised Adaptation of Categorical Prosody Models for Prosody Labeling and Speech Recognition IEEE Trans Audio Speech Lang Process. 2009 Jan 01; 17(1):138-149. . View in PubMed
On the robustness of overall F0-only modifications to the perception of emotions in speech J Acoust Soc Am. 2008 Jun; 123(6):4547-58. . View in PubMed
Effect of spectral normalization on different talker speech recognition by cochlear implant users J Acoust Soc Am. 2008 May; 123(5):2836-47. . View in PubMed
MODELING THE INTONATION OF DISCOURSE SEGMENTS FOR IMPROVED ONLINE DIALOG ACT TAGGING Proc IEEE Int Conf Acoust Speech Signal Process. 2008; 4518789:5033-5036. . View in PubMed
FINE-GRAINED PITCH ACCENT AND BOUNDARY TONE LABELING WITH PARAMETRIC F0 FEATURES Proc IEEE Int Conf Acoust Speech Signal Process. 2008; 2008:4545-4548. . View in PubMed
AUTOMATIC CLASSIFICATION OF QUESTION TURNS IN SPONTANEOUS SPEECH USING LEXICAL AND PROSODIC EVIDENCE Proc IEEE Int Conf Acoust Speech Signal Process. 2008; 4518782:5005-5008. . View in PubMed
Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence IEEE Trans Audio Speech Lang Process. 2008 Jan; 16(1):216-228. . View in PubMed
Exploiting Acoustic and Syntactic Features for Automatic Prosody Labeling in a Maximum Entropy Framework IEEE Trans Audio Speech Lang Process. 2008; 16(4):797-811. . View in PubMed
A TOP-DOWN AUDITORY ATTENTION MODEL FOR LEARNING TASK DEPENDENT INFLUENCES ON PROMINENCE DETECTION IN SPEECH Proc IEEE Int Conf Acoust Speech Signal Process. 2008; 2008:3981-3984. . View in PubMed
A NOVEL ALGORITHM FOR UNSUPERVISED PROSODIC LANGUAGE MODEL ADAPTATION Proc IEEE Int Conf Acoust Speech Signal Process. 2008; (4518576):4181-4184. . View in PubMed
Robust Speech Rate Estimation for Spontaneous Speech IEEE Trans Audio Speech Lang Process. 2007 Nov 01; 15(8):2190-2201. . View in PubMed
An Acoustic Measure for Word Prominence in Spontaneous Speech IEEE Trans Audio Speech Lang Process. 2007 Feb 01; 15(2):690-701. . View in PubMed
Automatic acoustic synthesis of human-like laughter J Acoust Soc Am. 2007 Jan; 121(1):527-35. . View in PubMed
Synchronized and noise-robust audio recordings during realtime magnetic resonance imaging scans J Acoust Soc Am. 2006 Oct; 120(4):1791-4. . View in PubMed
Pathological voice assessment Conf Proc IEEE Eng Med Biol Soc. 2006; 2006:1669-73. . View in PubMed
An approach to real-time magnetic resonance imaging for speech production J Acoust Soc Am. 2004 Apr; 115(4):1771-6. . View in PubMed

Shrikanth Narayanan

Overview

Awards

Publications

Similar People