Publications
This section contains publications published by ISI over the past several decades.
2022
Semi-fedser: Semi-supervised learning for speech emotion recognition on federated learning using multiview pseudo-labeling
Tiantian Feng, Shrikanth Narayanan
arXiv preprint arXiv:2203.08810, 2022
Confusion2Vec 2.0: Enriching ambiguous spoken language representations with subwords
Prashanth Gurunath Shivakumar, Panayiotis Georgiou, Shrikanth Narayanan
Plos one 17 (3), e0264488, 2022
End-to-end neural systems for automatic children speech recognition: An empirical study
Prashanth Gurunath Shivakumar, Shrikanth Narayanan
Computer Speech & Language 72, 101289, 2022
A review of speaker diarization: Recent advances with deep learning
Tae Jin Park, Naoyuki Kanda, Dimitrios Dimitriadis, Kyu J Han, Shinji Watanabe, Shrikanth Narayanan
Computer Speech & Language 72, 101317, 2022
Intra-topic latency as an automated behavioral marker of treatment response in autism spectrum disorder
Elizabeth P McKernan, Manoj Kumar, Adriana Di Martino, Lisa Shulman, Alexander Kolevzon, Catherine Lord, Shrikanth Narayanan, So Hyun Kim
Scientific reports 12 (1), 3255, 2022
Special Issue on Signal Analysis for Detection and Monitoring of Contagious Diseases
Bjorn W Schuller, Yonina Eldar, Maja Pantic, Shrikanth Narayanan, Tuomas Virtanen, Jianhua Tao
IEEE Journal on Selected Topics in Signal Processing 16 (2), 2022
Intelligent signal analysis for contagious virus diseases
Björn W Schuller, Yonina Eldar, Maja Pantic, Shrikanth Narayanan, Tuomas Virtanen, Jianhua Tao
IEEE journal of selected topics in signal processing 16 (2), 159-163, 2022
Vault: Augmenting the vision-and-language transformer with the propagation of deep language representations
Georgios Chochlakis, Tejas Srinivasan, Jesse Thomason, Shrikanth Narayanan
arXiv preprint arXiv:2208.09021 6, 17, 2022
Far-field speaker verification challenge (ffsvc) 2022: Challenge evaluation plan
Xiaoyi Qin, Ming Li, Hui Bu, Shrikanth Narayanan, Haizhou Li
2022
Causal indicators for assessing the truthfulness of child speech in forensic interviews
Zane Durante, Victor Ardulov, Manoj Kumar, Jennifer Gongola, Thomas Lyon, Shrikanth Narayanan
Computer speech & language 71, 101263, 2022
Monet: Multi-scale overlap network for duplication detection in biomedical images
Ekraam Sabir, Soumyaroop Nandi, Wael AbdAlmageed, Prem Natarajan
2022 IEEE International Conference on Image Processing (ICIP), 3793-3797, 2022
Alexatm 20b: Few-shot learning using a large-scale multilingual seq2seq model
Saleh Soltan, Shankar Ananthakrishnan, Jack FitzGerald, Rahul Gupta, Wael Hamza, Haidar Khan, Charith Peris, Stephen Rawls, Andy Rosenbaum, Anna Rumshisky, Chandana Satya Prakash, Mukund Sridhar, Fabian Triefenbach, Apurv Verma, Gokhan Tur, Prem Natarajan
arXiv preprint arXiv:2208.01448, 2022
TAGPRIME: A unified framework for relational structure extraction
I Hsu, Kuan-Hao Huang, Shuning Zhang, Wenxin Cheng, Premkumar Natarajan, Kai-Wei Chang, Nanyun Peng
arXiv preprint arXiv:2205.12585, 2022
MASSIVE: A 1M-example multilingual natural language understanding dataset with 51 typologically-diverse languages
Jack FitzGerald, Christopher Hench, Charith Peris, Scott Mackie, Kay Rottmann, Ana Sanchez, Aaron Nash, Liam Urbach, Vishesh Kakarala, Richa Singh, Swetha Ranganath, Laurie Crist, Misha Britan, Wouter Leeuwis, Gokhan Tur, Prem Natarajan
arXiv preprint arXiv:2204.08582, 2022
Dialog management for multiple users
P Krishnan, A Mandal, N Strom, P Natarajan, A Rastrow, ...
US Patent App. 17/112,227, 2022
Multilingual generative language models for zero-shot cross-lingual event argument extraction
Kuan-Hao Huang, I Hsu, Premkumar Natarajan, Kai-Wei Chang, Nanyun Peng
arXiv preprint arXiv:2203.08308, 2022
A thousand words are worth more than a picture: Natural language-centric outside-knowledge visual question answering
Feng Gao, Qing Ping, Govind Thattai, Aishwarya Reganti, Ying Nian Wu, Prem Natarajan
arXiv preprint arXiv:2201.05299, 2022
Alexa teacher model
Jack FitzGerald, Shankar Ananthakrishnan, Konstantine Arkoudas, Davide Bernardi, Abhishek Bhagia, Claudio Delli Bovi, Jin Cao, Rakesh Chada, Amit Chauhan, Luoxin Chen, Anurag Dwarakanath, Satyam Dwivedi, Turan Gojayev, Karthik Gopalakrishnan, Thomas Gueudre, Dilek Hakkani-Tur, Wael Hamza, Jonathan J Hüser, Kevin Martin Jose, Haidar Khan, Beiye Liu, Jianhua Lu, Alessandro Manzotti, Pradeep Natarajan, Karolina Owczarzak, Gokmen Oz, Enrico Palumbo, Charith Peris, Chandana Satya Prakash, Stephen Rawls, Andy Rosenbaum, Anjali Shenoy, Saleh Soltan, Mukund Harakere Sridhar, Lizhen Tan, Fabian Triefenbach, Pan Wei, Haiyang Yu, Shuai Zheng, Gokhan Tur, Prem Natarajan
Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and …, 2022
Transform-retrieve-generate: Natural language-centric outside-knowledge visual question answering
Feng Gao, Qing Ping, Govind Thattai, Aishwarya Reganti, Ying Nian Wu, Prem Natarajan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
A simple and unified tagging model with priming for relational structure predictions
I-Hung Hsu, Kuan-Hao Huang, Shuning Zhang, Wenxin Cheng, Premkumar Natarajan, Kai-Wei Chang, Nanyun Peng
ArXiv, abs/2205.12585, 2022