Visar

Associate Professor

Speech and Hearing Science
Electrical, Computer, and Energy Engineering
Arizona State University
COOR 3472
(480) 727 - 6455
visar ((at)) asu ((dot)) edu

Journal Publications

Kadetotad, D., Berisha, V., Chakrabarti, C., Seo, J. A 8.93 TOPS/W LSTM Recurrent Neural Network Accelerator Featuring Hierarchical Coarse-Grain Sparsity for On-Device Speech Recognition. IEEE Journal of Solid State Circuits. In press 2020.

Kim, SK, Chong, C, Dumkrieger, G., Ross, K., Berisha, V., Schwedt, T.. Clinical Correlates of Insomnia in Patients with Persistent Post-Traumatic Headache Compared with Migraine. The Journal of Headache and Pain. In press. 2020.

Voleti, R. N. U., Liss, J., & Berisha, V. (2019). A Review of Automated Speech and Language Features for Assessment of Cognition and Thought Disorders. IEEE Journal of Selected Topics in Signal Processing

Shah, M., Tu, M., Berisha, V., Chakrabarti, C., & Spanias, A. (2019). Articulation constrained learning with application to speech emotion recognition. EURASIP journal on audio, speech, and music processing, 2019(1), 14.

Borrie, S. A., Barrett, T. S., Willi, M. M., & Berisha, V. (2019). Syncing Up for a Good Conversation: A Clinically Meaningful Methodology for Capturing Conversational Entrainment in the Speech Domain. Journal of Speech, Language, and Hearing Research, 62(2), 283-296.

Borrie, S. A., Barrett, T. S., Liss, J. M., & Berisha, V. (2019). Sync Pending: Characterizing Conversational Entrainment in Dysarthria Using a Multidimensional, Clinically Informed Approach. Journal of Speech, Language, and Hearing Research, 1-12.

Jiao, Y., LaCross, A., Berisha, V., & Liss, J. (2019). Objective Intelligibility Assessment by Automated Segmental and Suprasegmental Listening Error Analysis. Journal of Speech, Language, and Hearing Research, 62(9), 3359-3366.

Dumkrieger, G., Chong, C. D., Ross, K., Berisha, V., & Schwedt, T. J. (2019). Static and dynamic functional connectivity differences between migraine and persistent post-traumatic headache: A resting-state magnetic resonance imaging study. Cephalalgia, 0333102419847728.

Schwedt, T. J., Peplinski, J., Garcia-Filion, P., & Berisha, V. (2019). Altered speech with migraine attacks: A prospective, longitudinal study of episodic migraine without aura. Cephalalgia, 39(6), 722-731.

Chong, C. D., Peplinski, J., Berisha, V., Ross, K., & Schwedt, T. J. (2019). Differences in fibertract profiles between patients with migraine and those with persistent post-traumatic headache. Cephalalgia, 0333102418815650.

Rutkove, S. B., Qi, K., Shelton, K., Liss, J., Berisha, V., & Shefner, J. M. (2019). ALS longitudinal studies with frequent data collection at home: study design and baseline data. Amyotrophic Lateral Sclerosis and Frontotemporal Degeneration, 20(1-2), 61-67.

Utianski, R., Sandoval, S., Berisha, V., Lansford, K., Liss, J. (2018) The effects of speech compression algorithms on the intelligibility of two individuals with dysarthric speech. American Journal of Speech-Language Pathology, 1-9.

Howard L, Dumkrieger G, Chong CD, Ross K, Berisha V, Schwedt TJ. (2018) Symptoms of Autonomic Dysfunction Amongst Those with Persistent Post-Traumatic Headache Attributed to Mild Traumatic Brain Injury: a Comparison to Migraine and Healthy Controls. Headache.

Berisha, V., Gilton, D., Baxter, L., Corman, S., Blais, C., Brewer, G., Ruston, S., Ball, H., Peter, B., Wingert, K., Rogalsky, C. (2018). Structural neural predictors of Farsi-English bilingualism. Brain and Language.

Wisler, A., Berisha, V., Spanias, A., & Hero, A. O. (2018). Direct estimation of density functionals using a polynomial basis. IEEE Transactions on Signal Processing, 66(3), 558-572.

Schwedt, T., Chong, C., Peplinski, J., Ross, K., & Berisha, V.(2017) Persistent post-traumatic headache vs. migraine: an MRI study demonstrating differences in brain structure, Journal of Headache and Pain, 18:87.

Berisha, V., Wang, S., LaCross, A., Liss, J., & Garcia-Filion, P. (2017). Longitudinal changes in linguistic complexity among professional football players. Brain and language, 169, 57-63.

Xu, Z., Skorheim, S., Tu, M., Berisha, V., Yu, S., Seo, J. S., ... & Cao, Y. (2017). Improving Efficiency in Sparse Learning with the Feedforward Inhibitory Motif. Neurocomputing.

Hsu, S. C., Jiao, Y., McAuliffe, M. J., Berisha, V., Wu, R. M., & Levy, E. S. (2017). Acoustic and perceptual speech characteristics of native Mandarin speakers with Parkinson's disease. The Journal of the Acoustical Society of America, 141(3), EL293-EL299.

Jiao, Y., Berisha, V., Liss, J., Hsu, S. C., Levy, E., & McAuliffe, M. (2016). Articulation entropy: An unsupervised measure of articulatory precision. IEEE Signal Processing Letters.

LaCross, A., Liss, J., Barragan, B., Adams, A., Berisha, V., McAuliffe, M., & Fromont, R. (2016). The role of stress and word size in Spanish speech segmentation. The Journal of the Acoustical Society of America, 140(6), EL484-EL490.

Tu, M., Wisler, A., Berisha, V., & Liss, J. M. (2016). The relationship between perceptual disturbances in dysarthric speech and automatic speech recognition performance. The Journal of the Acoustical Society of America, 140(5), EL416-EL422.

Dorman, M. F., Liss, J., Wang, S., Berisha, V., Ludwig, C., & Natale, S. C. (2016). Experiments on Auditory-Visual Perception of Sentences by Users of Unilateral, Bimodal, and Bilateral Cochlear Implants. Journal of Speech, Language, and Hearing Research, 59(6), 1505-1519.

Lansford, K. L., Berisha, V., & Utianski, R. L. (2016). Modeling listener perception of speaker similarity in dysarthria. The Journal of the Acoustical Society of America, 139(6), EL209-EL215.

Berisha, V., Wisler, A., Hero, A. O., & Spanias, A. (2016). Empirically estimable classification bounds based on a nonparametric divergence measure. IEEE Transactions on Signal Processing, 64(3), 580-591.

Jiao, Y., Berisha, V., Tu, M., & Liss, J. (2015). Convex weighting criteria for speaking rate estimation. IEEE/ACM transactions on audio, speech, and language processing, 23(9), 1421-1430.

Berisha, V., Wang, S., LaCross, A., & Liss, J. (2015). Tracking discourse complexity preceding Alzheimer's disease diagnosis: a case study comparing the press conferences of presidents Ronald Reagan and George Herbert Walker Bush. Journal of Alzheimer's Disease, 45(3), 959-963.

Berisha, V., & Hero, A. O. (2015). Empirical non-parametric estimation of the Fisher Information. IEEE Signal Processing Letters, 22(7), 988-992.

Schwedt, T. J., Berisha, V., & Chong, C. D. (2015). Temporal lobe cortical thickness correlations differentiate the migraine brain from the healthy brain. PloS one, 10(2), e0116687.

Berisha, V., & Cochran, D. (2015). Active data labeling for improved classifier generalizability. Signal Processing, 108, 272-277.

Berisha, V., Sandoval, S., Utianski, R., Liss, J., & Spanias, A. (2014). Characterizing the distribution of the quadrilateral vowel space area. The Journal of the Acoustical Society of America, 135(1), 421-427.

Sandoval, S., Berisha, V., Utianski, R. L., Liss, J. M., & Spanias, A. (2013). Automatic assessment of vowel space area. The Journal of the Acoustical Society of America, 134(5), EL477-EL483.

Krishnamoorthi, H., Spanias, A., & Berisha, V. (2009). A frequency/detector pruning approach for loudness estimation. IEEE Signal Processing Letters, 16(11), 997-1000.

Kwon, H., Berisha, V., Atti, V., & Spanias, A. (2009). Experiments with sensor motes and Java-DSP. IEEE Transactions on Education, 52(2), 257-262.

Atti, V., Spanias, A., Tsakalis, K., Panayiotou, C., Iasemidis, L., & Berisha, V. (2008). Gradient projection-based channel equalization under sustained fading. Signal Processing, 88(2), 236-246.

Berisha, V., & Spanias, A. (2007). Wideband speech recovery using psychoacoustic criteria. EURASIP Journal on Audio, Speech, and Music Processing, 2007(2), 5-5.

Spanias, A., Huang, C. W., Natarajan, A., Ferzli, R., Kwon, H., Atti, V., ... & Misra, S. (2007). Interfacing Java-DSP with a TI DSK for use in a signal processing class. Computers in Education Journal, 17(3), 27-35.


Select Peer-Reviewed Conference Publications

Li, W., Dasarathy, G., & Berisha, V. (2020). Regularization via Structural Label Smoothing. Proceedings of AISTATS 2020.

Lubold, N., Borrie, S. A., Barrett, T. S., Willi, M., & Berisha, V. (2019, January). Do Conversational Partners Entrain on Articulatory Precision?. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (Vol. 2019, pp. 1931-1935).

Voleti, R., Woolridge, S., Liss, J. M., Milanovic, M., Bowie, C. R., & Berisha, V. (2019). Objective Assessment of Social Skills Using Automated Language Analysis for Identification of Schizophrenia and Bipolar Disorder. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (Vol. 2019, pp. 1931-1935).

Xiong, Y., Berisha, V., & Chakrabarti, C. (2019, January). Residual+ Capsule Networks (ResCap) for Simultaneous Single-Channel Overlapped Keyword Recognition. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (Vol. 2019, pp. 3337-3341).

Moore, M., *Saxon, M., Venkateswara, H., Berisha, V., & Panchanathan, S. (2019, January). Say what? A dataset for exploring the error patterns that two ASR engines make. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (Vol. 2019, pp. 2528-2532).

Saxon, M., Liss, J., & Berisha, V. (2019, May). Objective Measures of Plosive Nasalization in Hypernasal Speech. In ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6520-6524). IEEE.

Voleti, R., Liss, J. M., & Berisha, V. (2019, May). Investigating the Effects of Word Substitution Errors on Sentence Embeddings. In ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 7315-7319). IEEE.

Peplinski, J., Berisha, V., Liss, J., Hahn, S., Shefner, J., Rutkove, S., ... & Shelton, K. (2019, May). Objective Assessment of Vocal Tremor. In ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6386-6390). IEEE.

Srivastava, G., Kadetotad, D., Yin, S., Berisha, V., Chakrabarti, C., & Seo, J. S. (2019, May). Joint Optimization of Quantization and Structured Sparsity for Compressed Deep Neural Networks. In ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1393-1397). IEEE.

Kadetotad, D., Berisha, V., Chakrabarti, C., & Seo, J. S. (2019, September). A 8.93-TOPS/W LSTM Recurrent Neural Network Accelerator Featuring Hierarchical Coarse-Grain Sparsity With All Parameters Stored On-Chip. In ESSCIRC 2019-IEEE 45th European Solid State Circuits Conference (ESSCIRC) (pp. 119-122). IEEE.

Dixit, A., Shankar, U., Spanias, A., Berisha, V., Banavar, M. Online Machine Learning Experiments using the new HTML5 Object Oriented Software. In Proceedings of Frontiers in Education Workshop, 2018.

Song, H., Willi, M., Thuagarajan, J., Berisha, V., and Spanias, A. (2018) Triplet network with attention for speaker diarization. In Proceedings of 2018 Interspeech Conference.

Tu, M., Grabek, A., Liss, J., Berisha V. (2018) Investigating the role of L1 in automatic pronunciation evaluation of L2 speech. In Proceedings of 2018 Interspeech Conference

Willi, M, Borrie, S., Barrett, T., Tu, M., Berisha, V. (2018) A discriminative acoustic-prosodic approach for measuring local entrainment. In Proceedings of 2018 Interspeech Conference

Wisler, A., Moon, K., & Berisha, V. (2018). Direct ensemble estimation of density functionals. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). (also available on arXiv preprint arXiv:1705.06315.)

Jiao, Y., Tu, M., Berisha, V., Liss, J. (2018) Simulating dysarthric speech for training data augmentation in clinical speech applications, In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

Kadambi, P., Mohanty, A., Ren, H., Smith, J. McGuinnes, K., Holt, K., Furtwaengler, A., Slepetys, R., Yang, Z., Seo, J., Chae, J., Cao, Y. Berisha, V. (2018) Towards a wearable cough detector based on neural networks, In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

Kadambi, P., Wisler, A., Berisha, V. (2017) Improved Finite-Sample Estimate of a Nonparametric f-Divergence.In Proc of Asilomar Conference on Signals, Systems, and Computers. IEEE.

V. Berisha, J. Liss, T. Huston, A. Wisler, Y. Jiao, and J. Eig ``Float Like a Butterfly Sting Like a Bee: Changes in Speech Preceded Parkinsonism Diagnosis for Muhammad Ali" Interspeech 2017

M. Tu, V. Berisha, and J. Liss ``Interpretable Objective Assessment of Dysarthric Speech based on Deep Neural Networks" Interspeech 2017 (accepted)

A. Wisler, V. Berisha, D. Wei, K. Ramamurthy, and A. Spanias ``Empirically-Estimable Multi-Class Classification Bounds," Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. March 2016.

M. Tu, V. Berisha, M. Woolf, J. Seo, and Y. Cao ``Ranking the Parameters of Deep Neural Networks Using the Fisher Information," Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. March 2016.

Y. Jiao, M. Tu, V. Berisha, and J. Liss ``Online Speaking Rate Estimation Using Recurrent Neural Networks," Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. March 2016.

A. Wisler, V. Berisha, K. Ramamurthy, J. Liss, and A. Spanias ``Removing Data with Noisy Responses in Regression Analysis," Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. April 2015.

A. Wisler, V. Berisha, J. Liss, and A. Spanias ``Domain Invariant Speech Features Based on a New Divergence measure,” Proceedings of IEEE Spoken Language Technology Workshop. December 2014.

V. Berisha, S. Sandoval, R. Utianski, J. Liss, and A. Spanias, ``Modeling Pathological Speech Perception From Data with Similarity Labels,” Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. May 2014.

V. Berisha, S. Sandoval, R.L. Utianski, J.M. Liss, and A. Spanias, ``Selecting Disorder-Specific Features for Speech Pathology Fingerprinting," Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. May 2013.

V. Berisha, R.L. Utianski, and J.M. Liss, ``Toward a Clinical Tool for Automatic Intelligibility Assessment," Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. May 2013.

V. Berisha, A. Javadi, D. Anderson, and A. Gray, ``Making Decisions About Unseen Data: Semi-Supervised Learning at Different Levels of Specificity," Proceedings of Asilomar Conference on Signals, Systems, and Computers. October 2010.

JJ Thiagarajan, KN Ramamurthy, P Knee, A Spanias, and V Berisha, ``Sparse representations for automatic target classification in SAR images" IEEE Conference on Communications, Control and Signal Processing, 2010"

S. Philips, V. Berisha, and A. Spanias, ``Energy-Constrained Discriminant Analysis," Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, April 2009.

V. Berisha et al., ``Sparse Manifold Learning with Applications to SAR Image Classification'' in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, April 2007.

V. Berisha, H. Kwon, and A. Spanias, ``Real-time acoustic monitoring using wireless sensor motes," in Proceedings of IEEE International Symposium on Circuits and Systems, 2006.