Jay Shah

Jay Shah

PhD student
Arizona State University

Jay Shah

About Me 🥋🎙🏋🍵🎻

➥ I am a Computer Scientist and a 2nd year Ph.D. student at Arizona State University, co-advised by Dr. Baoxin Li and Dr. Teresa Wu on joint projects of ASU-Mayo Center for Innovative Imaging (AMCII).
➥ Currently focusing on developing Deep Learning models for Biomarker Discovery
   ↳ for Alzheimer's disease and Ageing research
   ↳ for Post-Traumatic Headache using multi-modal data and
   ↳ Building Interpretable AI models for healthcare applications.
➥These projects are my joint collaborations with Mayo Clinic, Barrow Neurological Institute (BNI) & Banner Alzheimer's Institute (BAI).
➥ In addition, fortunate enough to be collaborating with Drs. Todd Schwedt, Catherine Chong from Mayo Clinic, Kewei Chen, Yi Su from BAI and Yonas Geda from BNI.
➥ You can find my CV here.


  • (Abstract, submitted) Shah, J., Li, B., Su, Y., Wu, T., “Estimating Biological Age using Deep Learning as surrogate biomarker for AD Diagnosis”, Alzheimer’s Association International Conference, 2022.
  • (Abstract, submitted) Shah, J., Li, B., Su, Y., Wu, T., “Transfer Learning based Deep Encoder Decoder Network for Amyloid PET Harmonization for Small Datasets”, Alzheimer’s Association International Conference, 2022.
  • (Abstract, submitted) Siddiquee, M., Shah, J., Schwedt, T., Chong, C., Dumkrieger, Ross, K., Wu, T., “Migraine Detection and Biomarker Extraction from T1 structural MRI using Deep Learning”,
  • (Journal) Shah, J., Gao, F., Ghisays, V., Luo, J., Chen, Y., Lee, W., Zhou, Y., Li, B., Benzinger, T., Reiman, E., Chen, K., Su, Y., Wu, T., “Deep Residual Inception Encoder-Decoder Network for Amyloid PET Harmonization”, Alzheimer’s & Dementia Journal.
  • (Abstract) Shah, J., Ghisays, V., Luo, J., Chen, Y., Lee, W., Li B., Benzinger, T., Reiman, E., Chen, K., Su, Y., Wu, T., “Deep Residual Inception Encoder-Decoder Network for Amyloid PET Harmonization”, Arizona Alzheimer’s Consortium, 2021.
  • (Abstract) Shah, J., Chong, C., Schwedt, T., Berisha, V., Li, J., Ross, K., Dumkrieger, G., Zhang, J., Gaw, N., Nikolova, S., Wu, T., “Interpreting Deep Learning Model Predictions using Shapley Values”, INFORMS Annual Meeting, 2021.
  • [link] (Abstract) Shah, J., Ghisays, V., Luo, J., Chen, Y., Lee, W., Li B., Benzinger, T., Reiman, E., Chen, K., Su, Y., Wu, T., “Deep Residual Inception Encoder-Decoder Network for Amyloid PET Harmonization”, Alzheimer’s Association International Conference, 2021.
§Invited talks & Media
⟶ Three Ways Deep Learning Yields New Insights for Medical Researchers, news article
⟶ Speaking at Emerging Research Topics in Engineering (ERTE) – 2021, Topic TBA, Date Oct-2, 2021
"Landscape of Explainable AI, Interpreting Deep Learning predictions and my observations from hosting an ML Podcast" at the 4th OnCV&AI workshop arranged by the Nordling Lab @ National Cheng Kung University in Taiwan
From DA-IICT to Arizona State University and working with Nobel Laureate Frank Wilczek: Journey of Jay Shah
⟶ How AI could revolutionize biology — and vice versa, news article
⟶ Interview on growing a technical podcast at IEEE Spectrum & IEEE TV
⟶ Behind the scenes with a Machine Learning Expert : Jay Shah
⟶ AI Club, ASU: Python Workshop 2020, Convolutional Neural Networks 2020, 2021

⟶ Learning to make a good sound from ⟶ Trying to be better at Tennis ⟶ Preparing for a 21k
⟶ I have a Private Pilot License (PPL) for gliders ~~ and love to soar Arizona peaks ⛰ when weather allows!
⟶ I host a student-friendly podcast + where I talk to Machine Learning engineers, researchers and entrepreneurs exploring different applications of Artificial Intelligence in industry and academia, and getting started with it.

Work Experience

Research Assistant, Ph.D. student

May 2020 - Present
  • Deep Learning
  • Intepretability
  • Computer Vision
  • Medical Imaging

Graduate Teaching Assistant

Oct 2019 - May 2020 • 8 months

Read more... ▸ Teaching Assistant for CSE 598: Introduction to Deep Learning in Visual Computing
▸ Tutoring students on the topics: Fundamentals of Machine Learning, Neural networks & backpropagation, Optimization techniques for neural networks, Modern convolutional neural networks, Unsupervised learning & generative models and Transfer learning.

  • Deep Learning
  • Neural Networks
  • Transfer Learning
  • Visual Computing

Research Intern - Computer Vision

Jun 2019 - Aug 2019 • 3 months

Read more... ▸ Developed a closed-loop, cross-platform application for camera-based vitals monitoring of groups of people using proprietary Philips' contactless monitoring technology.
▸ Improved it's camera-based person detection & identification solutions, using state-of-the-art science and technologies to be used in Philips' clinical products in ICU and NICU.
▸ With the aim of achieving an industry-leading balance between performance and complexity ready for clinical use.

  • Deep Learning
  • Contactless Vitals monitoring
  • Computer Vision
  • Systems Programming

Graduate Research Assistant

Nov 2018 - Jun 2019 • 8 months

Read more... ▸ Collaborated with Frank Wilczek, Professor of Physics at MIT, ASU & Nobel Laureate (2004), & Nathan Newman, Professor and Lamonte H. Lawrence Chair in Solid State Science at ASU to study human color perception and how we can use Machine Learning to expand our senses.
▸ Built tools for Automatic Art Authentication system using image analysis, classification, detection, unmixing & estimation of paint pigments.

  • Machine Learning
  • Image Processing
  • Python
  • Pattern recognition

Machine Learning Engineer (Intern)

Jan 2018 - Jul 2018 • 7 months

Read more... ▸ Created sets of Machine Learning coding challenges that were used in HackerRank tests for technical recruiting and screening processes with Heraldo Memelli.
Creating and reviewing HackerRank test challenges for online contests.
▸ Researching the best practices around Software Development, Data Science, Machine Learning, coding and debugging.

Related Links: List of my Editorials to Artificial Intelligence challenges on the HackerRank website

  • Machine Learning
  • Data Science
  • Python
  • Software development
Summer 2017 • 4 months

Read more... ▸ Worked on Significance-based Large-Scale 3D Point Cloud Compression and Management with Prof. Lin Weisi on a Research Grant of S$537,696 (AcRF-Tier 2).
▸ Improved coding performance in terms of 3D feature selection, point-cloud simplification & significance evaluation according to human perception.

  • Video Comporession
  • Machine Learning
  • Image Processing

Read more... ▸ Worked on Authorship Obfuscation, rephrasing the document of author so that the software cannot recognize the original author of the document at IR-LAB, DAII-CT.
▸ Major research work in the domains of Natural Language Processing and Information Retrieval.

  • Natural Language Processing
  • Information Retrieval
  • Keras
  • Theano


Improving Performance for Distributed SGD using Ray

Spring 2020

▸ A hybrid architecture to overcome the limitations of traditional synchronous and asynchronous parameter server models by defining two new parameters: gradient staleness and pull weights rule.
▸ These parameters help build a model that is not completely synchronous or asynchronous but outperforms baseline models in terms of accuracy and training times.

  • Parameter Servers
  • Scalable Machine Learning
  • Stochastic Gradient Descent
Spring 2020

▸ Based on the error analysis on baselines models, modifications in the existing models such that the model can learn more implicit knowledge and the context.
▸ With the goal of improving baseline accuracy of existing cosmosQA task.

  • Natural Language Processing
  • BERT
  • RoBERTa

Real-Time Video Story Telling As A Service

Spring 2019

▸ An end-to-end Cloud-based Full-Stack service for the visually impaired people that provides audio descriptions of videos streamed via a mobile device.
Technologies Used: Python, Android, REST APIs, SpringBoot, Google App Engine, Google Firebase, Google Functions and Video Intelligence API.

  • Google Cloud Platform
  • Full Stack Application
  • NLP
  • Cloud Computing

Video Surveillance As A Service

Spring 2019

▸ Full-Stack Video surveillance application, automatically scales-out & scales-in on user demand & cost-effectively using AWS cloud resources.
Technologies Used: AWS (EC2, S3, EBS, Snapshot, SQS, IAM) Java, SpringBoot, Shell Scripting, REST APIs.

  • Amazon Web Services
  • Full Stack Application
  • Real time Video Processing
  • Cloud Computing

Evolution of Fake Bots on Twitter

Fall 2019

▸ Collected a large dataset of users active on topics related to the 2016 US election
▸ A cluster analysis on this data to find different types of bots using the methods proposed by Lee et al. to understand intention of each detected type is
▸ An analysis: How the types of bots have changed/appeared/disappeared throughout the years.

  • Clustering
  • Classification
  • Fake bot detectiong
  • Social Media Mining
Spring 2019

▸ An intelligent & interactive data visualization tool for analyzing the WebMD dataset to provide engaging insights to any general user.
Technologies Used: d3, Python, HTML, CSS, JavaScript, JSON, Plotly, NLTK.

  • d3
  • Data Visualization
  • HTML
  • CSS

Document Clustering and 3D Visualization

Spring 2019

▸ Visualizing trending news topics as clusters in 3D space for better analysis using interactive visualizations.
▸ Using cluster identification by topic similarity.
Technologies Used: Python, LDA, t-SNE, NMF, Plotly

  • Document Clustering
  • 3D Visualization
  • Clustering
  • LDA
  • t-SNE

Binary Neural Networks

Fall 2018

▸ Python implementation of Deterministic & Stochastic versions of BNNs, comparison of its performance to traditional NNs in terms of memory usage and computation complexity on Fashion-MNIST dataset.
▸ Achieved a significant decrease in memory usage and time taken for training.
Implementation of the paper: Courbariaux, M., Bengio, Y.: Binarynet: training deep neural networks with weights and activations constrained to + 1 or − 1. CoRR (2016)

  • Binary Neural Networds
  • Python

Aesthetic Features of an Image

Summer 2017

▸ Extraction of aesthetic features of an image and understanding human-image correlation.
▸ Research about the influential factors for an aesthetic judgement.

  • Human-Image Correlation
  • research
  • human perception

Parallel Random Forest

Dhirubhai Ambani Inst. of Info. & Comm. Technology (DA-IICT)
Fall 2017

▸ Implemented random forests using both GPUs and CPUs in order to improve performance using parallelism, reducing unnecessary data accesses and removing data redundancy.

  • GPU
  • CUDA
  • decision-trees
  • random-forest

Relevant course work and skills

Technical Courses

Data-Structures & Algorithms, Intro to Programming, Object Oriented Programming
Human-Aware Artificial Intelligence, Intro to Machine Learning, Fundamentals of Statistical Learning, Neural Networks, Intro to Digital Image Processing, Game Theory - Algorithms and Applications, Natural Language Processing, Vision and Language Fronterior
Data Mining, Social Media Mining, Data Visualization, Semantic Web Mining, Database Management Systems, Distributed and Parallel Database Systems, Software Engineering, Distributed Operating Systems, Systems Software, Cloud Computing, Computer Networks, Operating Systems, Compiler Dessign, GPU Programming, Cryptography
Stochastic Simulation, Algebraic Structures, Probability and Statistics, Discrete Mathematics, Calculus & Complex Variables, Theoretical Computer Science,
Signals and Systems, Digital Logic Design, Analog Circuits, Embedded Hardware Design

Technical Skills

Programming Languages: Python, C/C++, Java, Matlab, SQL and Shell scripting
Machine Learning & Deep Learning Frameworks: Tensorflow, PyTorch, scikit-learn, R-studio, Tableau, Gephi, OpenCV, NLTK, matplotlib, NumPy, SciPy, Pandas
Web Technologies: HTML, CSS, JavaScript, d3, Android, Amazon Web Services, Google Compute Engine, Google App Engine, SpringBoot, Git, MySQL

Extra Courses (Out of Curiosity!)

Introduction to Psychology (Yale University - Coursera),
Introduction to Philosophy, PHI 101, Arizona State University (Audit),
Introduction to Neuroeconomics: How the Brain Makes Decisions (National Research University Higher School of Economics - Coursera),
The Science of Well-Being (Yale University - Coursera)
Teaching the Violin and Viola: Creating a Healthy Foundation(Nortwestern University - Coursera)

DeepMind x UCL | Deep Learning Lectures (DeepMind - YouTube)
Convolutional Neural Networks (Convolutional Neural Networks - Coursera) and it's weekly assignments
AI for Medicine - Specialization(deeplearning.ai - Coursera) and it's weekly assignments

(Click on the images to go to the Podcast)

manish_gupta hanie_sedghi maithra_raghu hima_lakkaraju milind_tambe Po_Shen_loh What got you into AI? Michal_Drozdzal Nasrin_Mostafazadeh ishan_mishra aarti_bagul linda_zhang natasha_jacques cade_metz akshay_chaudhari Renee_Yao Azalia_Mirhoseini Geneviève_Patterson Sharon_Zhou Sharon_Zhou Sara_Hooker Bharath_Ramsundar Shreya_Shankar Harshita_Sahijwani Debarghya_Das Zongwei_Zhou Shaleen_Gupta Siddha_Ganju Andy_Harless Sudharsan_Krishnaswamy Barkha_Bhojak Vidhan_Agarwal Ajinkya_Kolhe Ganesh_Iyer Jineet_Doshi Shraddha_Patel Shashank_Bhushan Vaibhavi_Desai Devanshu_Jain

My Travel Vlogs

(Click on the thumbnails to go to the Vlog)

Japan_Vlog Singapore_Vlog Philips_Vlog 5_countries_in_a_minute roadtrip_arizona roadtrip_arizona cabin_arizona


  • * Ph.D. in Computer Science
    Arizona State University (Present)
  • M.S. in Computer Science
    Arizona State University (2020)
  • B.Tech. in Information & Communication Technology
    Dhirubhai Ambani Institute of Information & Communication Technology (2018)

Professional Activities

Additional Links

    YouTube - Webinars
    ↳ My webinars on Machine Learning for Beginners

    ↳ When I am not coding!