Setu Shah



I am an applied research scientist interested in challenging problems that can have an impact. My primary research areas include applied natural language processing and artificial intelligence.

The following sections should help you understand my work and my experiences.

Welcome to my world.


The following includes my research that has been published as my thesis and in peer-reviewed conferences and journals.

  1. S. Shah, “Biomedical concept association and clustering using word embeddings,” Master’s thesis, Purdue School of Engineering and Technology, IUPUI, 2018. Available through Purdue Hammer and IUPUI ScholarWorks.
  2. S. Shah, Z. Ben Miled, R. Schaefer and S. Berube, “Differential Learning for Outliers: A Case Study of Water Demand Prediction,” in Appplied Sciences vol. 8, no. 11, 2018. Available through MDPI.
  3. X. Luo and S. Shah, “Concept embedding-based weighting scheme for biomedical text clustering and visualization,” in Appplied Informatics vol. 5, no. 1, 2018. Available through Springer.
  4. S. Shah, X. Luo, S. Kanakasabai, R. Tuason, and G. Klopper, “Neural Networks for Mining the Associations between Diseases and Symptoms in Clinical Notes,” in Health Information Science and Systems vol. 7, no. 1, 2018. Available through Springer.
  5. S. Shah, M. Hosseini, Z. Ben Miled, R. Schafer and S. Berube, “A water demand prediction model for Central Indiana,” in Proceedings of the Thirtieth Conference on Innovative Applications of Artificial Intelligence (IAAI ’18), New Orleans, USA, 2018. Available through AAAI Publications.
  6. S. Shah and X. Luo, “Comparison of Deep Learning based Concept Representations for Biomedical Document Clustering,” in Proceedings of 2018 IEEE International Conference on Biomedical and Health Informatics (BHI ’18), Las Vegas, USA, 2018. Available on IEEEXplore.
  7. S. Shah and X. Luo, “Exploring diseases based biomedical document clustering and visualization using self-organizing maps,” in Proceedings of the 2017 IEEE 19th International Conference on e-Health Networking, Applications and Services (Healthcom), Dalian, China, 2017. Available on IEEEXplore.
  8. S. Shah and X. Luo, “Extracting Modifiable Risk Factors from Narrative Preventive Healthcare Guidelines for EHR Integration,” in Proceedings of the 2017 IEEE 17th International Conference on Bioinformatics and Bioengineering (BIBE ’17), Washington DC, USA, 2017. Available on IEEEXplore.
  9. X. Luo, G. Zimet and S. Shah “A Natural Language Processing Framework to analyse the opinions on HPV Vaccination Reflected in Twitter over 10 Years (2008 - 2017),” in Human Vaccines & Immunotherapeutics vol. 15, no. 8, 2019. Available through Taylor & Francis.
  10. I. Terziyska, S. Shah and X. Luo, “Are Recent Terrorism Trends Reflected in Social Media?” in Proceedings of the 2017 IEEE 14th International Conference on Mobile Ad Hoc and Sensor Systems (MASS), 4th National Workshop for REU Research in Networking and Systems, Orlando, USA, 2017. Available on IEEEXplore.

Except having the honor of publications, I have also been featured in a few university briefs.


Some of the projects I've created and been a part of.

Most of the code and documentation is available on my GitHub.

  1. Apple’s COVID-19 Mobility Data for India: I spent a couple days looking at and playing with the mobility data Apple decided to start publishing. I thought it had some interesting trends (specifically for different cities) and I pondered over what was causing some of the specific changes.
  2. Integrating preventive care guidelines & EHR to provide better healthcare: My MS thesis is about improving preventive healthcare recommendations by using natural language processing. Through this project, I have succeeded in providing personalized preventive care recommendations to patients by analyzing patient EHR data and USPSTF Preventive Care guidelines.
  3. Prediction model for water demand in Central Indiana for Citizens Energy Group: I designed a parallel RNN algorithm to predict daily and monthly average water demand with a very high accuracy. My model achieved an average error rate of 1.69% for daily predictions and 2.29% for monthly predictions.
  4. Disease-based biomedical document search and retrieval using Word2Vec: I developed an algorithm that uses disease ontology for biomedical document search and retrieval. With an innovative concept weighing scheme for biomedical documents, I have overcome the problem of semantically equivalent biomedical concepts being represented using heterogeneous lexicons.
  5. Navigation tool to compute the best route based on road safety: This project is based on artificial neural networks and uses information of past fatal accidents that have occurred in USA to predict future accidents and compare various route options from location A to location B.
  6. A Home Automation and Internet of Things Solution for Indian Homes: In this project, a home automation system focusing on solving specific Indian home problems (automated passageway and room lights, keyless door lock and LPG cooking gas leakage detection and ordering system) was created.
  7. Android app to log GPS and Accelerometer data to local storage and server: An Android app that periodically collects data from the GPS and accelerometer sensors and stores it on a local buffer and if enabled, a web server.
  8. Android navigation app that computes the safest route for travel: An extension of the previous navigation tool app that computes the safest route for travel based on time of journey, fatality rate prediction and weather conditions.
  9. Simulation of various Ad-Hoc Routing Protocols using NS-3: Simulated various Mobile Ad-Hoc Networks for routing protocols like AODV, DSDV, DSR, OLSR, GPSR and Bird Flocking Routing Algorithm (BFA) using Network Simulator-3.
  10. Smart Socket: A 3-pin socket that doesn’t enable electricity supply to the appliance connected until the plug is inserted entirely, thus helping in preventing short circuits, excessive draw current and electric shocks for the user and provides protection from surge voltage, under-voltage and ground leakage protection.
  11. Sanjay Shah Seminar website: A couple of years back, when dad asked me if I would develop his website a few years ago, I took up the challenge. While I do not maintain it anymore, I created it in early 2011 and maintained it through the first half of 2016.
  12. Bhavin Shah’s website: I also help a dear friend, my mentor (and now a published author!) with creating and maintaining his website.
  13. Not Just The Talks: I realized I wasn’t fine with the way things were in my country and in the society around me. This was my attempt at making a difference through my writing. It has been quite sometime since I last wrote on there.


I have opinions.

While my real life handwriting is often described as scribble, I like to believe these are more legible. Over the course of the years, I have written quite a lot. Read it. I hope you find something you like.