Setu Shah
Preface
Preface
I am an applied data and research scientist interested in challenging, impactful problems. My primary research areas include applied natural language processing and artificial intelligence.
The following sections should help you understand my work and my experiences.
Welcome to my world.
Some of the projects I've created and been a part of.
Most of the code and documentation is available on my GitHub.
transformers-embeddings
: Among the various things I built at Ginger / Headspace Health was a Python library to make inference with 🤗’s exceptional transformers
library easier. I open sourced it in October, 2022 during Hacktoberfest 2022. (Blogposts on the Ginger Engineering blog, and Headspace Engineering blog.)I like to take inspiration from FOSS projects, and contribute when it makes sense. If I debug issues when I run into them, I also like to fix them.
All of my PRs and issues (on public repos) are listed here, but some of the contributions are highlighted below.
poetry
as a packaging tool in Lambda functions.I added feedstocks (conda-forge packages for the source Python packages) for:
On top of the ones I added, I also help maintain the feedstocks for:
My Master’s thesis was “Biomedical concept association and clustering using word embeddings,” which I worked on at Purdue School of Engineering and Technology, IUPUI. It is available to download through Purdue Hammer and IUPUI ScholarWorks.
The patents I have been awarded include:
The following includes my research that has been published in various journals and conferences:
Except having the honor of publications, I have also been featured in a few university briefs.
And some media coverage and presentations originating from work:
I have opinions.
While my real life handwriting is often described as scribble, I like to believe these are more legible. Over the course of the years, I have written quite a lot. Read it. I hope you find something you like.