I am currently working as a graduate research assistant under the supervision of Dr. Carter Clinton.
- Analyzing metagenomic data from the historic New York African Burial Ground using Next Generation Sequencing
and tools like Qiime2 and Kraken to reconstruct the history of the enslaved African population.
- Categorizing DNA sequences (human, bacterial, animal, etc.) using Bowtie2 and SAMTools; compare human DNA
with public databases for genealogical links and disease markers, utilizing bash, CUDA, and HPC.
I am currently working as a graduate research assistant under the supervision of Dr. Rafael Guerrero.
- Analyzing protien sequences to modify their thermostability through amino acid changes, opening up the path for advanced applications.
- Extracted protein sequence and taxonomic data from NCBI and PDB databases, cleaned the data to rectify inconsistencies and improve data integrity, transformed the raw data using techniques like merging and joining to facilitate further analysis, loaded the optimized datasets for subsequent computational processes.
- Utilized the ete3 toolkit for phylogenetic tree processing, aligning leaf labels with protein sequence data, tree pruning, and understanding unique sister clades' evolutionary significance. Concurrently, conducted statistical analyses to find correlations between amino acid changes and T growth, visualized with box plots, and designed a linear regression classifier for predicting thermostability based on protien sequences.
- Mentored 25+ students in Data Science projects for rural works organizations. Enabled hands-on experience for students using real data from these organizations, leading to valuable insights that helped the nonprofits improve their operations.
- Guided students in delivering impactful presentations, showcasing their findings and providing actionable recommendations to stakeholders.
- Developing and maintaining self-updating datasets sourced from real-time public data sources. Using APIs to connect with public data and create dynamic datasets.
- Documenting codebooks for each dataset and maintaining workflow notes to help future progress by others.
- Proposing ideas for extracting insights and projects from the datasets to guide research and match course goals.
- Leading regular meetings to update on project status, milestones, and future plans of action.
- Publishing these datasets and their insights for the wider academic community.
- Assisting NC DHHS employees in the course “Data at Work: Data Analytics in Excel and Beyond” with key topics including ETL Tools, Data Warehousing, Microsoft Excel, SQL, PowerBI, Statistics, and Data Visualization.
- Conducting regular office hours providing guidance on course concepts, lab techniques, and assisting on their capstone projects.
- Grading lab assignments and student capstone projects.
Graduate Research Assistant
-
IEC Lab NCSU
Dec 2022 - May 2023
- Collaborated with Tasmia Shahriar on the AI-based application Simstudent, using data analysis skills.
- Enhanced model accuracy through data coding, mirroring middle school perspectives.
- Improved Simstudent’s performance, benefiting many middle school students.
- Tutored and evaluated 60+ students in Python course using Minerva platform.
- Teaching Assistant for the course Probability and Statistics. Assisted in attending student queries and graded students assignments and final project to aid the professor.