Samuel Roy

Logo

View My GitHub Profile

Data Scientist

Technical Skills: Python, SQL, AWS, Snowflake, MATLAB

Education

Work Experience

Data Scientist @ Toyota Financial Services (June 2022 - Present)

Data Science Consultant @ Shawhin Talebi Ventures LLC (December 2020 - Present)

Projects

Data-Driven EEG Band Discovery with Decision Trees

Developed objective strategy for discovering optimal EEG bands based on signal power spectra using Python. This data-driven approach led to better characterization of the underlying power spectrum by identifying bands that outperformed the more commonly used band boundaries by a factor of two. The proposed method provides a fully automated and flexible approach to capturing key signal components and possibly discovering new indices of brain activity.

Decoding Physical and Cognitive Impacts of Particulate Matter Concentrations at Ultra-Fine Scales

Used Matlab to train over 100 machine learning models which estimated particulate matter concentrations based on a suite of over 300 biometric variables. We found biometric variables can be used to accurately estimate particulate matter concentrations at ultra-fine spatial scales with high fidelity (r2 = 0.91) and that smaller particles are better estimated than larger ones. Inferring environmental conditions solely from biometric measurements allows us to disentangle key interactions between the environment and the body.

Talks & Lectures

Publications