Hello! I'm

ERIC VILLANUEVA

Aspiring Data Analyst & Data Scientist

CS grad · Chapman University San Francisco, CA

Add your photo here
About me

I'm a data-focused CS grad, born and raised in San Francisco, half Mexican and half Filipino, turning messy datasets into clear and decision-ready insight.

I work with Python, SQL, Tableau, Power BI, and APIs, building pipelines and analyses that people actually use. I treat data like a product: clean structure, clear logic, and outputs that make sense to non-technical stakeholders.

Outside of work, I'm building the Resale Market Price Tracker, a project at the intersection of fashion, data, and market intelligence. I care deeply about expanding access and representation in tech for Black and Latinx communities.

🥾
Outdoors and HikingBorn and raised in the Bay Area, I love being outside on trails, in parks, and anywhere that gets me off a screen.
🎵
Music ObsessedMy taste runs across all genres. Current favorites are Blood Orange, Kelela, and FKA Twigs.
📖
Currently ReadingMy favorite book is Giovanni's Room by James Baldwin. Raw, precise, and unforgettable.
♠️
Poker StudentWorking on sharpening my game. I love the strategy, probability, and psychology behind every hand.
Python
Pandas
NumPy
Scikit-learn
SQL / MySQL
R
Tableau
Power BI
Excel
Matplotlib
Seaborn
Git
Jupyter
Feature Engineering
EDA
Experience
Research & Data Operations
Jan 2025 to Present
Stealth Robotics Startup · San Francisco, CA
  • Collected and structured operational data from robotic systems performing household task automation to support ML model development and training pipelines.
  • Worked with engineering teams to apply standardized data collection protocols, keeping datasets consistent and reliable across evaluation cycles.
  • Contributed to iterative model evaluation by surfacing behavioral patterns and performance gaps through systematic data review.
Mail & Logistics Coordinator
Aug 2022 to Sep 2024
Chapman University · Orange, CA
  • Processed and sorted 350+ packages daily across Amazon and major carriers with 99% accuracy, maintaining throughput under time constraints.
  • Maintained detailed records of all incoming mail and packages to support inventory tracking and data integrity across 4+ campus buildings.
Front-End Web Development Intern
Jun 2020 to Sep 2020
Code Tenderloin · San Francisco, CA
  • Built and deployed a website using HTML, CSS, and JavaScript to highlight housing insecurity and social injustices facing San Francisco's unhoused population.
  • Documented and analyzed tech employee work culture to surface insights on industry accessibility and barriers for underrepresented communities.
Education
B.S. Computer Science, Minor in Business Administration
Chapman University · Orange, CA
Aug 2021 to May 2025
My Work
01
SQL / Analytics / Visualization
COVID-19 Global Data Analysis
  • Modeled and joined two tables across 200+ countries to analyze infection rates, death counts, and vaccination rollout over time.
  • Wrote queries using window functions, CTEs, and temp tables to track rolling vaccination totals and infection rates by country.
  • Built SQL views to store key metrics by continent and country for downstream visualization.
Tools SQL Excel Tableau
200+ countries analyzed across the full pandemic timeline
02
Machine Learning / EDA
Spotify Streams Analysis
  • Analyzed 4,600+ Spotify songs and found Playlist Count (R² = 0.74) was a significantly stronger predictor of streams than Playlist Reach (R² = 0.32).
  • Compared polynomial regression models across degrees 2, 3, and 4 on an 80/20 split; degree-2 was optimal as higher-degree models severely overfit the training data.
  • Applied GMM clustering with AIC selection and compared DBSCAN vs. K-Means on 2,600+ songs, selecting K-Means (k=2, silhouette = 0.49) for cleaner cluster results.
Tools Python Scikit-learn Pandas
4,600+ songs analyzed across regression and clustering models
03
SQL / Power BI / HR Analytics
HR Analytics Dashboard
  • Cleaned and structured 22,000+ employee records spanning 2000 to 2020 in MySQL Workbench, standardizing date formats, normalizing categorical fields, and creating calculated columns for age, tenure, and termination metrics.
  • Wrote SQL queries to aggregate workforce data by department, job title, race, gender, and state, then exported cleaned data into Power BI for visualization.
  • Built an interactive Power BI dashboard covering gender and race distribution, age group breakdowns, HQ vs. remote splits, employee count trends over time, and turnover rates by department.
Tools SQL MySQL Workbench Power BI
22,000+ employee records analyzed across a 20-year window
04
Data Engineering / Analytics
Resale Market Price Tracker
  • Built an end-to-end pipeline pulling live eBay sold listings for luxury fashion brands like Bottega Veneta and Acne Studios.
  • Parsed JSON API responses and cleaned and structured pricing data with Pandas to surface market trends for resale intelligence.
  • Actively expanding brand coverage and visualization layer for deeper price trend analysis.
Tools Python Pandas eBay API JSON
Live market data pipeline, actively in development
Get in touch

Let's build something together

Open to data analyst, data scientist, and associate PM roles, especially in fashion tech, robotics, or social impact. Born and raised in San Francisco.