Hello, my name is Ermias Gelaye Gaga. I am a data research analyst with over 8 years of proven experience working in a fast-paced, and dynamic digital environment,
and handling data. Possess a Master of Cognitive Science from the University of Trento, Italy, and a Certificate in Data Analytics and Visualization from the University of Toronto.
Excited to leverage my background, and training working with
a team of collaborative researchers focused on mining, analyzing, visualizing, and delivering data sourced from
heterogeneous databases.
I have worked on a variety of data projects, which include data analysis, visualization, and building data
pipelines. I will showcase some of my projects here, and link them to their GitHub repositories.
During my free time, I like to Traveling, and Photography to see my Portfolio Click the following Link, ARC-Natural Photography Portfolio.
I enjoy working on different challenges,write and code. which help sharpen my mind and improve on my problem solving skills. I like telling data stories around the data science projects that I do.
The goal of this project was to track, and visualize the top searches for common health issues in the United States, from Cancer to Diabetes, and compare them with the actual location of occurrences for those same health conditions to understand how search data reflects life for millions of Americans.The data sourced from Google Trends, Kaggle, and CDC(Center for disease control and prevention). The project used python for data cleaning and sorting, PostgreSQL for building the database, Flask API to deploy the data into the web, and to create API links.
Skills demonstrated:The frontend technologies CSS, HTML, Javascript, and different js libraries were used to create an interactive visualization dashboard.
Publication: GitHub
In this project, first I pulled data from the OpenWeatherMap API to assemble a dataset on over 500 cities.
After assembling the dataset, I used Matplotlib to plot various aspects of the weather vs. latitude. Factors I looked at
included: temperature, cloudiness, wind speed, and humidity. you can find the linked page, and GitHub repository to look
the source data, and visualizations created as part of the analysis, as well as explanations and descriptions of any
trends and correlations witnessed.
Skills demonstrated: Python, EDA, data collection, data wrangling, data visualization
Publication: GitHub
In this project, I analyzed data sourced from Pymaceuticals Inc., a burgeoning pharmaceutical company based out
of San Diego. Pymaceuticals specialize in anti-cancer pharmaceuticals. In its most recent efforts, it began screening
for potential treatments for squamous cell carcinoma (SCC), a commonly occurring form of skin cancer. The data gathered
from 249 mice with SCC tumor growth were treated through a variety of drug regimens, and compares the performance of
Pymaceuticals' drug of interest, Capomulin, versus the other treatment regimens and visualized the result.
Skills demonstrated: Python, EDA, data visualization, Regression-analysis
Publication: GitHub
The goal of this project was to predict national suicide rates from the observed personal and macroeconomic mental health risk factors. The project answered the following questions:
What are the strongest predictors for suicide in different countries?,
Also, how to build machine learning models to study and predict the findings?,
How socio-economic and demographic factors affect the country’s yearly suicide rates?,
What causes the difference in suicide risks based on geographical location?
I used Machine Learning Models, Data Exploration, and web deployment technologies for delivering the analysis results.
Skills demonstrated: Machine learning, Tableau, Python, EDA, data collection, data wrangling, data visualization
Publication: GitHub
Conducted research, and extracted business data to acquire new customers across a range of industries by using internal tools and external data sources.
Conducted data research, sentiment analysis, created reports and provided recommendations to governmental agencies on how to handle public complaints and negative reviews on social media channels by proactively mining digital and social media weekly.
For 24 weeks, I immersed myself in a data visualization and analytics program. The program equipped me with the skills and tools to work with large datasets and generate data-driven insights.
Research-focused Master's course in Cognitive Neuroscience
Psychological sciences and techniques for the analysis and evaluation of behavioral processes