Introduction
Analysis On Beauty Industry
Our goal is to bring more meaningful and multi-dimensional data to users on cosmetic brands and products so that both brands and end-consumers can make informed decisions
Data Collection: utilize different tools such as Tweepy API, XHR requests to extract product, twitter, brands, reviews and ingredient data from various sources
Data Cleaning: utilize Python and OpenRefine to clean data and make them ready to import into MySQL
Data Storage: database is stored in cloud (GCP) to allow multiple remote collaborators and MySQL Workbench was chosen as GUI tool
Data Model: EER and dimensional model created
Data Analysis: utilize SQL, Pandas, NLP libraries to perform analysis
Data Visualization On 5 Business Use Cases: Brand Perception Based On Sentiment Of Tweets Key Performance User Journey On Demo App Built On Tkinter Competitor Lookup Customer Demographic and Attribute Distribution