sabriaouani.com

Automated World Statistics Extraction: Python, Twitter, and Google Suite Integration

Year

2023

Services

Web scraping, data transformation and reporting

Project

Dynamic

Description

The Automated World Statistics Extraction project integrates Python, Twitter, and Google Suite to turn unstructured tweets into a well-organized, analyzable format. A Python script extracts tweets containing world statistics from a dedicated Twitter feed. These tweets, which often carry data on global metrics, are then parsed and transformed into a structured format. The project targets only tweets with specific structures and data types, so the extracted information stays relevant and usable.
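As a rough illustration of the parsing step, here is a minimal sketch that uses only the Python standard library. The tweet layout (a title line followed by "Label: number" rows), the function name parse_stat_tweet, and the record fields are assumptions made for the example; the project's actual tweet formats and parsing rules are not shown on this page.

```python
import re

# Assumed layout (hypothetical): the first non-empty line is the metric title,
# every following line is "Label: number" with an optional trailing unit.
ROW_RE = re.compile(
    r"^(?P<label>[^:]+?)\s*:\s*"
    r"(?P<value>[-+]?[\d,]*\.?\d+)\s*"
    r"(?P<unit>%|[A-Za-z]+)?$"
)


def parse_stat_tweet(text):
    """Turn one tweet into a list of {metric, label, value, unit} records."""
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    if not lines:
        return []
    metric = lines[0].rstrip(":").strip()
    records = []
    for line in lines[1:]:
        match = ROW_RE.match(line)
        if match is None:
            continue  # skip hashtags, links, and other non-data lines
        records.append({
            "metric": metric,
            "label": match["label"],
            # Drop thousands separators before converting to a number.
            "value": float(match["value"].replace(",", "")),
            "unit": match["unit"] or "",
        })
    return records


sample = "Life expectancy (years):\n\nJapan: 84.8\nItaly: 84.0\n#statistics"
records = parse_stat_tweet(sample)
```

Lines that do not match the pattern are skipped rather than raising an error, which fits the idea of targeting only tweets with a known structure.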

Automation and regular data updates are a cornerstone of the project. The Python script is scheduled to run every six hours, so the extracted data stays current as new tweets are published. The extracted data is then organized and stored in Google Sheets using the gspread library. Storing it there keeps the data easily accessible and shareable among multiple users, which supports collaboration and analysis.
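The storage and scheduling steps could look roughly like the sketch below. It assumes a Google service-account credentials file and a spreadsheet shared with that account; the names append_to_sheet, run_every, HEADER, and the record fields are hypothetical. The page does not say how the six-hour schedule is implemented, so the plain loop here is only one option (a cron job or a hosted scheduler would work as well).

```python
import time

# Column order written to the sheet (an assumption for this example).
HEADER = ["metric", "label", "value", "unit"]


def records_to_rows(records, header=HEADER):
    """Convert record dicts to the list-of-lists shape that Sheets expects."""
    return [[record.get(column, "") for column in header] for record in records]


def append_to_sheet(records, sheet_name, creds_path):
    """Append records to the first worksheet of the named spreadsheet."""
    import gspread  # third-party; imported here so the helpers work without it

    client = gspread.service_account(filename=creds_path)
    worksheet = client.open(sheet_name).sheet1
    worksheet.append_rows(records_to_rows(records), value_input_option="USER_ENTERED")


def run_every(job, hours=6, max_runs=None, sleep=time.sleep):
    """Call job, wait the given number of hours, and repeat."""
    runs = 0
    while max_runs is None or runs < max_runs:
        job()
        runs += 1
        if max_runs is None or runs < max_runs:
            sleep(hours * 3600)


rows = records_to_rows([{"metric": "Life expectancy", "label": "Japan", "value": 84.8}])
```

A call such as run_every(lambda: append_to_sheet(new_records, "World Statistics", "credentials.json")) would then repeat the upload every six hours; the spreadsheet name, file name, and new_records variable are placeholders.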

Visual representation is integral to making complex datasets understandable. To that end, the project uses Plotly and Dash to generate interactive charts from the structured data. The charts are then saved as PNG images in Google Drive and embedded into Google Sheets, so users have both the numbers and the visuals in one place. Together, extraction, transformation, storage, and visualization make the unstructured data in Twitter’s feed accessible, analyzable, and actionable for decision-makers and analysts.
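The chart export could be sketched as follows, assuming records shaped like {"label": ..., "value": ...} and the plotly and kaleido packages being installed. The function names chart_inputs and save_bar_chart are hypothetical, and the choice of a bar chart is an assumption. Uploading the PNG to Google Drive and embedding it in the sheet are left out, since the page does not describe how the project does either.

```python
def chart_inputs(records, top=10):
    """Pick the top entries by value and split them into labels and values."""
    ranked = sorted(records, key=lambda record: record["value"], reverse=True)[:top]
    return [r["label"] for r in ranked], [r["value"] for r in ranked]


def save_bar_chart(records, title, png_path, top=10):
    """Render a bar chart with Plotly and write it to a PNG file."""
    import plotly.graph_objects as go  # third-party; imported lazily

    labels, values = chart_inputs(records, top)
    figure = go.Figure(data=[go.Bar(x=labels, y=values)])
    figure.update_layout(title=title)
    figure.write_image(png_path)  # static export requires the kaleido package
    return png_path


sample_records = [
    {"label": "Italy", "value": 84.0},
    {"label": "Japan", "value": 84.8},
    {"label": "France", "value": 83.1},
]
labels, values = chart_inputs(sample_records, top=2)
```

The same Figure object could also be shown interactively in a Dash app before being exported as a static image.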


Jupyter notebook

1 - Scrape tweets
2 - Clean tweets
3 - Transform tweets