Explore 26 curated free datasets for Python data engineering projects. Access data through APIs or download complete datasets for ETL pipelines, analytics, machine learning, and more.
Choose API datasets for real-time data access and dynamic applications.Choose downloadable datasets for batch processing, offline analysis, and ML training.Use search to find datasets by domain, topic, or organization.
Showing 12 of 26 datasets with #machine-learning
Tag: Machine Learning • Active filters applied
26 datasets
Retrieve data from Reddit, including posts, comments, user information and subreddit details.
Access various natural language processing models and tools provided by OpenAI.
It provides access to Wolfram Alpha's computational knowledge engine, allowing developers to obtain concise answers to factual questions and queries.
Generate random user profiles with realistic attributes such as names, addresses, phone numbers and email addresses.
An open-source database that collects information about music artists, releases, and tracks.
Explore a comprehensive database of breweries, including details like beer types, addresses, and contact information.
Retrieve real-time and historical air quality data from locations around the world.
A collection of databases, domain theories and data generators widely used by the machine learning community.
Many datasets are available on GitHub, covering diverse topics such as social media, finance and healthcare.
Various governments and organizations maintain open data portals, offering access to government statistics, geospatial data and more.
Various governments and organizations maintain open data portals, offering access to government statistics, geospatial data and more.
Wikipedia Dumps provide comprehensive snapshots of Wikipedia articles and other content in XML format.