The data is updated every week. REGRESSION is a dataset directory which contains test data for linear regression. The site is heavily focused on betting, but you can find a lot of info about past football matches. Below is a list of data sets handpicked … Here you can find an archive of climate and weather data sets across the US, the largest archive of environmental data in the world. Ever want to know what drugs are prescribed by doctors in the UK? From agriculture and public safety to local government. In the Google Cloud Console, on the project selector page, select or create a Google Cloud project. Websites as Graphs shows the structure of different websites based on the tags used in their code in a color-coded map that grows on the screen as you watch. Posted by Mirko Krivanek on June 6, 2014 at 5:00pm. Read the full list if you find these examples interesting. 43. NASA Earth Data System: Since 1994, NASA’s Earth science data has been free and open to all users for any purpose. You can find data on trade, government finance, and financial development. Skip and download. It has tons of interesting data sets. There are hundreds (if not thousands) of free data sets available, ready to be used and analyzed by anyone willing to look for them. The set is used to collect demographic and clinical … Datasets and project suggestions: Below are descriptions of several data sets, and some suggested projects. 26. Australian Government Catalogue: As you might have guessed from the name, this data set focuses on the Australian government. It provides near real-time data from cool-sounding measuring instruments like a Moderate Resolution Imaging Spectroradiometer or an Atmospheric Infrared Sounder.
Luckily, there are enough people in this world who believe data should be shared as much as possible and have created ample resources to simplify things. Some of the top ones this past year are: 480,000 Rotten Tomatoes critic reviews, UC Berkeley’s Self-Driving dataset, and 1,340 coffee bean reviews. API available. To make it easier for you, we’ve divided it into a couple of categories. 47. Pew Internet: Pew Research Center is a non-partisan fact tank aggregating the most varied data sources. There are over 50 public data sets supported through Amazon’s registry, ranging from IRS filings to NASA satellite imagery to DNA sequencing to web crawling. The various data sets are organized according to themes, such as mortality, health systems, communicable and non-communicable diseases, medicines and vaccines, health risks, and so on. Academic Torrents. Loads and loads of data on about 14 different topics. The data sets also include usage examples, showing what other organizations and groups have done with the data. That is why the C# snippet below (taken from our Azure Function code) creates a KPI class instance and then sticks it into an array: // populate new KPI object instance var kpi = new KPI (onlinePlayerCnt, helpReqCnt, … Take a look at these five interesting data sets to analyze that reveal how much data is … 32. Open Data Canada: The official government portal sharing public data sets in Canada. 100+ Interesting Data Sets. Tags are somewhat intuitive in their color codes, with blue for links, red for …
31. NYC Open Data: If London isn’t your thing, you can look up the data for New York City. Novel Coronavirus 2019: time series data on cases. Ranges from real estate and global macro data to market data. Or maybe you want to know how many tourists they had compared to last quarter? Want to know who the referee was in a particular football (or soccer, depending where you’re from) game in Scotland? 6. National Institute on Drug Abuse: The National Institute on Drug Abuse (NIDA) monitors the prevalence and trends regarding drug abuse in the United States. Reddit, a popular community discussion site, has a section devoted to sharing interesting data sets. You are encouraged to select and flesh out one of these projects, or make up your own well-specified project using these datasets. 9. Instagram API: Facebook allows you to use Instagram’s API to quickly access comments, metadata, and metrics. UNICEF Research and Reports: UNICEF gathers evidence on the situation of wome… 3. Open Corporates: Open Corporates is the largest open database of companies and company data in the world. Pew Research Center offers its raw data from its fascinating research into American life. Data Set Information: * Audio track (encoded as mp3) of each of the 106,574 tracks. Here are some tips and tricks I’ve developed for finding the most interesting data sets. 2. Glassdoor Research: Glassdoor offers data related to employment. The Climate Implementation Tracker (Beta) is an Excel tool that tracks … You can explore statistics on search volume for almost any search term since 2004. Data about life in London. No offense to statistics aficionados, but definitely not us, regular people.
The World Factbook: The World Factbook provides information on the history, people, government, economy, geography, communications, transportation, military, and transnational issues for 267 world entities. All you have to do is download the dataset into a CSV file to analyze the data outside of the Google Trends webpage. See this post for more information on how to use our datasets, and contact us with any questions. Organized into themes, such as Public Services, Public … They found that most data scientists spend 60% of their time organizing and cleaning data (!). Provides weather, water, and climate data, forecasts and warnings for the protection of life and enhancement of the national economy. The data sets are separated into different categories: the distance from the earth, the types of space junk, and the size and mass of the objects. You need to find a suitable and interesting data set and determine how large and messy the data set would be. Focus on one single point you want to get across, use simple design, and visualize your data in such a way that it becomes easy to understand. UNICEF Data: Monitoring the situation of children and women. The data journalism team is full of great ideas and is constantly recognized. It’s a view into the inner workings of companies and organizations. I’m not too fond of the phrase “information age.” It sounds like someone sat down and was like, “Hey, there’s a ton of information today… what should we call it?”
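For the Google Trends CSV export mentioned above, here is a minimal Python sketch of reading such a file outside the Trends webpage. It assumes the export has a few preamble lines, a header row, and then one row per week with a date and a whole-number interest value; the sample text and the names `SAMPLE`, `load_trend`, and `peak` are made up for illustration and are not real Google Trends data or part of any Google API.

```python
import csv
import io

# Made-up sample in the shape assumed here: optional preamble lines,
# then a header row, then one row per week. Not real Google Trends data.
SAMPLE = """Category: All categories

Week,data sets: (Worldwide)
2020-01-05,40
2020-01-12,55
2020-01-19,70
"""


def load_trend(text):
    """Return a list of (date_string, interest) pairs from exported CSV text."""
    rows = []
    for record in csv.reader(io.StringIO(text)):
        # Keep only two-column rows whose second field is a whole number;
        # this skips the preamble, blank lines, and the header row.
        if len(record) == 2 and record[1].strip().isdigit():
            rows.append((record[0], int(record[1])))
    return rows


def peak(rows):
    """Return the (date, interest) pair with the highest interest value."""
    return max(rows, key=lambda pair: pair[1])


trend = load_trend(SAMPLE)
print(len(trend), "rows; peak:", peak(trend))
```

To use it on a downloaded file, read the file's contents into a string and pass that to `load_trend`. Rows whose value is not a plain whole number are silently skipped by this sketch, so check the row count against the file before drawing conclusions.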
Well, one approach might be to download this archive of 216,930 past Jeopardy questions and plug them into your favorite spaced repetition system. WRI relies on rigorous data to inform our research products and innovative solutions. The datasets are older, but still good. If you intend to go beyond the free tier, you must also enable billing. As more organizations make their data available for public access, Amazon has created a registry to find and share those various data sets. Access simply requires a brief registration on the site and credit to Pew Research Center as the source of the data, with a waiver that Pew is not responsible for alternative data conclusions. Their infographics and data visualizations are always interesting and full of value. You probably knew that by now, since it’s hard to overlook when it’s constantly in the news, it’s a growing professional field, and data skills are increasingly valuable in every job market. We are a bunch of curious and passionate people of different backgrounds and origins joined together for one mission: to empower people around the globe to tell powerful visual stories that matter. Some publications, like the Economist Intelligence Unit or Bellingcat, have built their entire reputation on their great use of data in reporting. Here is a post collecting more than 30 links to datasets available online for free. 20. NHS Digital: Provides data about the health and social care system in the UK. Cool methods of showing data in pictorial, graphical, and visual form. EOSDIS acts as a means to process and distribute Earth science data from the Earth observation satellites, aircraft, and field measurements. You can find data about how film has influenced UK culture and how much Avengers: Endgame made every other film irrelevant the week it came out.
Data visualization tools, an API, and other extraction tools are available. They are categorized into Ratings, Language, Graph, Advertising and Market Data, Computing Systems, and an appendix of other relevant data and resources available via Yahoo! These data sets might be more interesting in that fewer (or no) visualizations are available online yet, and they can lead to interesting insights. Be sure to check the box that says "Limit to Documents with Excel Spreadsheets". As individuals, we are lucky to have access to more data than ever before, as data sets continue to be made available online for free. Data Set Search Websites. On Kaggle you get data sets, kernels, and teams for discussion. You might have to brush up on your Excel skills, since the data is only available in database or CSV form. And while you don’t need to go as far as they do, you can certainly learn from them. The best ones serve as consistent seeds of creativity: scanning through the email should spark at least one new idea. Sign up and download. Pew Research Center makes its data available to the public for secondary analysis after a period of time. From endangered species to healthcare, data sets provide answers to all sorts of research questions. These data sets provide a higher level of complexity and difficulty, and help in building upon the solid basics acquired by working with simpler data sets. Open AQ’s mission is to fight air pollution. Much like the Australian Government Catalogue and … You can click on the links below to jump to your preferred section. Check out the Enron … Browse and download data sets available from select WRI websites and publications. Data sets of any type: some links.
The data set is “based originally on 5.2 million books published between 1500 and 2008.” I can imagine using it to determine the most overused, clichéd phrases, and those phrases that are in danger of becoming clichéd. NYC Open Data: Over 1,100 datasets are currently available on the NYC open data portal, more than any other U.S. city. ICPSR: Inter-university … Want to know how much the population has increased in five years? There is a spreadsheet on this main page with all of the past data sets; they’re so cool. Even if it’s household data, that is fine. The Uniform Crime Reporting compiles statistical crime reports, publications, and data points from thousands of cities, universities, states, and federal law enforcement agencies. Well, you’re in luck. Here you’ll find data about different tourism-related topics like hotels, holiday destinations, and more. DataMarket: visualize the world's economy, societies, nature, and industries with 100 million time series from the UN, World Bank, Eurostat, and other important data providers. Download Open Datasets on 1000s of Projects + Share Projects on One Platform. 41. Amazon AWS: Amazon provides an open registry of all open data on AWS. (Okay, this one I have thought about.) There are actually a lot of great government data websites on the internet. Ranging from road safety and water and sanitation to mental health. Time Series Data Library - Curated by Professor Rob Hyndman of Monash University in Australia, this is a collection of over 500 datasets containing time-series data, organized by category. 37. US Travel Association: The U.S. Travel Association is a non-profit organization representing all components of the travel industry and provides high-quality research data on tourism and transportation. The US counterpart of the AGC.
FiveThirtyEight uses hard data and statistical analysis to tell stories about politics, sports, economics, and culture. Want to know the quantity of textiles China exported to South Korea? Work on alternative datasets has to be approved by the instructors. Well, I’ve got you covered. Created in 1945, the IMF is governed by and accountable to the 189 countries that make up its near-global membership. 1. Crunchbase: Find business information about private and public companies. Statistical Science Data Sets - Large index of data sets from fully processed to raw. Example data set: Atmospheric Electricity (Lightning). Whether you’re doing a science project, creating a cool infographic, or giving a presentation, data makes everything more interesting. They also provide other resources you can use to find more data. We are committed to making this data publicly available. If we have data, let’s look at data. A robust data set is usually the first step toward answering a … Maintained on the blog is this somewhat eclectic set. You can find data about corruption, elections, and media. You can, for example, figure out how much you can save by retaining employees. With over 50,000 datasets, you’ll have no trouble finding what you need to know about the UK government. Data is about the US. They aggregate physical air quality data from public data sources provided by government, research-grade, and other sources. This is the home of the Indian Government’s open data. Here you can find detailed data from the NYC Taxi and Limousine Commission.
Interesting data points: If you use quarter 2 data for testing, you will notice something interesting: in the week ending 5/27/2011, every Dow Jones Index stock lost money. … Webscope - A reference library of … Windows Azure Data Market; Socrata Open Data; Quora; Quandl: Data Set Search (including free and premium data sets); Google Public Data Explorer; Reddit: Data is Beautiful; The World Bank; Kaggle: Platform for Predictive Modeling Competitions that come with training data sets; SNAP: Stanford Large Network Dataset Collection; … How about reading other people’s emails? 50. USDA Food Composition: The United States Department of Agriculture provides data about the composition and nutrient values of different foods. * Given the metadata, multiple problems can be explored: recommendation, genre recognition, artist identification, year prediction, music annotation, unsupervised categorization. Used by banks and governments, they pride themselves on having the most accurate data. The site contains more than 190,000 data points at time of publishing. Data from the Grand débat. Curated by: Amazon. Curated by: Pew Research Center. Quandl - This is a web-based front end to a number of public data sets. These datasets cover climate, education, energy, finance, and many more areas. Also provides data about other subjects like transportation and immigration. Environment & climate change data in African countries, reported by major international organizations such as the World Bank, WHO, and The Global Fund. 21. US Food & Drug Administration: The FDA provides data about what drugs are currently approved in the US.
The following resources may be helpful for you: * UCI Machine Learning Repository: Data Sets (37 Categorical datasets) * Large categorical dataset for regression * Categorical Data Analysis: Data Sets * Datasets for Data Mining. As the name suggests, the WHO provides data about different health-related topics. If you need help with putting your findings into form, we also have write-ups on data visualization blogs to follow and the best data visualization examples for inspiration. The homepage is full of small visualizations telling stories about each data set. 17. Centers for Disease Control (CDC): The CDC provides data on a wide variety of health-related topics like diabetes, life expectancy, cancer, and obesity. 15. World Bank: Not only does the World Bank provide financial data about countries, it also provides data on education and health.