Search results
Results From The WOW.Com Content Network
Common Crawl is a nonprofit 501 (c) (3) organization that crawls the web and freely provides its archives and datasets to the public. [ 1][ 2] Common Crawl's web archive consists of petabytes of data collected since 2008. [ 3] It completes crawls generally every month. [ 4]
Google Dataset Search. Google Dataset Search is a search engine from Google that helps researchers locate online data that is freely available for use. [1] The company launched the service on September 5, 2018, and stated that the product was targeted at scientists and data journalists. The service was out of beta as of January 23, 2020.
Kaggle is a data science competition platform and online community for data scientists and machine learning practitioners under Google LLC.Kaggle enables users to find and publish datasets, explore and build models in a web-based data science environment, work with other data scientists and machine learning engineers, and enter competitions to solve data science challenges.
Machine learningand data mining. These datasets are used in machine learning (ML) research and have been cited in peer-reviewed academic journals. Datasets are an integral part of the field of machine learning. Major advances in this field can result from advances in learning algorithms (such as deep learning ), computer hardware, and, less ...
List of neuroscience databases. A number of online neuroscience databases are available which provide information regarding gene expression, neurons, macroscopic brain structure, and neurological or psychiatric disorders. Some databases contain descriptive and numerical data, some to brain function, others offer access to 'raw' imaging data ...
A meta search engine for 50 major bioinformatic databases and projects. Project appears to be not available anymore. Free Liebel-Lab KIT from Karlsruhe Institute of Technology: citeULike: Computer science: Not available. Ceased operations as of March 30, 2019 Free Oversity Ltd. ChemXSeer: Chemistry: The project seems abandoned in 2018 Free
As an offline reader, Kiwix works with a library of contents that are zim files: you can pick & choose whichever Wikimedia project (Wikipedia in any language, Wiktionary, Wikisource, etc.), as well as TED Talks, PhET Interactive Maths & Physics simulations, Project Gutenberg, etc. It is free and open source, and currently available for download on:
Kibble: a suite of tools for collecting, aggregating and visualizing activity in software projects. Knox: a REST API Gateway for Hadoop Services. Kudu: a distributed columnar storage engine built for the Apache Hadoop ecosystem. Kvrocks: a distributed key-value NoSQL database, supporting the rich data structure.