DataSet/DataTable - Search News

How to Tell a Good Speech Dataset for AI From a Bad One

Speech AI datasets look interchangeable until production exposes gaps in transcripts, speakers, audio conditions, licenses, ...

Tech Times

AI Chart Understanding Breakthrough: MIT-IBM Dataset Lets Small Models Beat GPT-4o

MIT and IBM released ChartNet, a 1.7-million-sample synthetic training dataset that lets compact open-source vision-language ...

C&EN

Chemists ran 50,688 reactions to make a huge open dataset

To fill in some of those gaps, Cernak and his team at the University of Michigan used ultra-high-throughput automation to ...

Wired

Harvard Is Releasing a Massive Free AI Training Dataset Funded by OpenAI and Microsoft

Harvard University announced Thursday it’s releasing a high-quality dataset of nearly 1 million public-domain books that could be used by anyone to train large language models and other AI tools. The ...

AI Watchdog: Spotify Tracks Dataset

A collection of 114,000 music tracks ripped from Spotify. The data set was assembled by an unknown AI developer on Hugging ...

VentureBeat

Table-augmented generation shows promise for complex dataset querying, outperforms text-to-SQL

AI has transformed the way companies work and interact with data. A few years ago, teams had to write SQL queries and code to extract useful information from large swathes of data. Today, all they ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results