Explore BIRD Benchmark Databases

This page shows all available databases in the BIRD benchmark that can be used for generating incorrect SQL queries.

About the BIRD Benchmark

The BIRD benchmark is a comprehensive collection of databases with natural language questions and their corresponding SQL queries. It's designed for evaluating text-to-SQL systems across diverse schemas.

The datasets contain real-world database schemas with varying complexity levels, making them ideal for generating realistic SQL error patterns across different domains.

Available SQLite Databases

The following database schemas have SQLite files available for testing and validation. Only queries from these databases will produce fully validated results:

California Schools Toxicology Superhero Student Club
California Schools SQLite Available
89 Queries

Database with information about California schools, test scores, and meal programs.

Example Queries:
  • Q: Consider the average difference between K-12 enrollment and 15-17 enrollment of schools that are...
  • Q: Under whose administration is the school with the highest number of students scoring 1500 or...
  • Q: What is the total number of non-chartered schools in the county of Los Angeles with a percent...

+ 86 more queries in this database

View Schema
Financial
106 Queries

Banking database with accounts, clients, loans, and transactions.

No SQLite file available. Queries cannot be fully validated.
Example Queries:
  • Q: List out the account numbers of female clients who are oldest and has lowest average salary,...
  • Q: For the branch which located in the south Bohemia with biggest number of inhabitants, what is...
  • Q: For the client whose loan was approved first in 1993/7/5, what is the increase rate of his/her...

+ 103 more queries in this database

View Schema
Toxicology SQLite Available
145 Queries

Database for chemical compounds with toxicology/carcinogenicity data.

Example Queries:
  • Q: On average how many carcinogenic molecules are single bonded?
  • Q: What elements are in the TR004_8_9 bond atoms?
  • Q: What elements are in a double type bond?

+ 142 more queries in this database

View Schema
Card Games
191 Queries

Database for card games.

No SQLite file available. Queries cannot be fully validated.
Example Queries:
  • Q: What is the percentage of cards whose language is French among the Story Spotlight cards?
  • Q: What percentage of cards with format commander and legal status do not have a content warning?
  • Q: What percentage of cards without power are in French?

+ 188 more queries in this database

View Schema
Codebase Community
186 Queries

Database for codebase community.

No SQLite file available. Queries cannot be fully validated.
Example Queries:
  • Q: Which user added a bounty amount of 50 to the post title mentioning variance?
  • Q: What is the percentage difference of student badges given during 2010 and 2011?
  • Q: Among posts by Harvey Motulsky and Noah Snyder, which one has higher popularity?

+ 183 more queries in this database

View Schema
Superhero SQLite Available
129 Queries

Database containing superhero information including powers, attributes, and publishers.

Example Queries:
  • Q: Please list the superhero names of all the superheroes that have blue eyes and blond hair.
  • Q: List the superheroes from Marvel Comics who have the super power of 'Super Strength'.
  • Q: What is the percentage of superheroes who act in their own self-interest or make decisions based...

+ 126 more queries in this database

View Schema
Formula 1
174 Queries

Database for formula 1.

No SQLite file available. Queries cannot be fully validated.
Example Queries:
  • Q: Paul di Resta was in the No. 853 race, what percent faster did he finish in the 853rd race than...
  • Q: Calculate the percentage whereby Hamilton was not at the 1st track of the the f1 circuit since 2010.
  • Q: How much faster in percentage is the champion than the driver who finished the race last in the...

+ 171 more queries in this database

View Schema
European Football 2
129 Queries

Database for european football 2.

No SQLite file available. Queries cannot be fully validated.
Example Queries:
  • Q: In Scotland Premier League, which away team won the most during the 2010 season?
  • Q: At present, calculate for the player's age who have a sprint speed of no less than 97 between...
  • Q: List the long name of teams with above-average build-up play passing in 2012.

+ 126 more queries in this database

View Schema
Thrombosis Prediction
163 Queries

Database for thrombosis prediction.

No SQLite file available. Queries cannot be fully validated.
Example Queries:
  • Q: For in-patient age 50 and above, what is their average anti-cardiolipin antibody (IgG) concentration?
  • Q: The oldest SJS patient's medical laboratory work was completed on what date, and what age was...
  • Q: What is the ratio of male to female patients among all those with abnormal uric acid counts?

+ 160 more queries in this database

View Schema
Student Club SQLite Available
158 Queries

Database for university student clubs, members, and activities.

Example Queries:
  • Q: Calculate the total average cost that Elijah Allen spent in the events on September and October.
  • Q: How many times was the budget in Advertisement for "Yearly Kickoff" meeting more than "October Meeting"?
  • Q: What is the name of the social event that was attended by the vice president of the Student_Club...

+ 155 more queries in this database

View Schema
Debit Card Specializing
64 Queries

Database for debit card specializing.

No SQLite file available. Queries cannot be fully validated.
Example Queries:
  • Q: What was the difference in gas consumption between CZK-paying customers and EUR-paying customers in 2012?
  • Q: What is the difference in the annual average consumption of the customers with the least amount...
  • Q: Which of the three segments—SME, LAM and KAM—has the biggest and lowest percentage increases in...

+ 61 more queries in this database

View Schema