Open Datasets

Browse Bold Outlook datasets for research, analysis, and experimentation. Public, reusable data collections with clear formats, docs, and licensing.

Open Datasets
Photo by fabio / Unsplash

Bold Outlook publishes public datasets that support research, analysis, experimentation and practical tooling. These datasets are curated or structured for reuse, with an emphasis on clarity, documentation and real-world usefulness.

Datasets may include reference data, structured exports, or project-specific collections used to support tools and open-source work.

What You’ll Find Here

  • Public datasets for analysis and prototyping.
  • Structured reference datasets used in tools and demos.
  • Practical data collections for experimentation and learning.
  • Dataset documentation (format, schema notes and intended use).

Dataset Index

This section lists available datasets. Each dataset includes documentation describing the dataset’s scope, format, and reuse terms.

  • Airports: List of 5,571 airports with the airports' ID number, name, and longitude and latitude in plain text, CSV, data and Excel file formats.
  • Airlines: List of 556 airlines' names and ID numbers in plain text, CSV, Excel and data file formats.
  • Canadian Cities: List of cities in Canada.
  • Commercial Aircrafts: List of 317 commercial aircraft's ID number and full name in plain text, data, CSV and Microsoft Excel formats.
  • Countries: List of countries in plain text, plain text (comma) and typeahead.js formats.
  • Pet Breeds: A collection of breeds for various pets, such as cats, dogs and birds.
  • Programming Languages: A list of popular programming languages.
  • Roman Catholic Popes: A list of Roman Catholic popes in chronological order.
  • Stop Words: A collection of stop words.
  • UNESCO World Heritage Sites: List of UNESCO World Heritage Sites sorted by country and alphabetically.
  • User Agents: A collection of user agents in CSV, Excel and plain text file formats.

Dataset Format & Documentation

Datasets are published as standalone downloads or as repositories (depending on format and update cadence). Each dataset includes basic documentation describing:

  • coverage and scope.
  • file formats (CSV, JSON, etc.).
  • schema/field definitions (where applicable).
  • licensing and reuse terms.
Note: some datasets may be maintained in a versioned format to support reproducible research and consistent tooling.

Licensing

Datasets are released under flexible open licenses whenever possible. Specific licensing details are listed with each dataset.

Usage

Datasets are intended to be reused in projects, prototypes, research, and educational work. Attribution is appreciated where practical, especially when datasets are modified or redistributed.