Changelog

Subscribe to receive a bi-weekly summary of release notes in your inbox.

Note, the Enterprise product is updated with all public data releases below.

May 2024

5/16/24: Enterprise - Added additional SEC filings (S-1, S-4, SC-13D, SC-14D, 20-F, 40-F, 6-K, DEFM14A, PREM14A, SC-TO)

See here for table details.

5/15/24: Enterprise - Added parsed 10K/Qs and updated SEC XBRL methodology

10-K/Q sections are now parsed into plain text, HTML and JSON structure supporting LLM use cases. The data is available in the below table: sec_corporate_report_item_attributes

Also updated SEC XBRL methodology to improve data accuracy.

April 2024

4/23/24: Weather & Environmental Essentials - Added global emissions data from Climate Watch

Climate Watch's historical emissions dataset provides a record of past greenhouse gas emissions across various sectors and gases, categorized by both geographic and sector-specific divisions. The data also presents a range of future emission scenarios, each predicated on distinct assumptions regarding economic growth, technological advancements, and policy implementations. The data is available in the following tables:

  • climate_watch_attributes

  • climate_watch_timeseries

4/17/24: Enterprise - Added additional SEC filings (Form 3 & Form 5)

Form 3 and Form 5 data is available in the following tables:

  • sec_insider_trading_reporting_owners_index

  • sec_insider_trading_securities_index

4/11/24: Enterprise - Added additional SEC filings (Form 4 & Form 144)

Form 4 and Form 144 data is available in the following tables:

  • sec_form144_securities_info_index

  • sec_form144_securities_sold_index

  • sec_form144_securities_to_be_sold_index

  • sec_form4_reporting_owners_index

  • sec_form4_securities_index

4/3/24: Enterprise - Added parsed segment revenues from the SEC

Parsed/cleaned quarterly & annual segment revenues from 10-Ks and 10-Qs are now available in SEC_METRICS_TIMESERIES. This table translates the raw XBRL data from the SEC into a clean version of SEC revenue segments over time.

March 2024

3/7/24: Government Essentials - Added regional GDP, retails sales index, and safety data from the UK Office for National Statistics

Added data from the UK Office for Statistics in support of the Women In Data Safety Hackathon. Datasets added include regional GDP by quarter, retail sales index, prisoner statistics, and domestic abuse statistics. Data is available in the following tables: united_kingdom_timeseries united_kingdom_attributes

3/6/24: Cybersyn AI Utilities - Added OpenAI & Anthropic external functions; added phone number parsing functions; renamed product

Added additional functions that allow users to call out to OpenAI & Anthropic and clean/process phone numbers directly in Snowflake SQL. Update product name to "Cybersyn AI Utilities"

February 2024

2/27/24: Consumer Spending - Added zip codes & customer retention

Expanded product granularity to include zip code level data. Added customer retention data. Data is available in the following tables: CONSUMER_SPENDING_MERCHANT_RETENTION_TIMESERIES CONSUMER_SPENDING_INDUSTRY_TIMESERIES CONSUMER_SPENDING_MERCHANT_TIMESERIES

2/26/24: US Housing & Real Estate Essentials - Added the Building Permit Survey and construction spending from the US Census Bureau

Added new data from the US Census Bureau. An indicator of future construction activity, the Building Permits Survey (BPS) provides the number and valuation of new housing units authorized by building permits at the national, state, and local levels. Construction Spending data quantifies actual expenditures on construction projects, categorized into residential, non-residential, and public sectors

The data is available in the following tables: US_REAL_ESTATE_ATTRIBUTES US_REAL_ESTATE_TIMESERIES

2/21/24: Financial & Economic Essentials - Added daily Nasdaq stock prices & trading volumes; added economic and development indicators from the OECD

Launched daily trading volumes, open/close and high/low prices of all US equities and ETFs executed on the Nasdaq. Cybersyn releases volume and pricing data, inclusive of pre-market and after hours activity, from the previous trading day at 6:00am ET. Data is sourced from Databento, a market data provider that connects directly with Nasdaq.

The data is available in the following table:

STOCK_PRICE_TIMESERIES

Added global economic and development indicators by country from the OECD. Added the OECD's Social Expenditure Database which tracks key indicators of social policy spending across countries. The

The data is available in the following tables:

OECD_ATTRIBUTES OECD_TIMESERIES

2/15/24: GitHub Archive - Added language field with the programming language of the repo

Added language the github_events table that will provide the programming language (e.g. Python, SQL) of the repository

2/15/24: Government Essentials - Added global economic & growth indicators from the World Bank

Expanded coverage of the World Bank to include global and regional growth projections from the Global Economic Prospects report, international debt statistics by country (e.g. average interest on new debt commitments, public sector debt, external assets in debt instruments), global economic indicators by country (e.g. inflation, exchange rates, GDP, retails sales), and world development indicators by country (e.g. access to electricity, bank account ownership, life expectancy).

The data is available in the following tables: WORLD_BANK_ATTRIBUTES WORLD_BANK_TIMESERIES

2/15/24: Financial & Economic Essentials - Added Survey of Consumer Expectations and Household Debt & Credit reports from the New York Fed

Added quarterly household debt & credit data and monthly consumer expectations for inflation, household finances, and the labor market.

The data is available in the following tables: NEWYORKFED_CONSUMER_ATTRIBUTES NEWYORKFED_CONSUMER_TIMESERIES

2/8/24: GitHub Archive - Added last_seen field with repo name last seen date

Added a new field to the github_repos table. last_seen includes the last date a repo_name was used. This field allows you to identify the latest repo_name for a particular repo_id. The field was added as a workaround for a known issue with the source data: In the GitHub events dataset, a repository name should appear as a combination of the repository organization and name. In 2018, there are repository names that are incomplete. For example, rust-lang/ appeared without its full repository name, rust-lang/rust, for ~3 months. Additionally, in 2019, some repository names appeared with duplicate organizations (e.g. nolimits4web/nolimits4web/swiper). In both cases, GitHub implemented a go-forward fix.

2/7/24: Government Essentials - Removed legacy US Treasury revenue collections tables

Removed legacy tables: US_TREASURY_REVENUE_COLLECTIONS_ATTRIBUTES and US_TREASURY_REVENUE_COLLECTIONS_TIMESERIES.

Users still have access to revenue collections data in the US_TREASURY_ATTRIBUTES and US_TREASURY_TIMESERIES tables that were released on 12/14/23.

January 2024

1/31/24: Weather & Environmental Essentials - Removed legacy tables with data from Our World in Data and CDR.fyi

Removed the following legacy tables from Our World in Data and CDR.fyi:

  • carbon_intensity_attributes

  • carbon_intensity_timeseries

  • carbon_credit_purchase_attributes

  • carbon_credit_purchase_timeseries

Users still have access to the carbon intensity data in the OUR_WORLD_IN_DATA_ATTRIBUTES and OUR_WORLD_IN_DATA_TIMESERIES tables that were released on 12/15/23.

1/27/24: Government Essentials - Added global Environmental, Social, Governance (ESG) statistics from the World Bank

Added annual ESG statistics on global sustainability performance for 200+ countries and territories. Example country-level variables include CO2 emissions, natural resource depletion, energy use, government effectiveness, life expectancy, population with access to safely managed drinking water, and renewable electricity output.

Added governance indicators for 200+ countries and territories. The Worldwide Governance Indicators (WGI) report on six broad dimensions of governance: voice and accountability, political stability and absence of violence, government effectiveness, regulatory quality, rule of law, and control of corruption.

The data is available in the following tables:

  • WORLD_BANK_TIMESERIES

  • WORLD_BANK_ATTRIBUTES

1/26/24: LLM Training Essentials, Tech & Innovation Essentials - Added open catalog of scholarly entities and how they are connected from OpenAlex

Added OpenAlex's catalog on scholarly entities and how they are connected to each other. Entities are defined as scholarly works (e.g. journal articles, books, theses), authors, sources, affiliated organizations, topics covered, publishers, and funders. This data is derived from a wide range of sources, offering an extensive overview of academic research and its contributors.

The data is available in the following tables:

  • OPENALEX_AUTHORS_INDEX

  • OPENALEX_CONCEPTS_INDEX

  • OPENALEX_FUNDERS_INDEX

  • OPENALEX_INSTITUTIONS_INDEX

  • OPENALEX_PUBLISHERS_INDEX

  • OPENALEX_SOURCES_INDEX

  • OPENALEX_WORKS_INDEX

1/26/24: Web Traffic Foundation - Added additional AI-focused domains; improved pre-2022 model quality

Added additional domains and limited subdomains to measure traffic for popular AI companies. New domains include chat.openai.com, bard.google.com, claude.ai, and more.

Improved model quality to more accurately predict web traffic in older periods before 2022.

1/19/24: Government Essentials - Added weekly national unemployment insurance claims from the US Department of Labor

Expanded US Department of Labor coverage to include national weekly figures on initial and continuing unemployment claims filed during the previous week. Initial claims represent the number of individuals who have filed for unemployment benefits for the first time during a given week. Continuing claims indicate the number of individuals who are still receiving unemployment benefits in subsequent weeks.

The data is available in the following tables:

  • US_DEPARTMENT_OF_LABOR_UNEMPLOYMENT_INSURANCE_CLAIMS_ATTRIBUTES

  • US_DEPARTMENT_OF_LABOR_UNEMPLOYMENT_INSURANCE_CLAIMS_TIMESERIES

1/9/24: Weather & Environmental Essentials - Added global agrifood emissions from the Food and Agriculture Organization (FAO)

Added greenhouse gas (GHG) emissions related to agriculture, forestry, and other land use activities. Example variables include: 1) Total emissions generated from agrifood systems 2) Emissions associated with crop processes including rice cultivation and burning of crop residues 3) Emissions from livestock 4) Emissions from land use including forests, fires, and drained organic soils 5) Intensity of emissions by agricultural commodity.

The data is available in the following tables:

  • FOOD_AGRICULTURE_ORGANIZATION_ATTRIBUTES

  • FOOD_AGRICULTURE_ORGANIZATION_TIMESERIES

1/8/24: Government Essentials - Added US Consumer Credit, Industrial Production, Financial Accounts, and Commercial Paper from the Federal Reserve

Added the following timeseries from the Federal Reserve to the FEDERAL_RESERVE_TIMESERIES and FEDERAL_RESERVE_ATTRIBUTES tables:

  • Outstanding monthly revolving and non-revolving US consumer credit including select terms (e.g. interest rates on new car loans, personal loans, credit card plans) and major holders (e.g. credit unions, finance companies, depository institutions) - Fed Report Number G.19

  • Monthly industrial production and capacity utilization rates covering manufacturing, mining, and electric and gas utilities. The production index measures real output and the capacity index estimates sustainable potential output. - Fed Report Number G.17

  • Quarterly assets and liabilities by sector and financial instruments. Quarterly balance sheets, including net worth, for households, nonprofits and non-financial businesses. - Fed Report Number Z.1

  • Daily commercial paper issuance rates and volumes - Derived from data supplied by The Depository Trust & Clearing Corporation

  • Monthly owned and managed outstanding receivables at financial services companies. - Fed Report Number G.20

1/5/24: Government Essentials - Added weekly unemployment insurance (i.e. jobless) claims by state from the US Department of Labor

Added weekly unemployment insurance claims (i.e. jobless claims) which count the number of people filing to receive unemployment insurance benefits by state. The claims are broken into two categories - initial (number of people filing for the first time) and continued (number of people filing for ongoing unemployment benefits).

The data is available in the following tables:

  • US_DEPARTMENT_OF_LABOR_UNEMPLOYMENT_INSURANCE_CLAIMS_ATTRIBUTES describes the unemployment insurance variables tracked by the US Department of Labor

  • US_DEPARTMENT_OF_LABOR_UNEMPLOYMENT_INSURANCE_CLAIMS_TIMESERIES provides historical values for each US Department of Labor reported attribute

1/3/24: US Housing & Real Estate Essentials - Added monthly housing prices and weekly mortgage interest rates from Freddie Mac

Added Freddie Mac’s House Price Index (HPI) which provides a monthly measure of typical price inflation for single family homes within the US using data from loans purchased by Freddie Mac or Fannie Mae.

Added Freddie Mac's Primary Mortgage Market Survey (PMMS) which provides a weekly report on the national average of 30/15-year fixed rate mortgages as well as adjustable rate mortgages, tracking borrowing costs for US homebuyers and refinancers.

The data is available in the following tables:

  • FREDDIE_MAC_HOUSING_ATTRIBUTES details the Freddie Mac House Price Index and mortgage interest rate variables covered.

  • FREDDIE_MAC_HOUSING_TIMESERIES provides historical values for each reported attribute

December 2023

12/20/23: Financial & Economic Essentials - Added international economic indicators from the International Monetary Fund (IMF)

Added the below IMF indicators:

  • Primary Commodity Price System - prices of commodities by country

  • Balance of Payments - countries’ economic transactions with the rest of the world e.g. trade balances, capital flows, and official reserves

  • Fiscal Monitor - detailed government finances e.g. revenues, expenditures, transactions in financial assets/liabilities

  • International Financial Statistics - macroeconomic and financial indicators e.g. exchange rates, international liquidity, interest rates, population

  • Coordinated Portfolio Investment Survey - cross-border holdings of equity and debt securities

  • Africa Regional Economic Outlook - economic indicators for Africa

  • Asia and Pacific Regional Economic Outlook - economic indicators for the Asia and Pacific region

The data is available in the following tables:

  • INTERNATIONAL_MONETARY_FUND_ATTRIBUTES: Human-readable variable names describing the financial statistic being measured

  • INTERNATIONAL_MONETARY_FUND_TIMESERIES: Historical values for the tracked financial variables

12/19/23: Canada Government Essentials - Added month-end assets and liabilities for the Bank of Canada from Statistics Canada

Expanded coverage of Statistics Canada, Canada's national statistical office, with month-end assets and liabilities for the Bank of Canada and Chartered Banks. Added month-end currency outside bank/chartered bank deposits for the Bank of Canada.

Select assets include: Government of Canada direct and guaranteed securities (e.g., Treasury Bills, Government of Canada bonds, mortgage bonds), provincial money market securities, provincial bonds, commercial paper, and corporate bonds.

Select liabilities include: notes in circulation, Canadian dollar deposits, securities sold under repurchase agreements, derivatives, foreign currency liabilities, and equity.

The data can be found in the following tables:

  • CANADA_STATCAN_ATTRIBUTES includes a unique variable ID and a human-readable variable name. Each field represents a characteristic of the variable, including its unit of measurement, whether it's seasonally adjusted, frequency of measurement, its Statistics Canada report, and other related information

  • CANADA_STATCAN_TIMESERIES provides historical values for each variable collected by Statistics Canada by GEO_ID. Each row represents a distinct timeseries, date, and value by geographic entity.

12/19/23: Financial & Economic Essentials - Added Eurozone residential property prices & valuations, CPI indices from the European Central Bank

Added annual, quarterly, and monthly residential property prices and valuations by residential property type (e.g. houses, flats) across countries in the European Union (EU). Includes details on the source of the data, seasonal adjustment of the variable, and measurement type (e.g. average, maximum or minimum valuation, estimates of the over/undervaluation).

Added annual, quarterly, and monthly consumer prices indices by EU countries for a wide range of consumer goods (e.g. clothing, baby food, books, cigarettes).

Data is available in the following tables:

  • EUROPEAN_CENTRAL_BANK_TIMESERIES: Historical records for the tracked real estate and financial variables by European country or country group.

  • EUROPEAN_CENTRAL_BANK_ATTRIBUTES: Human-readable variable names and details describing residential property and CPI statistics.

12/18/23: Government Essentials - Added industrial development statistics on the manufacturing sector from the United Nations Industrial Development Organization (UNIDO)

Sourced from the INDSTAT 2 (2016-2023), the data provides statistics on industrial performance and trends by country for the manufacturing sector. The following variables over time, by country are included:

  • Number of establishments

  • Number of employees

  • Wages and salaries

  • Output

  • Value added

  • Gross fixed capital formation

  • Number of female employees

  • Index of industrial production

Data is available in the following tables:

  • UNITED_NATIONS_INDUSTRIAL_DEVELOPMENT_ORGANIZATION_ATTRIBUTES details each manufacturing indicator tracked by ISIC economic activity and INDSTAT yearly version.

  • UNITED_NATIONS_INDUSTRIAL_DEVELOPMENT_ORGANIZATION_TIMESERIES provides historical values on an annual basis for each UNIDO variable tracked.

12/18/23: Weather & Environmental Essentials - Added global emissions by country and industry from the European Commission's Emissions Database for Global Atmospheric Research (EDGAR)

Added global emissions of carbon dioxide (CO2), methane (CH4), nitrous oxide (N2O), and various fluorinated gases (F-gases). F-gases are man-made greenhouse gases emitted from everyday products (e.g. air conditioners, refrigerators) and industrial applications. Both biogenic (emissions that come from natural sources like soils/plants) and fossil fuel sources are included.

Data is available in the following tables:

  • EUROPEAN_COMMISSION_EDGAR_ATTRIBUTES details each emissions variable tracked by greenhouse gas type and Intergovernmental Panel on Climate Change (IPCC) industry

  • EUROPEAN_COMMISSION_EDGAR_TIMESERIES provides historical values on a monthly or annual basis for each tracked variable

12/15/23: Weather & Environmental Essentials - Added annual temperature change, emissions, and energy consumption by country/region, sector, and source from OWID

Added Our Wold in Data (OWID) metrics covering annual temperature change, emissions, and energy consumption by country/state, sector, and source to the OUR_WORLD_IN_DATA_ATTRIBUTES and OUR_WORLD_IN_DATA_TIMESERIES tables.

Variable added cover:

  • Global warming

  • Carbon Dioxide Emissions

  • Carbon Intensity

  • Emissions from Aviation

  • Energy Consumption

  • Fugitive Emissions

  • Greenhouse Gas Emissions

  • Methane Emissions

  • Nitrous Oxide Emissions

12/14/23: Tech & Innovation Essentials - Added web domain redirect relationships and domain characteristics

Added two new tables DOMAIN_REDIRECT_RELATIONSHIPS and DOMAIN_CHARACTERISTICS:

  • DOMAIN_REDIRECT_RELATIONSHIPS includes the original web domain and the domain that it redirects to as well as the start and end dates that the redirect relationship was observed. At the time of this release, the table covers ~50K domains of the 300M+ domains Cybersyn tracks

  • DOMAIN_CHARACTERISTICS details which web domains are active/inactive based on the HTTP response status of the domain and whether the domain is the primary landing page or redirects to another domain. At the time of this release, the table covers 800K+ domains that Cybersyn tracks.

12/14/23: Government Essentials - Added HQM corporate bond yields, U.S. Treasury savings bonds, TreasuryDirect securities, and interest rates for U.S. Treasury securities

Added high quality market corporate bond yields, U.S. Treasury savings bonds (issues, redemptions, maturities), securities issued in TreasuryDirect, and average interest rates on U.S. Treasury securities from the U.S Treasury.

  • The US_TREASURY_ATTRIBUTES table includes details on each variable

  • The US_TREASURY_TIMESERIES table includes historical values for each variable

12/6/23: Government Essentials - Added global labor statistics from the International Labour Organization (ILO)

Added global labor data from the International Labour Organization (ILO). Variables include employment, unemployment, population, potential labor force, labor force participation, rate of labor underutilization, working poverty rate, and annual growth rate of output per worker. Data can be cut by sex, age, occupation, status in employment, economic activity (e.g. education, utilities, construction), rural/urban areas, and economic class.

  • The INTERNATIONAL_LABOUR_ORGANIZATION_ATTRIBUTES table includes descriptions of each labor statistic

  • The INTERNATIONAL_LABOUR_ORGANIZATION_TIMESERIES table includes historical values for each ILO variable tracked

12/5/23: Financial & Economic Essentials - Added interest rates posted by major chartered banks in Canada for selected products; Added secured overnight financing rates (SOFR) and US Treasury bill rates

Added interest rates for selected products posted by the six major chartered banks in Canada to the FINANCIAL_FRED_ATTRIBUTES, FINANCIAL_FRED_TIMESERIES, and FINANCIAL_FRED_VARIABLE_SERIES_ID_CROSSWALK tables. The typical rate is calculated based on the statistical mode of the rates published by the six banks.

The posted rates cover:

  • Prime lending rate

  • Conventional mortgages

  • Guaranteed investment certificates

  • Personal daily interest savings

  • Non-checkable savings deposits

Added secured over night financing rates (SOFR) and US Treasury bill rates to the FINANCIAL_FRED_ATTRIBUTES, FINANCIAL_FRED_TIMESERIES, and FINANCIAL_FRED_VARIABLE_SERIES_ID_CROSSWALK tables.

Series added (FRED ID):

  • Secured Overnight Financing Rate (SOFR)

  • 30-Day Average Secured Overnight Financing Rate (SOFR30DAYAVG)

  • 90-Day Average Secured Overnight Financing Rate (SOFR90DAYAVG)

  • 180-Day Average Secured Overnight Financing Rate (SOFR180DAYAVG)

  • Secured Overnight Financing Rate Index (SOFRINDEX)

  • 3-Month Treasury Bill Minus Federal Funds Rate (TB3SMFFM)

  • 6-Month Treasury Bill Minus Federal Funds Rate (TB6SMFFM)

12/5/23: Canada Government Essentials - Added interest rates posted by major charted banks in Canada for selected products

Added interest rates for selected products posted by the six major chartered banks in Canada to the CANADA_STATCAN_ATTRIBUTES and CANADA_STATCAN_TIMESERIES tables. The typical rate is calculated based on the statistical mode of the rates published by the six banks.

The posted rates cover:

  • Prime lending rate

  • Conventional mortgages

  • Guaranteed investment certificates

  • Personal daily interest savings

  • Non-checkable savings deposits

12/4/23: Financial & Economic Essentials - Added Monthly State Retail Sales (MSRS) from the US Census Bureau

Added monthly state-level retail sales by NAICS codes to the FINANCIAL_FRED_ATTRIBUTES, FINANCIAL_FRED_TIMESERIES, and FINANCIAL_FRED_VARIABLE_SERIES_ID_CROSSWALK tables. The US Census Bureau publishes year-over-year percent changes for each state using a composite model of Monthly Retail Trade Survey (MRTS) data, administrative data, and third-party data beginning in January, 2019.

Year-over-year percent change estimates for the following industries are now available at the state level:

  • Total Retail Sales Excluding Nonstore Retailers

  • Building Material and Garden Equipment and Supplies Dealers

  • Clothing and Clothing Accessories Stores

  • Electronics and Appliance Stores

  • Food and Beverage Stores

  • Furniture and Home Furnishings Stores

  • Gasoline Stations

  • Health and Personal Care Stores

  • Motor Vehicle and Parts Dealers

  • Sporting Good, Hobby, Musical Instrument and Book Stores

  • Miscellaneous Stores

November 2023

11/30/23: Government Essentials - Added global trade, tariff, and import relationship data from the World Trade Organization (WTO)

Added global trade flows, imposed tariffs, and trade interactions between countries from the World Trade Organization (WTO) . The data details export and import figures for goods and services across different countries and regions, tariff rates and structures that WTO member countries apply to imports from other nations, trade dependencies between countries, and the balance of trade between specific pairs of nations.

  • The WORLD_TRADE_ORGANIZATION_ATTRIBUTES table details the global trade, tariff, and import relationship statistics tracked by the World Trade Organization (WTO).

  • The WORLD_TRADE_ORGANIZATION_TIMESERIES table provides timeseries values by date for the reported trade statistics by country, country group, and global region (as defined by the World Trade Organization).

11/30/23: Government Essentials - Added global health indicators from the World Health Organization (WHO)

Added 1,100+ health-related indicators for 194 members of the World Health Organization (WHO) and their associated country groups and global regions. Example metrics include alcohol consumption among adolescents and adults, tobacco control policies, abortion rates, accessibility of dementia care services, and adolescent fertility rates. Environmental health indicators including air pollution's impact on mortality rates and disability-adjusted life years (DALYs) as well as deaths attributable to the environment are also included.

  • The WORLD_HEALTH_ORGANIZATION_ATTRIBUTES table details the health statistics tracked by the World Health Organization (WHO).

  • The WORLD_HEALTH_ORGANIZATION_TIMESERIES table provides timeseries values by date for the reported health indicators by country, country group, or global region (as defined by global organizations like UNICEF, the United Nations, the World Bank, and the World Health Organization).

11/30/23: Government Essentials, Financial & Economic Essentials - Added country groups and regions from the WHO, WTO, and UN to geography tables

Added additional country groups and geography types to the GEOGRAPHY_INDEX from the World Health Organization (WHO), World Trade Organization (WTO), and United Nations (UN). The member countries of the added geographies are mapped in the GEOGRAPHY_RELATIONSHIPS table. Select new geographic regions include:

  • BRICS members

  • World Trade Organization (WTO) members

  • Association of Southeast Asian Nations (ASEAN)

  • UNICEF regions

  • United Nations regions

  • United Nations Sustainable Development Goal (SDG) regions

  • World Bank regions

  • World Health Organization (WHO) regions and income regions

  • World Bank regions and income groups

11/28/23: Government Essentials - Added US agricultural export sales data from the USDA

Added US Export Sales Reporting (ESR) data on weekly export sales activity for 40+ US agricultural commodities sold abroad from the US Department of Agriculture's (USDA) Foreign Agricultural Service (FAS).

  • The US_DEPARTMENT_OF_AGRICULTURE_COMMODITIES_ATTRIBUTES now includes the export of commodities in addition to the existing production, supply, and distribution variables.

  • The US_DEPARTMENT_OF_AGRICULTURE_COMMODITIES_TIMESERIES table provides the reported metrics for each commodity by GEO_ID.

11/22/23: Financial & Economic Essentials - Added data from the Bank for International Settlements (BIS) on global banking conditions, property prices, and consumer price indicators

Added data from the Bank for International Settlements (BIS) on global banking conditions, property prices, and consumer price indicators

The Bank for International Settlements (BIS) is an international financial institution owned by 60+ central banks that represent countries accounting for ~95% of global GDP. As part of its mission is to support international monetary and financial cooperation, the BIS acts as a bank for central banks across the world.

Added the below BIS data to two new tables, BANK_FOR_INTERNATIONAL_SETTLEMENTS_ATTRIBUTES, which describes the metrics tracked by BIS, and BANK_FOR_INTERNATIONAL_SETTLEMENTS_TIMESERIES, which provides the values of metrics:

  • Residential property prices

  • Central bank policy rates

  • Consumer price indicators

  • Assets and liabilities of internationally active banks, including credit to the non-financial sector as well as the geographical and currency composition of a bank’s balance sheet

11/22/23: Consumer Spending Foundation - Added spend by geography for states and top 100 CBSAs. Added online/offline spend breakdown

Expanded the granularity of values in the CONSUMER_SPENDING_TIMESERIES table to include breakdown by geography. Geographies are grouped both by purchaser primary location (i.e. where the spender lives) and purchase location (i.e. where the physical store is located). The data covers US states and the top 100 core-based statistical areas (“CBSAs”) as measured by population.

  • Purchaser geographies are represented in the PURCHASER_PRIMARY_GEO_ID and PURCHASER_PRIMARY_GEO_NAME fields

  • Purchase location geographies are represented in PURCHASE_LOCATION_GEO_ID and PURCHASE_LOCATION_GEO_NAME fields

  • PURCHASER_PRIMARY_GEO_ID and PURCHASER_PRIMARY_GEO_ID can be used to join the data with Cybersyn’s geography tables such as the GEOGRAPHY_INDEX based on the GEO_ID field

Added purchase channels covering online and offline spend.

  • Purchase channels are reflected in the CHANNEL field in the CONSUMER_SPENDING_ATTRIBUTES table and included in the VARIABLE_NAME values in the CONSUMER_SPENDING_TIMESERIES table

Note that data for “purchase location” geographies only includes offline spend.

11/20/23: Consumer Spending Foundation - Added aggregations by NAICS codes and by 4-5-4 retail months

Expanded the CONSUMER_SPENDING_TIMESERIES and the CONSUMER_SPENDING_ATTRIBUTES tables to include aggregations by NAICS (North American Industry Classification System) and 4-5-4 retail calendar months.

Deprecated MCC_CODE, MCC_CODE_DESCRIPTION and MART_VARIABLE fields in the the CONSUMER_SPENDING_ATTRIBUTES table and replaced them with AGGREGATION_TYPE and AGGREGATION_VALUE.The newly added fields can be used to filter to a desired level of aggregation including to the NAICS, MARTS Segment, and MCC levels.

Additionally, the following fields were added in anticipation of upcoming dataset expansions. Note that these fields only contain total values for now. Future iterations of the product will include more granular cuts of data.

  • Added COMPANY_NAME, PURCHASER_PRIMARY_GEO_ID, PURCHASER_PRIMARY_GEO_NAME, PURCHASE_LOCATION_GEO_ID, and PURCHASE_LOCATION_GEO_NAME to CONSUMER_SPENDING_TIMESERIES

  • Added CHANNEL to CONSUMER_SPENDING_ATTRIBUTES

11/20/23: Financial & Economic Essentials - Added 10 timeseries from FRED covering mortgage rates and additional CPI measures

Added 10 additional timeseries from FRED to the financial_fred_timeseries and financial_fred_attributes tables:

  • MORTGAGE15US: 15-Year Fixed Rate Mortgage Average in the United States

  • MORTGAGE30US: 30-Year Fixed Rate Mortgage Average in the United States

  • CUSR0000SEFV: Consumer Price Index for All Urban Consumers: Food Away from Home in U.S. City Average, Seasonally Adjusted

  • CUUR0000SEFV: Consumer Price Index for All Urban Consumers: Food Away from Home in U.S. City Average, Not Seasonally Adjusted

  • CUSR0000SETG01: Consumer Price Index for All Urban Consumers: Airline Fares in U.S. City Average, Seasonally Adjusted

  • CUUR0000SETG01: Consumer Price Index for All Urban Consumers: Airline Fares in U.S. City Average, Not Seasonally Adjusted

  • CUSR0000SS62031: Consumer Price Index for All Urban Consumers: Admission to Movies, Theaters, and Concerts in U.S. City Average, Seasonally Adjusted

  • CUUR0000SS62031: Consumer Price Index for All Urban Consumers: Admission to Movies, Theaters, and Concerts in U.S. City Average, Not Seasonally Adjusted

  • CUSR0000SEHB: Consumer Price Index for All Urban Consumers: Lodging Away from Home in U.S. City Average, Seasonally Adjusted

  • CUUR0000SEHB: Consumer Price Index for All Urban Consumers: Lodging Away from Home in U.S. City Average, Not Seasonally Adjusted

11/20/23: Government Essentials - Expanded American Community Survey (ACS) history for 1,400+ population variables since 2005 for ~500K geographies

Added historical data from the American Community Survey (ACS) to the AMERICAN_COMMUNITY_SURVEY_ATTRIBUTES and AMERICAN_COMMUNITY_SURVEY_TIMESERIES tables for over 1,400 population variables dating back to 2005 at the following geographic entity levels: country, states, counties, cities, zip codes, core-based statistical areas (CBSAs), census tracts, and census block groups. Example population variable additions include age, race, income, employment status, immigration status, and household status.

Data is as up to date as the latest ACS publication.

11/16/23: Weather & Environmental Essentials - Added 6 tables covering disaster declaration and National Flood Insurance Program (NFIP) insurance data from FEMA

Added Federal Emergency Management Agency (FEMA) data on federally-declared disasters in the United States, disaster recovery public programs, and the National Flood Insurance Program (NFIP) insurance policies and claims. Six new tables were included in the release:

  • FEMA_DISASTER_DECLARATION_INDEX - Details for each federally-declared disaster (e.g. name, type, date, public assistance funding amounts). Table is unique by DISASTER_ID.

  • FEMA_DISASTER_DECLARATION_AREAS_INDEX - Geographic entities (e.g. counties, cities) impacted by each federally-declared disaster. A disaster declaration can include multiple geographic locations.

  • FEMA_MISSION_ASSIGNMENT_INDEX - Work orders issued by FEMA since 2012 to other government agencies, supporting emergency response activation across the US (e.g. requesting transportation support from the US Department of Transportation during a hurricane).

  • FEMA_NATIONAL_FLOOD_INSURANCE_PROGRAM_CLAIM_INDEX - Details on National Flood Insurance Program (NFIP) claims including features of the insured property, information on the flood event precipitating the claim, the cost of the damage, and subsequent insurance payout amounts.

  • FEMA_NATIONAL_FLOOD_INSURANCE_PROGRAM_POLICY_INDEX - Details on National Flood Insurance Program (NFIP) policies including features of the insured property, building and contents insurance coverage, deductibles, rates, and policy durations.

  • FEMA_REGION_INDEX - Human readable names and details for each FEMA Region

11/13/23: Canada Government Essentials - Added 6 timeseries from Statistics Canada covering Canadian debt securities and real estate development

Added 6 new timeseries to the CANADA_STATCAN_TIMESERIES and CANADA_STATCAN_ATTRIBUTES tables from 3 of Statistics Canada’s underlying sources:

  • Bank of Canada: Government of Canada debt gross new issues, retirements and net new issues, and par values

  • Annual Survey of Service Industries: Real estate agents, brokers, and appraisers summary statistics (e.g. salaries, wages, operating revenue)

  • Canada Mortgage and Housing Corporation:

    • Absorptions and unabsorbed inventory, newly completed dwellings, by type of dwelling unit in select census metropolitan areas

    • Absorptions and unabsorbed inventory, newly completed dwellings, by type of dwelling unit in census agglomerations of 50K+

    • Housing starts, under construction and completions in select census metropolitan areas

    • Housing starts, under construction and completions in census agglomerations of 50K+

11/7/23: Cybersyn Public Domain Pro - Added point-in-time history for 45 tables

Added point-in-time history tables for the following Cybersyn datasets.

  • American Community Survey

  • Canada Statcan

  • Carbon Credit Purchases

  • Carbon Intensity

  • Company Index and Characteristics

  • Data Commons

  • Domain Index

  • FHFA

  • Geography Characteristics

  • Home Mortgage Disclosures

  • IMEI

  • IRS

  • NOAA Weather Stations and Metrics

  • OpenFIGI Security Index

  • PermID Security Index

  • Points of Interest (POIs)

  • Urban Crime

  • US Addresses

  • USDA

  • US Treasury

  • USPS Address Changes

11/6/23: Consumer Spending Foundation - Added nominal value projections (dollar-level figures)

Expanded values in the CONSUMER_SPENDING_TIMESERIES table to include nominal estimates. New measures include estimates for revenue ($), transactions (#), and average order value ($). These newly added variables build on the existing year-over-year (%) estimates for those measures already in the table.

Nominal values can be used to measure market share or compare average transaction amounts across retailers. Because these estimates are based on a panel of consumer spending, they are not meant to be projections of the entirety of US spending but they are accurate as a measure of relative spend. For example, the sum of all Chipotle spend will not equal the company's actual spend but Chipotle's market share relative to McDonald's should be correct.

The newly-added variables in the timeseries table include matching variables in the CONSUMER_SPENDING_ATTRIBUTES table with the new variables being MEASURE values of Revenue, Transactions, and AOV.

11/3/23: Consumer Spending Foundation, Government Essentials - Added CALENDAR_INDEX table

Added the CALENDAR_INDEX table which compiles common calendars into a single table. Each calendar type has a unique CALENDAR_ID, which allows users to select which calendar type they want to use. Individual periods within each calendar type include period start and end dates.

The CALENDAR_INDEX currently includes regular calendar periods (days, weeks, months, quarters, and years) and 4-5-4 retail calendar periods (4-5-4 retail months, quarters, and years).

The 4-5-4 retail calendar is a standardized accounting and reporting calendar system used by many retailers, where each fiscal year is divided into 13 weeks, aiming to align with seasonal variations and facilitate more accurate financial comparisons.

11/3/23: Government Essentials - Added global agricultural commodity production and distribution data from the USDA

Added two tables sourced from the US Department of Agriculture's (USDA) Foreign Agricultural Service (FAS) which provides production, supply, and distribution data on agricultural commodities for both the United States and other producing and consuming countries since 1960.

  • us_department_of_agriculture_commodities_attributes describes the production, supply, and distribution metrics tracked for each commodity by the USDA.

  • us_department_of_agriculture_commodities_timeseries table provides the reported metrics for each commodity and country.

October 2023

10/25/23 - Financial & Economic Essentials - Added detailed branch-level data and Summary of Deposits (SOD) data from the FDIC

The Summary of Deposits (SOD) data from the FDIC is an annual survey, capturing branch-level deposits as of June 30 for all FDIC-insured institutions, including U.S. branches of foreign banks.

The fdic_branch_locations_indextable provides details on FDIC-insured bank branches, including branch-specific location information as well as institution-level regulatory and insurance data.

The fdic_summary_of_deposits_attributes table describes the deposit types tracked by the Summary of Deposits (SOD) survey.

The fdic_summary_of_deposits_timeseries table provides the results of the annual Summary of Deposits (SOD) survey going back to 1994 for banks’ branch-level deposits.

10/19/2023: Government Essentials - Added US Federal Government Revenue Collections from the US Treasury Fiscal Data

The US Treasury provides a daily overview of net federal revenue collections from income tax deposits, customs duties, fees for government services, fines, and loan repayments. These collections undergo electronic and/or non-electronic processing, involving various channels such as mail, internet, banking, and over-the-counter transactions, all of which are comprehensively incorporated within this dataset.

The us_treasury_revenue_collections_timeseries table provides daily net collections amounts broken down by tax category and processing channel. The us_treasury_revenue_collections_attributes table details each collection method reported by the US Treasury.

10/18/23: US Insurance & Healthcare Provider Foundation - Added Form 5500 Schedule A Part 1 insurance data from the US Department of Labor.

Expanded the US Department of Labor data to include information found on Form 5500 Schedule A Part 1. The new table, us_department_of_labor_form_5500_broker_index, provides commission and fee amounts received by a broker for an insurance policy. Additional information about the brokers in the table includes their address, classification as an insurance broker, as well as notes pertaining to the compensation disbursed to them.

The us_department_of_labor_form_5500_broker_index can be joined to insurance carrier and policy information to individual Form 5500 filings, using INSURANCE_POLICY_ID and ACK_ID.

10/16/23: Financial & Economic Essentials - Added exchange rates from the Bank for International Settlements (BIS)

Due to the discontinuation of certain currency conversion pairs by the European Central Bank (ECB), our primary source for daily FX rates, we have sourced a number of these pairs from an alternative source, the Bank for International Settlements (BIS), for ongoing history. The Bank for International Settlements will be used to get data for the following currency conversion pairs after September 27, 2023: USD:AED, USD:ARS, USD:CLP, USD:COP, USD:DZD, USD:MAD, USD:PEN, USD:QAR, USD:SAR, USD:TWD, USD:UAH.

Full history from the Bank for International Settlements was added for the following currency pairs: USD:ALL, USD:AUD, USD:BAM, USD:BHD, USD:BND, USD:EUR, USD:GBP, USD:IRR, USD:ISK, USD:KWD, USD:KZT, USD:LKR, USD:MKD, USD:MUR, USD:NPR, USD:NZD, USD:OMR, USD:RSD, USD:RUB, USD:TND, USD:TTD, USD:UYU, USD:VEF, USD:XDR.

10/11/23: Government Essentials - Added population variables to the American Community Survey tables.

Expanded the american_community_survey_attributes and american_community_survey_timeseries tables to include additional population variables related to income, age, and educational attainment.

New series include Household Income in the Past 12 Months (Inflation-Adjusted), Educational Attainment for the Population 25 Years and Over, and Age of Householder By Household Income in the Past 12 Months (Inflation-Adjusted). These series are available by multiple breakdowns (ex. income, age, gender, etc.).

10/11/23: Consumer Spending Foundation - Added company-level estimates; Added COMPANY_INDEX table

Added year-over-year estimates for company-level spend, transaction, and AOV to the CONSUMER_SPENDING_TIMESERIES. Company-level information is identified with the company_id field.

The company_id is a unique identifier assigned by Cybersyn to each company and is joinable to the COMPANY_INDEX, which provides company names and other helpful identifiers such as CIK, LEI, PermID, and more. Note that when the company_id is null, then the row represents data for all companies.

10/09/23: US Housing & Real Estate Essentials - Added U.S. Census Regions & Divisions

Expanded the geography_index table to include U.S. Census Regions and Divisions.

Expanded the geography_hierarchy table to include the relationships between U.S. Census regions and U.S. Census divisions; U.S. Census regions and U.S. states; and U.S. Census divisions and U.S. states.

Census regions include the United States Northeast, Midwest, etc. and census divisions include the United States Middle Atlantic, East North Central, etc.

10/8/23: SEC Filings - Added PERMID_SECURITY_INDEX table to expand security coverage
  • Added one new table to our dataset, PERMID_SECURITY_INDEX, which includes security identifiers from Refinitiv’s PermID database. These are persistent identifiers for active and inactive securities across global asset classes. The table includes over 15K PermIDs for various securities.

  • This data can be merged to the 13F filings data (SEC_HOLDING_FILING_ATTRIBUTES) and can be mapped back to companies using the COMPANY_SECURITY_RELATIONSHIPS table.

10/06/23: Tech & Innovation Essentials - Added in repository of web domains plus included GitHub Archive & US Patents tables

Cleaned and aggregated over 300M domains in a single source to track the list of websites globally into new domain_index table.

Added GitHub Archive and US Patents Grants tables to the product, rebranded product from "IMEI Type Allocation Codes" to "Tech & Innovation Essentials."

10/2/23: Government Essentials - Added population data from the American Community Survey

Expanded our population dataset to include annual estimates from the American Community Survey (ACS) for 2021 and 2022 at multiple geographic levels in the United States.

September 2023

9/18/23: Financial & Economic Essentials - Added 6 timeseries from FRED covering core CPI and industry and commodity-specific PPI data

Added 6 new timeseries from FRED to the financial_fred_timeseries and financial_fred_attributes tables:

  • PPIFIS: Producer Price Index by Commodity: Final Demand

  • PCU4841214841212: Producer Price Index by Industry: General Freight Trucking, Long-Distance Truckload

  • PCU4841224841221: Producer Price Index by Industry: General Freight Trucking, Long-Distance Less Than Truckload

  • PCU3313133131: Producer Price Index by Industry: Alumina and Aluminum Production and Processing

  • WPU101707: Producer Price Index by Commodity: Metals & Metal Products: Cold Rolled Steel Sheet and Strip

  • CPILFESL: (CORE CPI) Consumer Price Index for All Urban Consumers: All Items Less Food & Energy in U.S. City Average

9/17/23: SEC Filings, LLM Training Essentials - Added COMPANY_INDEX, COMPANY_CHARACTERISTICS, and COMPANY_SECURITY_RELATIONSHIP tables; added PermIds

Added three new tables:

  • The COMPANY_INDEX table aggregates commonly used company identifiers (i.e. CIKs, EINs, and LEIs) into a single a single company_id, which can be used across Cybersyn’s datasets as a unique identifier for corporate entities.

  • The COMPANY_SECURITY_RELATIONSHIP table maps OpenFIGI and PermID securities (i.e. securities with multiple "levels" such as OpenFIGI FIGI ID and OpenFIGI Share Class ID) to the Company.

  • TheCOMPANY_CHARACTERISTICS table includes categorical characteristics of a Company (e.g. industry, address, previous names). A characteristic may be temporal with start and end dates indicating the range for which the data is valid.

Added PermId securities published by Refinitiv.

9/15/23: Weather & Environmental Essentials - Added daily weather data from 80K+ weather stations across 180 countries; updated product name

New tables added:

  • NOAA_WEATHER_STATION_INDEX contains metadata on the weather stations from the Global Historical Climatology Network daily (GHCNd) database, including mappings to Cybersyn’s GEO_ID at the country-, state- and zip-level (where applicable).

  • NOAA_WEATHER_METRICS_ATTRIBUTES includes the daily weather variables tracked globally and their measurement details.

  • NOAA_WEATHER_METRICS_TIMESERIES provides the details of the daily global weather variables recorded at each weather station.

Updated product name from "Emissions & Environment Essentials" to "Weather & Environmental Essentials"

9/15/23: US Insurance & Healthcare Provider Foundation - Added healthcare provider emails; combined TELEPHONE and TELEPHONE_EXTENSION into one field; changed TELEPHONE to array to accommodate numerous values
  • Added EMAIL field to the nppes_provider_addresses table with provider emails per address.

  • Combined TELEPHONE and TELEPHONE_EXTENSION from the nppes_provider_addresses table into a single field, TELEPHONE, and removed the TELEPHONE_EXTENSION field.

  • Aggregated all values for TELEPHONE, FAX, and EMAIL that are associated with the same NPI and address into arrays in one row. Rows in the nppes_provider_addresses table are now uniquely defined by NPI and full address.

9/15/23: US Points of Interest & Addresses, Government Essentials - Added FIPS 10-4 country codes and state abbreviations

Expanded the geography_characteristics table to include mappings of FIPS 10-4 country codes and U.S. state abbreviations to country and state-level GEO_IDs, respectively.

9/10/23: US Insurance & Healthcare Provider Foundation - Added table to relate taxonomy classifications to practitioners’ license numbers

Added the NPPES_PROVIDER_TAXONOMY_AND_LICENSE_NUMBERS table that relates taxonomy classifications to practitioners’ license numbers. This table provides users the ability to filter for license numbers based on practitioners’ primary taxonomy.

9/7/23: SEC Filings - Added 13F filings to include data on quarterly investment fund managers’ holdings; added a securities index table based on OpenFIGI data

Expanded our dataset to include individual filings from 13F fund holding reports, which disclose the equity holdings of institutional investment managers. Added three new tables, SEC_HOLDING_FILING_INDEX, SEC_HOLDING_FILING_ATTRIBUTES and OPENFIGI_SECURITY_INDEX.

SEC_HOLDING_FILING_INDEX table contains metadata from individual 13F filings including filing date and filing organization. SEC_HOLDING_FILING_ATTRIBUTES table includes securities' names, market value, number of shares held, and OpenFIGI IDs, which facilitate easier mapping and analysis to outside data sources. Table OPENFIGI_SECURITY_INDEX contains an index of over 2M securities listed on OpenFIGI and can be joined with table SEC_HOLDING_FILING_ATTRIBUTES using TOP_LEVEL_OPENFIGI_ID - the unique identifier for each security in the two tables.

9/6/23: US Insurance & Healthcare Provider Foundation - Added contact information and broker payments to Form 5500 data
  • Added new fields to us_department_of_labor_form_5500_filing_index with Form 5500 contact information including name and phone numbers: ADMIN_SIGNED_NAME, SPONSOR_SIGNED_NAME, DIRECT_FILING_ENTITY_SIGNED_NAME, ADMIN_PHONE_NUM and SPONSOR_DIRECT_FILING_ENTITY_PHONE_NUM.

  • Added new fields to us_department_of_labor_form_5500_policy_index with payments to agents and brokers: COMMISSIONS_PAID_TO_BROKER and FEES_PAID_TO_BROKER.

  • Removed ~10k rows from us_department_of_labor_form_5500_policy_index that had NULL values for each field as they filed no data around insurance policies from Form 5500 Schedule A.

August 2023

8/31/23: US Insurance & Healthcare Provider Foundation - Added deactivated NPI numbers to NPPES data

Added a new table, nppes_npi_index, that contains information on when NPIs were first issued, deactivated, or reactivated - dating back to 2005. This table also includes a boolean flag to indicate if an NPI is currently active.

Note that while all NPIs appear in the nppes_npi_index table, only actively registered NPIs as well as NPIs deactivated after August 1, 2023 appear in the nppes_practitioner_attributes and nppes_organization_attributes tables. This means the dataset does not include attribute-level data (names, type of providers, specialization) on providers with NPIs deactivated before August 2023.

8/27/23: US Points of Interest & Addresses, US Housing & Real Estate Essentials - Added points of interest data from Overture Maps Foundation

Added the point_of_interest_index table, which includes names and categories for points of interest in the US. Each POI is uniquely identified by a POI_ID.

To tie POIs to addresses, we added a new column, ADDRESS_ID, to the us_addresses table to uniquely identify each individual address. This column allows users to join addresses to POIs using the new point_of_interest_addresses_relationships table with POI_ID and ADDRESS_ID as the join keys for the point_of_interest_index table and us_addresses table, respectively.

8/27/23: US Points of Interest & Addresses, US Housing & Real Estate Essentials - Added 7.2M new addresses, removed 49.8M duplicate addresses, deleted 1.2M addresses with Null STREET value

Added 7.2M new addresses covering points of interest from Overture Maps Foundation to the us_addresses table.

Removed 49.8M addresses that were duplicative aside from minor variability in coordinates. Removed 1.2M rows from rows from the us_addresses table where the STREET value contained a string with value Null.

8/27/23: US Points of Interest & Addresses, US Housing & Real Estate Essentials - Added country-level geospatial boundaries to the geography_characteristics table

Added country-level geospatial boundaries to the geography_characteristics table with data from Overture Maps Foundation.

8/13/23: SEC Filings - Added 8-K filings and exhibits for 10-Qs and 10-Ks

Expanded our coverage of SEC documents to include the full text of 8-K filings and associated exhibits. 8-K filings include company press releases, earnings releases, and other major corporate events.

Added the full text of exhibits for 10-K and 10-Q filings. Exhibit types include lists of subsidiaries, merger agreements, and material changes in financial conditions. Exhibits are denoted in the variable and variable_name columns (e.g. 10-K EX-21 Filing Text).

Added the sec_document_id column. This field is a combination of the ADSH (accession number) and the document type (e.g. 10-K). This serves as a unique identifier for each individual component that makes up a filing in cases when one or more exhibits are included in a filing.

8/11/23: Government Essentials, US Housing & Real Estate Essentials, US Addresses & Geographic Areas - Added geospatial boundaries data for territories in the US and Canada

The Census Bureau and Statistics Canada publish geospatial boundaries data for their territories at multiple geographic levels. We added a table geography_characteristics with the boundary coordinates from the most recent releases in both WKT and GeoJSON formats. The table is joinable at different levels using Cybersyn's GEO_ID. This GEO_ID is compatible with all Cybersyn listings that have geographic identifiers. Currently, the geographic levels covered include:

  • State (US and Canada)

  • County (US only)

  • Census Tract (US only)

  • ZIP Code (US only)

  • Dissemination Area and Aggregate Dissemination Area (Canada only)

  • Census Division and Census Subdivision (Canada only)

  • Census Agglomeration and Census Agglomeration Part (Canada only)

  • Census Metropolitan Division and Census Metropolitan Division Part (Canada only)

8/10/23: Financial & Economic Essentials - Added crosswalk to FRED series IDs & 107 new series from GDP, Employment Situation, Housing Starts, and Residential Construction reports

Added a new table, financial_fred_variable_series_id_crosswalk, that enables a join between Cybersyn’s variable and FRED’s unique series ID

Added new series from the following four reports:

  • Gross Domestic Product (data produced by the US Bureau of Economic Analysis)

  • Employment Situation (US Bureau of Labor Statistics)

  • New Residential Construction (US Department of Housing and Urban Development)

  • Quarterly Starts and Completions by Purpose and Design (US Department of Housing and Urban Development)

8/7/23: Government Essentials - Added text-based US government contracts data from SAM.gov

The US government publishes contract opportunities and proposals to do business with the federal government via the System for Award Management (sam.gov) for contracts and awards with a value of at least $25,000. The data goes back to January 2002 and includes metadata providing descriptions of government contracts and the corresponding awards granted for those contracts.

July 2023

7/31/23: Financial & Economic Essentials - Added release_name and release_source for better discoverability
  • release_name: The collection, group of data, or report from which a time series originates. This column can be used as a filter to find related series.

  • release_source: The organization (e.g. FDIC, Federal Reserve) that FRED collects the data from.

Two columns were added to financial_fred_attributes to provide better categorization and discoverability:

7/28/23: Canada Government Essentials - Removed the age_group measure and labour_force_statisticcolumns

Legacy columns from the July 16, 2023 release were removed.

  • age_group

  • measure

  • labour_force_statistic

7/25/23: Financial & Economic Essentials - Added 124 time series from FRED covering a variety of economic data

124 additional time series added from FRED to the financial_fred_timeseries and financial_fred_attributes.

7/16/23: Canada Government Essentials - Added prices, output, and household income; schema changes
  • New datasets:

    • Household income and finances: Household income and consumption, household credit liabilities, and household savings rates

    • Prices and output: Core consumer price index, gross domestic product by industry group, and new housing price indices

  • Updated datasets:

    • StatCan archived a number of retail trade series and published new series. Old series are marked “Archived…” in the report field. The latest series have been included to replace these.

  • Schema changes:

    • The following columns in the canada_statcan_attributes view were updated. The deprecated columns will be removed on 7/28/2023.

      • age_group will be folded into demographic_group, which applies more broadly to age ranges, income groups, and household makeups (e.g., 35 to 44 years, lowest income quintile, elderly persons not in an economic family)

      • measure will be renamed report. The report column displays the StatCan dataset from which the data originates (e.g. Consumer Price Index, New Housing Price Index)

      • labour_force_statistic will be renamed statistic. The new statistic column will provide the label for the specific economic metric that is being reported (e.g. Median After Tax Income, Number of families)

7/3/23: US Insurance & Healthcare Provider Foundation - Added Form 5500 insurer information

Added information from US Department of Labor Form 5500 filings about company benefit providers and the insurance/benefit plans they offer. New tables include us_department_of_labor_form_5500_filing_index and us_department_of_labor_form_5500_policy_index.

June 2023

6/14/23: SEC Filings - Added full text 10-Qs and 10-Ks

Added the full text of 10-K/Q filings. These are contained in the sec_report_text_attributes table.

6/13/23: Financial & Economic Essentials - Added Bureau of Labor Statistics datasets

Added Consumer Price Index (CPI), Average Prices (AP), Job Openings and Labor Turnover Survey (JOLTS), State and Metro Area Employment , Hours, & Earnings (SAE), Local Area Unemployment Statistics (LAUS) from the Bureau of Labor Statistics.

6/1/23: Government Essentials - Added 3,000 new US zip codes from USPS and US Census
  • Using USPS address change data, we added 3,000 zip codes (mostly PO Box) to the dc_geo_index.

  • Using both the USPS address change and US Census Bureau data, we increased the coverage in geography_relationships table with 6,500 new zip and city relationships. We now map 86% of zip codes to a city.

May 2023

5/26/23: Financial & Economic Essentials - Added 5 global central bank policy rates

Central bank policy rates from Brazil, Canada, England, Mexico, and Japan added to fred_timeseries and fred_attributes

5/19/23: Government Essentials - Updated product name from Cybersyn Data Commons to Cybersyn Government Essentials

Rebranded Cybersyn Data Commons as Cybersyn Government Essentials. We updated the naming conventions for schemas, tables, and column names to make them consistent across all of Cybersyn’s existing and future data products.

Cybersyn will continue to support and update your older version of the Data Commons tables.

5/19/23: US Addresses & Geographic Areas, GitHub Events - Added source data from National Address Database (NAD)

Added the National Address Database (NAD) as a source to increase our US address coverage:

  • Increased the coverage from 140 million addresses to more than 188 million.

  • There is now at least one address in more than 85% of zip codes, up from 74% previously.

  • Increased the portion of cities that are mapped to distinct IDs joinable to our other data sets from 24% to over 77%

Last updated

Copyright © 2024 Cybersyn