Changelog
Subscribe to receive a bi-weekly summary of release notes in your inbox.
Note, the Enterprise product is updated with all public data releases below.
July 2024
7/30/24: Government Essentials - Added monthly US domestic flight segment data from the US Department of Transportation.
The US Department of Transportation (DOT) provides monthly data on domestic flight segments within the United States, categorized by World Area Codes (WAC). It provides details into the types of flights, aircrafts, and segments. This dataset is useful for analyzing domestic air traffic patterns, evaluating airline market share and performance, and conducting assessments within the aviation sector.
The data is available in the following tables:
US_DEPARTMENT_OF_TRANSPORTATION_ATTRIBUTES
US_DEPARTMENT_OF_TRANSPORTATION_TIMESERIES
AIRPORT_INDEX
AIRCRAFT_INDEX
AIRCRAFT_CARRIER_INDEX
7/1/24: US Insurance & Healthcare - Added Form 5500 Schedule SB from the US Department of Labor.
Expanded the US Department of Labor data to include information found in Form 5500 Schedule SB. The new tables, us_department_of_labor_form_5500_schedule_sb_attributes
and us_department_of_labor_form_5500_schedule_sb_timeseries
, monitor the financial health and funding adequacy of single-employer defined benefit pension plans.
7/1/24: Weather & Environmental - Added electricity consumption, sales, and capacity data from the EIA.
The EIA is part of the US Department of Energy and serves as a primary source of data on energy production, consumption, and sales in the US.
Our EIA data covers the following: (1) Electricity price, sales, revenue and number of customers by sector, and (2) State-specific electricity generation, consumption, and capacity data. The data is available in the following tables:
eia_energy_attributes
eia_energy_timeseries
June 2024
6/26/24: Enterprise - Added company event transcripts, incl. earnings transcripts.
Cybersyn now provides transcripts of hosted company events (e.g., earnings calls, updates/briefings, Annual General Meetings, and more) in JSON-format. The events cover thousands of public companies globally. Transcripts are usually made available within 3 hours of the event conclusion.
Data is available in the COMPANY_EVENT_TRANSCRIPT_ATTRIBUTES
table.
6/25/24: Enterprise - Added parsed 20-F items into HTML, text, and JSON format.
Expanded the SEC_CORPORATE_REPORT_ITEM_ATTRIBUTES
table to include parsed 20-Fs. 20-Fs are annual reports submitted by non-US public companies to the SEC, similar to a 10-K filing by US companies. Parsing 20-Fs into items optimizes SEC filings for LLM use cases.
6/17/24: US Insurance & Healthcare - Added Form 5500 Schedule H from the US Department of Labor.
Expanded the US Department of Labor data to include information found in Form 5500 Schedule H. The new tables, us_department_of_labor_form_5500_schedule_h_attributes
and us_department_of_labor_form_5500_schedule_h_timeseries
, provide detailed financial data of large pension and welfare benefit plans, including assets, liabilities, income, expenses, and other financial information of the plan.
6/4/24: US Insurance & Healthcare - Added Form 5500 Schedule C service provider data from the US Department of Labor.
Expanded the US Department of Labor data to include information found in Form 5500 Schedule C:
Part I, Item 1: Persons Receiving only Eligible Indirect Compensation.
us_department_of_labor_form_5500_schedule_c_part1_item1_index
Reports information on service providers who received only eligible indirect compensation and not direct compensation. It requires details on the type of services provided and the nature of the eligible indirect compensation.Part I, Item 2: Persons Receiving Direct or Indirect Compensation.
us_department_of_labor_form_5500_schedule_c_part1_item2_index
Provides details on service providers who received $5,000 or more in direct or indirect compensation. Includes identification information, service codes, and amounts received.Part I, Item 3: Indirect Compensation over $1,000.
us_department_of_labor_form_5500_schedule_c_part1_item3_index
Requires reporting for service providers receiving significant indirect compensation (excluding eligible compensation) and detailed information on each source of $1,000+. Details must be provided if a formula was used to determine the compensation.Part II: Service Providers failing to provide information.
us_department_of_labor_form_5500_schedule_c_part2_index
Lists service providers who failed or refused to provide the required information for Schedule C. It includes the service provider's identification and efforts made by the plan to obtain the information.Part III: Terminated Service Providers.
us_department_of_labor_form_5500_schedule_c_part3_index
Reports on service providers terminated during the plan year. Includes their identification, the nature of services provided, and the reasons for termination.
May 2024
5/28/24: Financial & Economic Essentials, Government Essentials - Added the US Economic Census survey conducted by the US Census Bureau.
The Economic Census is a survey conducted by the US Census Bureau every 5 years that provides detailed economic data on US businesses, such as number of establishments, types of goods and services provided, employment figures, payroll expenses, and operational costs, across different industries and geographic regions.
The data is available in the following tables:
US_ECONOMIC_CENSUS_ATTRIBUTES
US_ECONOMIC_CENSUS_TIMESERIES
5/21/24: Government Essentials, Crime Statistics - Added national and state crime estimates in the US from the FBI.
The FBI, or Federal Bureau of Investigation, is the principal federal law enforcement agency in the US, taked with investigating and enforcing federal laws. Data includes crime totals across the nation and its 50 states from 1979. The offense category is provided for reference.
The data is available in the following tables:
FBI_CRIME_ATTRIBUTES
FBI_CRIME_TIMESERIES
5/28/24: Government Essentials - Added state and Continuum of Care (CoC)-level homelessness estimates from the US Department of Housing & Urban Development.
The US Department of Housing and Urban Development (HUD) is a federal agency responsible for national policies on America's housing needs. They conduct the Annual Homeless Assessment Report (AHAR) and Point-in-Time (PIT) count and Housing Inventory Count (HIC) surveys in January of each year. These reports provide a detailed view on the number of homeless individuals, chronically homeless persons, homeless veterans, and homeless children and youth.
The data is available in the following tables:
HOUSING_URBAN_DEVELOPMENT_ATTRIBUTES
HOUSING_URBAN_DEVELOPMENT_TIMESERIES
5/16/24: Enterprise - Added additional SEC filings (S-1, S-4, SC-13D, SC-14D, 20-F, 40-F, 6-K, DEFM14A, PREM14A, SC-TO)
See here for table details.
5/15/24: Enterprise - Added parsed 10K/Qs and updated SEC XBRL methodology
10-K/Q sections are now parsed into plain text, HTML and JSON structure supporting LLM use cases. The data is available in the below table:
sec_corporate_report_item_attributes
Also updated SEC XBRL methodology to improve data accuracy.
April 2024
4/22/24: Weather & Environmental Essentials - Added global emissions data from Climate Watch
Climate Watch's historical emissions dataset provides a record of past greenhouse gas emissions across various sectors and gases, categorized by both geographic and sector-specific divisions. The data also presents a range of future emission scenarios, each predicated on distinct assumptions regarding economic growth, technological advancements, and policy implementations. The data is available in the following tables:
climate_watch_attributes
climate_watch_timeseries
4/17/24: Enterprise - Added additional SEC filings (Form 3 & Form 5)
Form 3 and Form 5 data is available in the following tables:
sec_insider_trading_reporting_owners_index
sec_insider_trading_securities_index
4/11/24: Enterprise - Added additional SEC filings (Form 4 & Form 144)
Form 4 and Form 144 data is available in the following tables:
sec_form144_securities_info_index
sec_form144_securities_sold_index
sec_form144_securities_to_be_sold_index
sec_form4_reporting_owners_index
sec_form4_securities_index
4/3/24: Enterprise - Added parsed segment revenues from the SEC
Parsed/cleaned quarterly & annual segment revenues from 10-Ks and 10-Qs are now available in SEC_METRICS_TIMESERIES
. This table translates the raw XBRL data from the SEC into a clean version of SEC revenue segments over time.
March 2024
3/7/24: Government Essentials - Added regional GDP, retails sales index, and safety data from the UK Office for National Statistics
Added data from the UK Office for Statistics in support of the Women In Data Safety Hackathon. Datasets added include regional GDP by quarter, retail sales index, prisoner statistics, and domestic abuse statistics.
Data is available in the following tables:
united_kingdom_timeseries
united_kingdom_attributes
3/6/24: Cybersyn AI Utilities - Added OpenAI & Anthropic external functions; added phone number parsing functions; renamed product
Added additional functions that allow users to call out to OpenAI & Anthropic and clean/process phone numbers directly in Snowflake SQL. Update product name to "Cybersyn AI Utilities"
February 2024
2/27/24: Consumer Spending - Added zip codes & customer retention
Expanded product granularity to include zip code level data. Added customer retention data.
Data is available in the following tables:
CONSUMER_SPENDING_MERCHANT_RETENTION_TIMESERIES
CONSUMER_SPENDING_INDUSTRY_TIMESERIES
CONSUMER_SPENDING_MERCHANT_TIMESERIES
2/26/24: US Housing & Real Estate Essentials - Added the Building Permit Survey and construction spending from the US Census Bureau
Added new data from the US Census Bureau. An indicator of future construction activity, the Building Permits Survey (BPS) provides the number and valuation of new housing units authorized by building permits at the national, state, and local levels. Construction Spending data quantifies actual expenditures on construction projects, categorized into residential, non-residential, and public sectors
The data is available in the following tables:
US_REAL_ESTATE_ATTRIBUTES
US_REAL_ESTATE_TIMESERIES
2/21/24: Financial & Economic Essentials - Added daily Nasdaq stock prices & trading volumes; added economic and development indicators from the OECD
Launched daily trading volumes, open/close and high/low prices of all US equities and ETFs executed on the Nasdaq. Cybersyn releases volume and pricing data, inclusive of pre-market and after hours activity, from the previous trading day at 6:00am ET. Data is sourced from Databento, a market data provider that connects directly with Nasdaq.
The data is available in the following table:
STOCK_PRICE_TIMESERIES
Added global economic and development indicators by country from the OECD. Added the OECD's Social Expenditure Database which tracks key indicators of social policy spending across countries. The
The data is available in the following tables:
OECD_ATTRIBUTES
OECD_TIMESERIES
2/15/24: GitHub Archive - Added language
field with the programming language of the repo
Added language
the github_events
table that will provide the programming language (e.g. Python, SQL) of the repository
2/15/24: Government Essentials - Added global economic & growth indicators from the World Bank
Expanded coverage of the World Bank to include global and regional growth projections from the Global Economic Prospects report, international debt statistics by country (e.g. average interest on new debt commitments, public sector debt, external assets in debt instruments), global economic indicators by country (e.g. inflation, exchange rates, GDP, retails sales), and world development indicators by country (e.g. access to electricity, bank account ownership, life expectancy).
The data is available in the following tables:
WORLD_BANK_ATTRIBUTES
WORLD_BANK_TIMESERIES
2/15/24: Financial & Economic Essentials - Added Survey of Consumer Expectations and Household Debt & Credit reports from the New York Fed
Added quarterly household debt & credit data and monthly consumer expectations for inflation, household finances, and the labor market.
The data is available in the following tables:
NEWYORKFED_CONSUMER_ATTRIBUTES
NEWYORKFED_CONSUMER_TIMESERIES
2/8/24: GitHub Archive - Added last_seen
field with repo name last seen date
Added a new field to the github_repos
table. last_seen
includes the last date a repo_name
was used. This field allows you to identify the latest repo_name
for a particular repo_id
.
The field was added as a workaround for a known issue with the source data: In the GitHub events dataset, a repository name should appear as a combination of the repository organization and name. In 2018, there are repository names that are incomplete. For example, rust-lang/ appeared without its full repository name, rust-lang/rust, for ~3 months. Additionally, in 2019, some repository names appeared with duplicate organizations (e.g. nolimits4web/nolimits4web/swiper). In both cases, GitHub implemented a go-forward fix.
2/7/24: Government Essentials - Removed legacy US Treasury revenue collections tables
Removed legacy tables: US_TREASURY_REVENUE_COLLECTIONS_ATTRIBUTES
and US_TREASURY_REVENUE_COLLECTIONS_TIMESERIES
.
Users still have access to revenue collections data in the US_TREASURY_ATTRIBUTES
and US_TREASURY_TIMESERIES
tables that were released on 12/14/23.
January 2024
1/31/24: Weather & Environmental Essentials - Removed legacy tables with data from Our World in Data and CDR.fyi
Removed the following legacy tables from Our World in Data and CDR.fyi:
carbon_intensity_attributes
carbon_intensity_timeseries
carbon_credit_purchase_attributes
carbon_credit_purchase_timeseries
Users still have access to the carbon intensity data in the OUR_WORLD_IN_DATA_ATTRIBUTES
and OUR_WORLD_IN_DATA_TIMESERIES
tables that were released on 12/15/23.
1/27/24: Government Essentials - Added global Environmental, Social, Governance (ESG) statistics from the World Bank
Added annual ESG statistics on global sustainability performance for 200+ countries and territories. Example country-level variables include CO2 emissions, natural resource depletion, energy use, government effectiveness, life expectancy, population with access to safely managed drinking water, and renewable electricity output.
Added governance indicators for 200+ countries and territories. The Worldwide Governance Indicators (WGI) report on six broad dimensions of governance: voice and accountability, political stability and absence of violence, government effectiveness, regulatory quality, rule of law, and control of corruption.
The data is available in the following tables:
WORLD_BANK_TIMESERIES
WORLD_BANK_ATTRIBUTES
1/26/24: LLM Training Essentials, Tech & Innovation Essentials - Added open catalog of scholarly entities and how they are connected from OpenAlex
Added OpenAlex's catalog on scholarly entities and how they are connected to each other. Entities are defined as scholarly works (e.g. journal articles, books, theses), authors, sources, affiliated organizations, topics covered, publishers, and funders. This data is derived from a wide range of sources, offering an extensive overview of academic research and its contributors.
The data is available in the following tables:
OPENALEX_AUTHORS_INDEX
OPENALEX_CONCEPTS_INDEX
OPENALEX_FUNDERS_INDEX
OPENALEX_INSTITUTIONS_INDEX
OPENALEX_PUBLISHERS_INDEX
OPENALEX_SOURCES_INDEX
OPENALEX_WORKS_INDEX
1/26/24: Web Traffic Foundation - Added additional AI-focused domains; improved pre-2022 model quality
Added additional domains and limited subdomains to measure traffic for popular AI companies. New domains include chat.openai.com, bard.google.com, claude.ai, and more.
Improved model quality to more accurately predict web traffic in older periods before 2022.
1/19/24: Government Essentials - Added weekly national unemployment insurance claims from the US Department of Labor
Expanded US Department of Labor coverage to include national weekly figures on initial and continuing unemployment claims filed during the previous week. Initial claims represent the number of individuals who have filed for unemployment benefits for the first time during a given week. Continuing claims indicate the number of individuals who are still receiving unemployment benefits in subsequent weeks.
The data is available in the following tables:
US_DEPARTMENT_OF_LABOR_UNEMPLOYMENT_INSURANCE_CLAIMS_ATTRIBUTES
US_DEPARTMENT_OF_LABOR_UNEMPLOYMENT_INSURANCE_CLAIMS_TIMESERIES
1/9/24: Weather & Environmental Essentials - Added global agrifood emissions from the Food and Agriculture Organization (FAO)
Added greenhouse gas (GHG) emissions related to agriculture, forestry, and other land use activities. Example variables include: 1) Total emissions generated from agrifood systems 2) Emissions associated with crop processes including rice cultivation and burning of crop residues 3) Emissions from livestock 4) Emissions from land use including forests, fires, and drained organic soils 5) Intensity of emissions by agricultural commodity.
The data is available in the following tables:
FOOD_AGRICULTURE_ORGANIZATION_ATTRIBUTES
FOOD_AGRICULTURE_ORGANIZATION_TIMESERIES
1/8/24: Government Essentials - Added US Consumer Credit, Industrial Production, Financial Accounts, and Commercial Paper from the Federal Reserve
Added the following timeseries from the Federal Reserve to the FEDERAL_RESERVE_TIMESERIES
and FEDERAL_RESERVE_ATTRIBUTES
tables:
Outstanding monthly revolving and non-revolving US consumer credit including select terms (e.g. interest rates on new car loans, personal loans, credit card plans) and major holders (e.g. credit unions, finance companies, depository institutions) - Fed Report Number G.19
Monthly industrial production and capacity utilization rates covering manufacturing, mining, and electric and gas utilities. The production index measures real output and the capacity index estimates sustainable potential output. - Fed Report Number G.17
Quarterly assets and liabilities by sector and financial instruments. Quarterly balance sheets, including net worth, for households, nonprofits and non-financial businesses. - Fed Report Number Z.1
Daily commercial paper issuance rates and volumes - Derived from data supplied by The Depository Trust & Clearing Corporation
Monthly owned and managed outstanding receivables at financial services companies. - Fed Report Number G.20
1/5/24: Government Essentials - Added weekly unemployment insurance (i.e. jobless) claims by state from the US Department of Labor
Added weekly unemployment insurance claims (i.e. jobless claims) which count the number of people filing to receive unemployment insurance benefits by state. The claims are broken into two categories - initial (number of people filing for the first time) and continued (number of people filing for ongoing unemployment benefits).
The data is available in the following tables:
US_DEPARTMENT_OF_LABOR_UNEMPLOYMENT_INSURANCE_CLAIMS_ATTRIBUTES
describes the unemployment insurance variables tracked by the US Department of LaborUS_DEPARTMENT_OF_LABOR_UNEMPLOYMENT_INSURANCE_CLAIMS_TIMESERIES
provides historical values for each US Department of Labor reported attribute
1/3/24: US Housing & Real Estate Essentials - Added monthly housing prices and weekly mortgage interest rates from Freddie Mac
Added Freddie Mac’s House Price Index (HPI) which provides a monthly measure of typical price inflation for single family homes within the US using data from loans purchased by Freddie Mac or Fannie Mae.
Added Freddie Mac's Primary Mortgage Market Survey (PMMS) which provides a weekly report on the national average of 30/15-year fixed rate mortgages as well as adjustable rate mortgages, tracking borrowing costs for US homebuyers and refinancers.
The data is available in the following tables:
FREDDIE_MAC_HOUSING_ATTRIBUTES
details the Freddie Mac House Price Index and mortgage interest rate variables covered.FREDDIE_MAC_HOUSING_TIMESERIES
provides historical values for each reported attribute
December 2023
12/20/23: Financial & Economic Essentials - Added international economic indicators from the International Monetary Fund (IMF)
Added the below IMF indicators:
Primary Commodity Price System - prices of commodities by country
Balance of Payments - countries’ economic transactions with the rest of the world e.g. trade balances, capital flows, and official reserves
Fiscal Monitor - detailed government finances e.g. revenues, expenditures, transactions in financial assets/liabilities
International Financial Statistics - macroeconomic and financial indicators e.g. exchange rates, international liquidity, interest rates, population
Coordinated Portfolio Investment Survey - cross-border holdings of equity and debt securities
Africa Regional Economic Outlook - economic indicators for Africa
Asia and Pacific Regional Economic Outlook - economic indicators for the Asia and Pacific region
The data is available in the following tables:
INTERNATIONAL_MONETARY_FUND_ATTRIBUTES
: Human-readable variable names describing the financial statistic being measuredINTERNATIONAL_MONETARY_FUND_TIMESERIES
: Historical values for the tracked financial variables
12/19/23: Canada Government Essentials - Added month-end assets and liabilities for the Bank of Canada from Statistics Canada
Expanded coverage of Statistics Canada, Canada's national statistical office, with month-end assets and liabilities for the Bank of Canada and Chartered Banks. Added month-end currency outside bank/chartered bank deposits for the Bank of Canada.
Select assets include: Government of Canada direct and guaranteed securities (e.g., Treasury Bills, Government of Canada bonds, mortgage bonds), provincial money market securities, provincial bonds, commercial paper, and corporate bonds.
Select liabilities include: notes in circulation, Canadian dollar deposits, securities sold under repurchase agreements, derivatives, foreign currency liabilities, and equity.
The data can be found in the following tables:
CANADA_STATCAN_ATTRIBUTES
includes a unique variable ID and a human-readable variable name. Each field represents a characteristic of the variable, including its unit of measurement, whether it's seasonally adjusted, frequency of measurement, its Statistics Canada report, and other related informationCANADA_STATCAN_TIMESERIES
provides historical values for each variable collected by Statistics Canada byGEO_ID
. Each row represents a distinct timeseries, date, and value by geographic entity.
12/19/23: Financial & Economic Essentials - Added Eurozone residential property prices & valuations, CPI indices from the European Central Bank
Added annual, quarterly, and monthly residential property prices and valuations by residential property type (e.g. houses, flats) across countries in the European Union (EU). Includes details on the source of the data, seasonal adjustment of the variable, and measurement type (e.g. average, maximum or minimum valuation, estimates of the over/undervaluation).
Added annual, quarterly, and monthly consumer prices indices by EU countries for a wide range of consumer goods (e.g. clothing, baby food, books, cigarettes).
Data is available in the following tables:
EUROPEAN_CENTRAL_BANK_TIMESERIES
: Historical records for the tracked real estate and financial variables by European country or country group.EUROPEAN_CENTRAL_BANK_ATTRIBUTES
: Human-readable variable names and details describing residential property and CPI statistics.
12/18/23: Government Essentials - Added industrial development statistics on the manufacturing sector from the United Nations Industrial Development Organization (UNIDO)
Sourced from the INDSTAT 2 (2016-2023), the data provides statistics on industrial performance and trends by country for the manufacturing sector. The following variables over time, by country are included:
Number of establishments
Number of employees
Wages and salaries
Output
Value added
Gross fixed capital formation
Number of female employees
Index of industrial production
Data is available in the following tables:
UNITED_NATIONS_INDUSTRIAL_DEVELOPMENT_ORGANIZATION_ATTRIBUTES
details each manufacturing indicator tracked by ISIC economic activity and INDSTAT yearly version.UNITED_NATIONS_INDUSTRIAL_DEVELOPMENT_ORGANIZATION_TIMESERIES provides historical values on an annual basis for each UNIDO variable tracked.
12/18/23: Weather & Environmental Essentials - Added global emissions by country and industry from the European Commission's Emissions Database for Global Atmospheric Research (EDGAR)
Added global emissions of carbon dioxide (CO2), methane (CH4), nitrous oxide (N2O), and various fluorinated gases (F-gases). F-gases are man-made greenhouse gases emitted from everyday products (e.g. air conditioners, refrigerators) and industrial applications. Both biogenic (emissions that come from natural sources like soils/plants) and fossil fuel sources are included.
Data is available in the following tables:
EUROPEAN_COMMISSION_EDGAR_ATTRIBUTES
details each emissions variable tracked by greenhouse gas type and Intergovernmental Panel on Climate Change (IPCC) industryEUROPEAN_COMMISSION_EDGAR_TIMESERIES
provides historical values on a monthly or annual basis for each tracked variable
12/15/23: Weather & Environmental Essentials - Added annual temperature change, emissions, and energy consumption by country/region, sector, and source from OWID
Added Our Wold in Data (OWID) metrics covering annual temperature change, emissions, and energy consumption by country/state, sector, and source to the OUR_WORLD_IN_DATA_ATTRIBUTES
and OUR_WORLD_IN_DATA_TIMESERIES
tables.
Variable added cover:
Global warming
Carbon Dioxide Emissions
Carbon Intensity
Emissions from Aviation
Energy Consumption
Fugitive Emissions
Greenhouse Gas Emissions
Methane Emissions
Nitrous Oxide Emissions
12/14/23: Tech & Innovation Essentials - Added web domain redirect relationships and domain characteristics
Added two new tables DOMAIN_REDIRECT_RELATIONSHIPS
and DOMAIN_CHARACTERISTICS
:
DOMAIN_REDIRECT_RELATIONSHIPS
includes the original web domain and the domain that it redirects to as well as the start and end dates that the redirect relationship was observed. At the time of this release, the table covers ~50K domains of the 300M+ domains Cybersyn tracksDOMAIN_CHARACTERISTICS
details which web domains are active/inactive based on the HTTP response status of the domain and whether the domain is the primary landing page or redirects to another domain. At the time of this release, the table covers 800K+ domains that Cybersyn tracks.
12/14/23: Government Essentials - Added HQM corporate bond yields, U.S. Treasury savings bonds, TreasuryDirect securities, and interest rates for U.S. Treasury securities
Added high quality market corporate bond yields, U.S. Treasury savings bonds (issues, redemptions, maturities), securities issued in TreasuryDirect, and average interest rates on U.S. Treasury securities from the U.S Treasury.
The
US_TREASURY_ATTRIBUTES
table includes details on each variableThe
US_TREASURY_TIMESERIES
table includes historical values for each variable
12/6/23: Government Essentials - Added global labor statistics from the International Labour Organization (ILO)
Added global labor data from the International Labour Organization (ILO). Variables include employment, unemployment, population, potential labor force, labor force participation, rate of labor underutilization, working poverty rate, and annual growth rate of output per worker. Data can be cut by sex, age, occupation, status in employment, economic activity (e.g. education, utilities, construction), rural/urban areas, and economic class.
The
INTERNATIONAL_LABOUR_ORGANIZATION_ATTRIBUTES
table includes descriptions of each labor statisticThe
INTERNATIONAL_LABOUR_ORGANIZATION_TIMESERIES
table includes historical values for each ILO variable tracked
12/5/23: Financial & Economic Essentials - Added interest rates posted by major chartered banks in Canada for selected products; Added secured overnight financing rates (SOFR) and US Treasury bill rates
Added interest rates for selected products posted by the six major chartered banks in Canada to the FINANCIAL_FRED_ATTRIBUTES
, FINANCIAL_FRED_TIMESERIES
, and FINANCIAL_FRED_VARIABLE_SERIES_ID_CROSSWALK
tables. The typical rate is calculated based on the statistical mode of the rates published by the six banks.
The posted rates cover:
Prime lending rate
Conventional mortgages
Guaranteed investment certificates
Personal daily interest savings
Non-checkable savings deposits
Added secured over night financing rates (SOFR) and US Treasury bill rates to the FINANCIAL_FRED_ATTRIBUTES
, FINANCIAL_FRED_TIMESERIES
, and FINANCIAL_FRED_VARIABLE_SERIES_ID_CROSSWALK
tables.
Series added (FRED ID):
Secured Overnight Financing Rate (SOFR)
30-Day Average Secured Overnight Financing Rate (SOFR30DAYAVG)
90-Day Average Secured Overnight Financing Rate (SOFR90DAYAVG)
180-Day Average Secured Overnight Financing Rate (SOFR180DAYAVG)
Secured Overnight Financing Rate Index (SOFRINDEX)
3-Month Treasury Bill Minus Federal Funds Rate (TB3SMFFM)
6-Month Treasury Bill Minus Federal Funds Rate (TB6SMFFM)
12/5/23: Canada Government Essentials - Added interest rates posted by major charted banks in Canada for selected products
Added interest rates for selected products posted by the six major chartered banks in Canada to the CANADA_STATCAN_ATTRIBUTES
and CANADA_STATCAN_TIMESERIES
tables. The typical rate is calculated based on the statistical mode of the rates published by the six banks.
The posted rates cover:
Prime lending rate
Conventional mortgages
Guaranteed investment certificates
Personal daily interest savings
Non-checkable savings deposits
12/4/23: Financial & Economic Essentials - Added Monthly State Retail Sales (MSRS) from the US Census Bureau
Added monthly state-level retail sales by NAICS codes to the FINANCIAL_FRED_ATTRIBUTES
, FINANCIAL_FRED_TIMESERIES
, and FINANCIAL_FRED_VARIABLE_SERIES_ID_CROSSWALK
tables. The US Census Bureau publishes year-over-year percent changes for each state using a composite model of Monthly Retail Trade Survey (MRTS) data, administrative data, and third-party data beginning in January, 2019.
Year-over-year percent change estimates for the following industries are now available at the state level:
Total Retail Sales Excluding Nonstore Retailers
Building Material and Garden Equipment and Supplies Dealers
Clothing and Clothing Accessories Stores
Electronics and Appliance Stores
Food and Beverage Stores
Furniture and Home Furnishings Stores
Gasoline Stations
Health and Personal Care Stores
Motor Vehicle and Parts Dealers
Sporting Good, Hobby, Musical Instrument and Book Stores
Miscellaneous Stores
November 2023
11/30/23: Government Essentials - Added global trade, tariff, and import relationship data from the World Trade Organization (WTO)
Added global trade flows, imposed tariffs, and trade interactions between countries from the World Trade Organization (WTO) . The data details export and import figures for goods and services across different countries and regions, tariff rates and structures that WTO member countries apply to imports from other nations, trade dependencies between countries, and the balance of trade between specific pairs of nations.
The
WORLD_TRADE_ORGANIZATION_ATTRIBUTES
table details the global trade, tariff, and import relationship statistics tracked by the World Trade Organization (WTO).The
WORLD_TRADE_ORGANIZATION_TIMESERIES
table provides timeseries values by date for the reported trade statistics by country, country group, and global region (as defined by the World Trade Organization).
11/30/23: Government Essentials - Added global health indicators from the World Health Organization (WHO)
Added 1,100+ health-related indicators for 194 members of the World Health Organization (WHO) and their associated country groups and global regions. Example metrics include alcohol consumption among adolescents and adults, tobacco control policies, abortion rates, accessibility of dementia care services, and adolescent fertility rates. Environmental health indicators including air pollution's impact on mortality rates and disability-adjusted life years (DALYs) as well as deaths attributable to the environment are also included.
The
WORLD_HEALTH_ORGANIZATION_ATTRIBUTES
table details the health statistics tracked by the World Health Organization (WHO).The
WORLD_HEALTH_ORGANIZATION_TIMESERIES
table provides timeseries values by date for the reported health indicators by country, country group, or global region (as defined by global organizations like UNICEF, the United Nations, the World Bank, and the World Health Organization).
11/30/23: Government Essentials, Financial & Economic Essentials - Added country groups and regions from the WHO, WTO, and UN to geography tables
Added additional country groups and geography types to the GEOGRAPHY_INDEX
from the World Health Organization (WHO), World Trade Organization (WTO), and United Nations (UN). The member countries of the added geographies are mapped in the GEOGRAPHY_RELATIONSHIPS
table. Select new geographic regions include:
BRICS members
World Trade Organization (WTO) members
Association of Southeast Asian Nations (ASEAN)
UNICEF regions
United Nations regions
United Nations Sustainable Development Goal (SDG) regions
World Bank regions
World Health Organization (WHO) regions and income regions
World Bank regions and income groups
11/28/23: Government Essentials - Added US agricultural export sales data from the USDA
Added US Export Sales Reporting (ESR) data on weekly export sales activity for 40+ US agricultural commodities sold abroad from the US Department of Agriculture's (USDA) Foreign Agricultural Service (FAS).
The
US_DEPARTMENT_OF_AGRICULTURE_COMMODITIES_ATTRIBUTES
now includes the export of commodities in addition to the existing production, supply, and distribution variables.The
US_DEPARTMENT_OF_AGRICULTURE_COMMODITIES_TIMESERIES
table provides the reported metrics for each commodity byGEO_ID
.
11/22/23: Financial & Economic Essentials - Added data from the Bank for International Settlements (BIS) on global banking conditions, property prices, and consumer price indicators
Added data from the Bank for International Settlements (BIS) on global banking conditions, property prices, and consumer price indicators
The Bank for International Settlements (BIS) is an international financial institution owned by 60+ central banks that represent countries accounting for ~95% of global GDP. As part of its mission is to support international monetary and financial cooperation, the BIS acts as a bank for central banks across the world.
Added the below BIS data to two new tables, BANK_FOR_INTERNATIONAL_SETTLEMENTS_ATTRIBUTES
, which describes the metrics tracked by BIS, and BANK_FOR_INTERNATIONAL_SETTLEMENTS_TIMESERIES
, which provides the values of metrics:
Residential property prices
Central bank policy rates
Consumer price indicators
Assets and liabilities of internationally active banks, including credit to the non-financial sector as well as the geographical and currency composition of a bank’s balance sheet
11/22/23: Consumer Spending Foundation - Added spend by geography for states and top 100 CBSAs. Added online/offline spend breakdown
Expanded the granularity of values in the CONSUMER_SPENDING_TIMESERIES
table to include breakdown by geography. Geographies are grouped both by purchaser primary location (i.e. where the spender lives) and purchase location (i.e. where the physical store is located). The data covers US states and the top 100 core-based statistical areas (“CBSAs”) as measured by population.
Purchaser geographies are represented in the
PURCHASER_PRIMARY_GEO_ID
andPURCHASER_PRIMARY_GEO_NAME
fieldsPurchase location geographies are represented in
PURCHASE_LOCATION_GEO_ID
andPURCHASE_LOCATION_GEO_NAME
fieldsPURCHASER_PRIMARY_GEO_ID
andPURCHASER_PRIMARY_GEO_ID
can be used to join the data with Cybersyn’s geography tables such as theGEOGRAPHY_INDEX
based on theGEO_ID
field
Added purchase channels covering online and offline spend.
Purchase channels are reflected in the
CHANNEL
field in theCONSUMER_SPENDING_ATTRIBUTES
table and included in theVARIABLE_NAME
values in theCONSUMER_SPENDING_TIMESERIES
table
Note that data for “purchase location” geographies only includes offline spend.
11/20/23: Consumer Spending Foundation - Added aggregations by NAICS codes and by 4-5-4 retail months
Expanded the CONSUMER_SPENDING_TIMESERIES
and the CONSUMER_SPENDING_ATTRIBUTES
tables to include aggregations by NAICS (North American Industry Classification System) and 4-5-4 retail calendar months.
Deprecated MCC_CODE
, MCC_CODE_DESCRIPTION
and MART_VARIABLE
fields in the the CONSUMER_SPENDING_ATTRIBUTES
table and replaced them with AGGREGATION_TYPE
and AGGREGATION_VALUE
.The newly added fields can be used to filter to a desired level of aggregation including to the NAICS, MARTS Segment, and MCC levels.
Additionally, the following fields were added in anticipation of upcoming dataset expansions. Note that these fields only contain total values for now. Future iterations of the product will include more granular cuts of data.
Added
COMPANY_NAME
,PURCHASER_PRIMARY_GEO_ID
,PURCHASER_PRIMARY_GEO_NAME
,PURCHASE_LOCATION_GEO_ID
, andPURCHASE_LOCATION_GEO_NAME
toCONSUMER_SPENDING_TIMESERIES
Added
CHANNEL
toCONSUMER_SPENDING_ATTRIBUTES
11/20/23: Financial & Economic Essentials - Added 10 timeseries from FRED covering mortgage rates and additional CPI measures
Added 10 additional timeseries from FRED to the financial_fred_timeseries
and financial_fred_attributes
tables:
MORTGAGE15US: 15-Year Fixed Rate Mortgage Average in the United States
MORTGAGE30US: 30-Year Fixed Rate Mortgage Average in the United States
CUSR0000SEFV: Consumer Price Index for All Urban Consumers: Food Away from Home in U.S. City Average, Seasonally Adjusted
CUUR0000SEFV: Consumer Price Index for All Urban Consumers: Food Away from Home in U.S. City Average, Not Seasonally Adjusted
CUSR0000SETG01: Consumer Price Index for All Urban Consumers: Airline Fares in U.S. City Average, Seasonally Adjusted
CUUR0000SETG01: Consumer Price Index for All Urban Consumers: Airline Fares in U.S. City Average, Not Seasonally Adjusted
CUSR0000SS62031: Consumer Price Index for All Urban Consumers: Admission to Movies, Theaters, and Concerts in U.S. City Average, Seasonally Adjusted
CUUR0000SS62031: Consumer Price Index for All Urban Consumers: Admission to Movies, Theaters, and Concerts in U.S. City Average, Not Seasonally Adjusted
CUSR0000SEHB: Consumer Price Index for All Urban Consumers: Lodging Away from Home in U.S. City Average, Seasonally Adjusted
CUUR0000SEHB: Consumer Price Index for All Urban Consumers: Lodging Away from Home in U.S. City Average, Not Seasonally Adjusted
11/20/23: Government Essentials - Expanded American Community Survey (ACS) history for 1,400+ population variables since 2005 for ~500K geographies
Added historical data from the American Community Survey (ACS) to the AMERICAN_COMMUNITY_SURVEY_ATTRIBUTES
and AMERICAN_COMMUNITY_SURVEY_TIMESERIES
tables for over 1,400 population variables dating back to 2005 at the following geographic entity levels: country, states, counties, cities, zip codes, core-based statistical areas (CBSAs), census tracts, and census block groups. Example population variable additions include age, race, income, employment status, immigration status, and household status.
Data is as up to date as the latest ACS publication.
11/16/23: Weather & Environmental Essentials - Added 6 tables covering disaster declaration and National Flood Insurance Program (NFIP) insurance data from FEMA
Added Federal Emergency Management Agency (FEMA) data on federally-declared disasters in the United States, disaster recovery public programs, and the National Flood Insurance Program (NFIP) insurance policies and claims. Six new tables were included in the release:
FEMA_DISASTER_DECLARATION_INDEX
- Details for each federally-declared disaster (e.g. name, type, date, public assistance funding amounts). Table is unique byDISASTER_ID
.FEMA_DISASTER_DECLARATION_AREAS_INDEX
- Geographic entities (e.g. counties, cities) impacted by each federally-declared disaster. A disaster declaration can include multiple geographic locations.FEMA_MISSION_ASSIGNMENT_INDEX
- Work orders issued by FEMA since 2012 to other government agencies, supporting emergency response activation across the US (e.g. requesting transportation support from the US Department of Transportation during a hurricane).FEMA_NATIONAL_FLOOD_INSURANCE_PROGRAM_CLAIM_INDEX
- Details on National Flood Insurance Program (NFIP) claims including features of the insured property, information on the flood event precipitating the claim, the cost of the damage, and subsequent insurance payout amounts.FEMA_NATIONAL_FLOOD_INSURANCE_PROGRAM_POLICY_INDEX
- Details on National Flood Insurance Program (NFIP) policies including features of the insured property, building and contents insurance coverage, deductibles, rates, and policy durations.FEMA_REGION_INDEX
- Human readable names and details for each FEMA Region
11/13/23: Canada Government Essentials - Added 6 timeseries from Statistics Canada covering Canadian debt securities and real estate development
Added 6 new timeseries to the CANADA_STATCAN_TIMESERIES
and CANADA_STATCAN_ATTRIBUTES
tables from 3 of Statistics Canada’s underlying sources:
Bank of Canada: Government of Canada debt gross new issues, retirements and net new issues, and par values
Annual Survey of Service Industries: Real estate agents, brokers, and appraisers summary statistics (e.g. salaries, wages, operating revenue)
Canada Mortgage and Housing Corporation:
Absorptions and unabsorbed inventory, newly completed dwellings, by type of dwelling unit in select census metropolitan areas
Absorptions and unabsorbed inventory, newly completed dwellings, by type of dwelling unit in census agglomerations of 50K+
Housing starts, under construction and completions in select census metropolitan areas
Housing starts, under construction and completions in census agglomerations of 50K+
11/7/23: Cybersyn Public Domain Pro - Added point-in-time history for 45 tables
Added point-in-time history tables for the following Cybersyn datasets.
American Community Survey
Canada Statcan
Carbon Credit Purchases
Carbon Intensity
Company Index and Characteristics
Data Commons
Domain Index
FHFA
Geography Characteristics
Home Mortgage Disclosures
IMEI
IRS
NOAA Weather Stations and Metrics
OpenFIGI Security Index
PermID Security Index
Points of Interest (POIs)
Urban Crime
US Addresses
USDA
US Treasury
USPS Address Changes
11/6/23: Government Essentials - Added trade-related datasets from the US Dept Commerce's International Trade Administration
Added 5 new tables containing trade-related datasets from the US Department of Commerce's International Trade Administration (ITA). The ITA provides data on trade events, trade leads, export business service providers, ITA export assistance centers, and export restricted entities.
INTERNATIONAL_TRADE_ADMINISTRATION_BUSINESS_SERVICE_PROVIDERS_INDEX
provides a directory of US and foreign-based businesses providing services that many small- and medium-sized exporters require (e.g., legal, tax, consulting, market research, export management, travel facilitation services).INTERNATIONAL_TRADE_ADMINISTRATION_TRADE_EVENTS_INDEX
provides trade events (including conferences, webinars, trade shows, workshops, and more) for US businesses interested in selling their products and services overseas.INTERNATIONAL_TRADE_ADMINISTRATION_EXPORT_ASSISTANCE_CENTERS_INDEX
provides a directory of all of the International Trade Administration's (ITA) domestic and international export assistance centers and the areas they service.INTERNATIONAL_TRADE_ADMINISTRATION_EXPORT_SCREENED_ENTITIES_INDEX
lists parties from the Consolidated Screening List (CSL) for which the United States government maintains restrictions on certain exports, reexports, or transfers of items. These may be sanctioned individuals from the Department of Commerce, Department of State, Treasury, or Office of Foreign Assets Control.INTERNATIONAL_TRADE_ADMINISTRATION_TRADE_LEADS_INDEX
provides contract opportunities for US businesses selling their products and services overseas.
11/6/23: Consumer Spending Foundation - Added nominal value projections (dollar-level figures)
Expanded values in the CONSUMER_SPENDING_TIMESERIES
table to include nominal estimates. New measures include estimates for revenue ($), transactions (#), and average order value ($). These newly added variables build on the existing year-over-year (%) estimates for those measures already in the table.
Nominal values can be used to measure market share or compare average transaction amounts across retailers. Because these estimates are based on a panel of consumer spending, they are not meant to be projections of the entirety of US spending but they are accurate as a measure of relative spend. For example, the sum of all Chipotle spend will not equal the company's actual spend but Chipotle's market share relative to McDonald's should be correct.
The newly-added variables in the timeseries table include matching variables in the CONSUMER_SPENDING_ATTRIBUTES
table with the new variables being MEASURE
values of Revenue
, Transactions
, and AOV
.
11/3/23: Consumer Spending Foundation, Government Essentials - Added CALENDAR_INDEX
table
Added the CALENDAR_INDEX
table which compiles common calendars into a single table. Each calendar type has a unique CALENDAR_ID
, which allows users to select which calendar type they want to use. Individual periods within each calendar type include period start and end dates.
The CALENDAR_INDEX
currently includes regular calendar periods (days, weeks, months, quarters, and years) and 4-5-4 retail calendar periods (4-5-4 retail months, quarters, and years).
The 4-5-4 retail calendar is a standardized accounting and reporting calendar system used by many retailers, where each fiscal year is divided into 13 weeks, aiming to align with seasonal variations and facilitate more accurate financial comparisons.
11/3/23: Government Essentials - Added global agricultural commodity production and distribution data from the USDA
Added two tables sourced from the US Department of Agriculture's (USDA) Foreign Agricultural Service (FAS) which provides production, supply, and distribution data on agricultural commodities for both the United States and other producing and consuming countries since 1960.
us_department_of_agriculture_commodities_attributes
describes the production, supply, and distribution metrics tracked for each commodity by the USDA.us_department_of_agriculture_commodities_timeseries
table provides the reported metrics for each commodity and country.
October 2023
10/25/23 - Financial & Economic Essentials - Added detailed branch-level data and Summary of Deposits (SOD) data from the FDIC
The Summary of Deposits (SOD) data from the FDIC is an annual survey, capturing branch-level deposits as of June 30 for all FDIC-insured institutions, including U.S. branches of foreign banks.
The fdic_branch_locations_index
table provides details on FDIC-insured bank branches, including branch-specific location information as well as institution-level regulatory and insurance data.
The fdic_summary_of_deposits_attributes
table describes the deposit types tracked by the Summary of Deposits (SOD) survey.
The fdic_summary_of_deposits_timeseries
table provides the results of the annual Summary of Deposits (SOD) survey going back to 1994 for banks’ branch-level deposits.
10/19/2023: Government Essentials - Added US Federal Government Revenue Collections from the US Treasury Fiscal Data
The US Treasury provides a daily overview of net federal revenue collections from income tax deposits, customs duties, fees for government services, fines, and loan repayments. These collections undergo electronic and/or non-electronic processing, involving various channels such as mail, internet, banking, and over-the-counter transactions, all of which are comprehensively incorporated within this dataset.
The us_treasury_revenue_collections_timeseries
table provides daily net collections amounts broken down by tax category and processing channel. The us_treasury_revenue_collections_attributes
table details each collection method reported by the US Treasury.
10/18/23: US Insurance & Healthcare Provider Foundation - Added Form 5500 Schedule A Part 1 insurance data from the US Department of Labor.
Expanded the US Department of Labor data to include information found on Form 5500 Schedule A Part 1. The new table, us_department_of_labor_form_5500_broker_index
, provides commission and fee amounts received by a broker for an insurance policy. Additional information about the brokers in the table includes their address, classification as an insurance broker, as well as notes pertaining to the compensation disbursed to them.
The us_department_of_labor_form_5500_broker_index
can be joined to insurance carrier and policy information to individual Form 5500 filings, using INSURANCE_POLICY_ID
and ACK_ID
.
10/16/23: Financial & Economic Essentials - Added exchange rates from the Bank for International Settlements (BIS)
Due to the discontinuation of certain currency conversion pairs by the European Central Bank (ECB), our primary source for daily FX rates, we have sourced a number of these pairs from an alternative source, the Bank for International Settlements (BIS), for ongoing history. The Bank for International Settlements will be used to get data for the following currency conversion pairs after September 27, 2023: USD:AED, USD:ARS, USD:CLP, USD:COP, USD:DZD, USD:MAD, USD:PEN, USD:QAR, USD:SAR, USD:TWD, USD:UAH.
Full history from the Bank for International Settlements was added for the following currency pairs: USD:ALL, USD:AUD, USD:BAM, USD:BHD, USD:BND, USD:EUR, USD:GBP, USD:IRR, USD:ISK, USD:KWD, USD:KZT, USD:LKR, USD:MKD, USD:MUR, USD:NPR, USD:NZD, USD:OMR, USD:RSD, USD:RUB, USD:TND, USD:TTD, USD:UYU, USD:VEF, USD:XDR.
10/11/23: Government Essentials - Added population variables to the American Community Survey tables.
Expanded the american_community_survey_attributes
and american_community_survey_timeseries
tables to include additional population variables related to income, age, and educational attainment.
New series include Household Income in the Past 12 Months (Inflation-Adjusted), Educational Attainment for the Population 25 Years and Over, and Age of Householder By Household Income in the Past 12 Months (Inflation-Adjusted). These series are available by multiple breakdowns (ex. income, age, gender, etc.).
10/11/23: Consumer Spending Foundation - Added company-level estimates; Added COMPANY_INDEX
table
Added year-over-year estimates for company-level spend, transaction, and AOV to the CONSUMER_SPENDING_TIMESERIES
. Company-level information is identified with the company_id field.
The company_id is a unique identifier assigned by Cybersyn to each company and is joinable to the COMPANY_INDEX
, which provides company names and other helpful identifiers such as CIK, LEI, PermID, and more. Note that when the company_id is null, then the row represents data for all companies.
10/09/23: US Housing & Real Estate Essentials - Added U.S. Census Regions & Divisions
Expanded the geography_index
table to include U.S. Census Regions and Divisions.
Expanded the geography_hierarchy
table to include the relationships between U.S. Census regions and U.S. Census divisions; U.S. Census regions and U.S. states; and U.S. Census divisions and U.S. states.
Census regions include the United States Northeast, Midwest, etc. and census divisions include the United States Middle Atlantic, East North Central, etc.
10/8/23: SEC Filings - Added PERMID_SECURITY_INDEX
table to expand security coverage
Added one new table to our dataset,
PERMID_SECURITY_INDEX
, which includes security identifiers from Refinitiv’s PermID database. These are persistent identifiers for active and inactive securities across global asset classes. The table includes over 15K PermIDs for various securities.
This data can be merged to the 13F filings data (
SEC_HOLDING_FILING_ATTRIBUTES
) and can be mapped back to companies using theCOMPANY_SECURITY_RELATIONSHIPS
table.
10/06/23: Tech & Innovation Essentials - Added in repository of web domains plus included GitHub Archive & US Patents tables
Cleaned and aggregated over 300M domains in a single source to track the list of websites globally into new domain_index
table.
Added GitHub Archive and US Patents Grants) tables to the product, rebranded product from "IMEI Type Allocation Codes" to "Tech & Innovation Essentials."
10/2/23: Government Essentials - Added population data from the American Community Survey
Expanded our population dataset to include annual estimates from the American Community Survey (ACS) for 2021 and 2022 at multiple geographic levels in the United States.
September 2023
9/18/23: Financial & Economic Essentials - Added 6 timeseries from FRED covering core CPI and industry and commodity-specific PPI data
Added 6 new timeseries from FRED to the financial_fred_timeseries
and financial_fred_attributes
tables:
PPIFIS: Producer Price Index by Commodity: Final Demand
PCU4841214841212: Producer Price Index by Industry: General Freight Trucking, Long-Distance Truckload
PCU4841224841221: Producer Price Index by Industry: General Freight Trucking, Long-Distance Less Than Truckload
PCU3313133131: Producer Price Index by Industry: Alumina and Aluminum Production and Processing
WPU101707: Producer Price Index by Commodity: Metals & Metal Products: Cold Rolled Steel Sheet and Strip
CPILFESL: (CORE CPI) Consumer Price Index for All Urban Consumers: All Items Less Food & Energy in U.S. City Average
9/17/23: SEC Filings, LLM Training Essentials - Added COMPANY_INDEX
, COMPANY_CHARACTERISTICS
, and COMPANY_SECURITY_RELATIONSHIP
tables; added PermIds
Added three new tables:
The
COMPANY_INDEX
table aggregates commonly used company identifiers (i.e. CIKs, EINs, and LEIs) into a single a singlecompany_id
, which can be used across Cybersyn’s datasets as a unique identifier for corporate entities.The
COMPANY_SECURITY_RELATIONSHIP
table maps OpenFIGI and PermID securities (i.e. securities with multiple "levels" such as OpenFIGI FIGI ID and OpenFIGI Share Class ID) to the Company.The
COMPANY_CHARACTERISTICS
table includes categorical characteristics of a Company (e.g. industry, address, previous names). A characteristic may be temporal with start and end dates indicating the range for which the data is valid.
Added PermId securities published by Refinitiv.
9/15/23: Weather & Environmental Essentials - Added daily weather data from 80K+ weather stations across 180 countries; updated product name
New tables added:
NOAA_WEATHER_STATION_INDEX
contains metadata on the weather stations from the Global Historical Climatology Network daily (GHCNd) database, including mappings to Cybersyn’sGEO_ID
at the country-, state- and zip-level (where applicable).NOAA_WEATHER_METRICS_ATTRIBUTES
includes the daily weather variables tracked globally and their measurement details.NOAA_WEATHER_METRICS_TIMESERIES
provides the details of the daily global weather variables recorded at each weather station.
Updated product name from "Emissions & Environment Essentials" to "Weather & Environmental Essentials"
9/15/23: US Insurance & Healthcare Provider Foundation - Added healthcare provider emails; combined TELEPHONE
and TELEPHONE_EXTENSION
into one field; changed TELEPHONE
to array to accommodate numerous values
Added
EMAIL
field to thenppes_provider_addresses
table with provider emails per address.
Combined
TELEPHONE
andTELEPHONE_EXTENSION
from thenppes_provider_addresses
table into a single field,TELEPHONE
, and removed theTELEPHONE_EXTENSION
field.
Aggregated all values for
TELEPHONE
,FAX
, andEMAIL
that are associated with the same NPI and address into arrays in one row. Rows in thenppes_provider_addresses
table are now uniquely defined by NPI and full address.
9/15/23: US Points of Interest & Addresses, Government Essentials - Added FIPS 10-4 country codes and state abbreviations
Expanded the geography_characteristics
table to include mappings of FIPS 10-4 country codes and U.S. state abbreviations to country and state-level GEO_ID
s, respectively.
9/10/23: US Insurance & Healthcare Provider Foundation - Added table to relate taxonomy classifications to practitioners’ license numbers
Added the NPPES_PROVIDER_TAXONOMY_AND_LICENSE_NUMBERS
table that relates taxonomy classifications to practitioners’ license numbers. This table provides users the ability to filter for license numbers based on practitioners’ primary taxonomy.
9/7/23: SEC Filings - Added 13F filings to include data on quarterly investment fund managers’ holdings; added a securities index table based on OpenFIGI data
Expanded our dataset to include individual filings from 13F fund holding reports, which disclose the equity holdings of institutional investment managers. Added three new tables, SEC_HOLDING_FILING_INDEX
, SEC_HOLDING_FILING_ATTRIBUTES
and OPENFIGI_SECURITY_INDEX
.
SEC_HOLDING_FILING_INDEX
table contains metadata from individual 13F filings including filing date and filing organization. SEC_HOLDING_FILING_ATTRIBUTES
table includes securities' names, market value, number of shares held, and OpenFIGI IDs, which facilitate easier mapping and analysis to outside data sources. Table OPENFIGI_SECURITY_INDEX
contains an index of over 2M securities listed on OpenFIGI and can be joined with table SEC_HOLDING_FILING_ATTRIBUTES
using TOP_LEVEL_OPENFIGI_ID
- the unique identifier for each security in the two tables.
9/6/23: US Insurance & Healthcare Provider Foundation - Added contact information and broker payments to Form 5500 data
Added new fields to
us_department_of_labor_form_5500_filing_index
with Form 5500 contact information including name and phone numbers:ADMIN_SIGNED_NAME
,SPONSOR_SIGNED_NAME
,DIRECT_FILING_ENTITY_SIGNED_NAME
,ADMIN_PHONE_NUM
andSPONSOR_DIRECT_FILING_ENTITY_PHONE_NUM
.Added new fields to
us_department_of_labor_form_5500_policy_index
with payments to agents and brokers:COMMISSIONS_PAID_TO_BROKER
andFEES_PAID_TO_BROKER
.Removed ~10k rows from
us_department_of_labor_form_5500_policy_index
that had NULL values for each field as they filed no data around insurance policies from Form 5500 Schedule A.
August 2023
8/31/23: US Insurance & Healthcare Provider Foundation - Added deactivated NPI numbers to NPPES data
Added a new table, nppes_npi_index
, that contains information on when NPIs were first issued, deactivated, or reactivated - dating back to 2005. This table also includes a boolean flag to indicate if an NPI is currently active.
Note that while all NPIs appear in the nppes_npi_index
table, only actively registered NPIs as well as NPIs deactivated after August 1, 2023 appear in the nppes_practitioner_attributes
and nppes_organization_attributes
tables. This means the dataset does not include attribute-level data (names, type of providers, specialization) on providers with NPIs deactivated before August 2023.
8/27/23: US Points of Interest & Addresses, US Housing & Real Estate Essentials - Added points of interest data from Overture Maps Foundation
Added the point_of_interest_index
table, which includes names and categories for points of interest in the US. Each POI is uniquely identified by a POI_ID
.
To tie POIs to addresses, we added a new column, ADDRESS_ID
, to the us_addresses
table to uniquely identify each individual address. This column allows users to join addresses to POIs using the new point_of_interest_addresses_relationships
table with POI_ID
and ADDRESS_ID
as the join keys for the point_of_interest_index
table and us_addresses
table, respectively.
8/27/23: US Points of Interest & Addresses, US Housing & Real Estate Essentials - Added 7.2M new addresses, removed 49.8M duplicate addresses, deleted 1.2M addresses with Null
STREET
value
Added 7.2M new addresses covering points of interest from Overture Maps Foundation to the us_addresses
table.
Removed 49.8M addresses that were duplicative aside from minor variability in coordinates. Removed 1.2M rows from rows from the us_addresses
table where the STREET
value contained a string with value Null
.
8/27/23: US Points of Interest & Addresses, US Housing & Real Estate Essentials - Added country-level geospatial boundaries to the geography_characteristics
table
Added country-level geospatial boundaries to the geography_characteristics
table with data from Overture Maps Foundation.
8/13/23: SEC Filings - Added 8-K filings and exhibits for 10-Qs and 10-Ks
Expanded our coverage of SEC documents to include the full text of 8-K filings and associated exhibits. 8-K filings include company press releases, earnings releases, and other major corporate events.
Added the full text of exhibits for 10-K and 10-Q filings. Exhibit types include lists of subsidiaries, merger agreements, and material changes in financial conditions. Exhibits are denoted in the variable
and variable_name
columns (e.g. 10-K EX-21 Filing Text
).
Added the sec_document_id
column. This field is a combination of the ADSH (accession number) and the document type (e.g. 10-K). This serves as a unique identifier for each individual component that makes up a filing in cases when one or more exhibits are included in a filing.
8/11/23: Government Essentials, US Housing & Real Estate Essentials, US Addresses & Geographic Areas - Added geospatial boundaries data for territories in the US and Canada
The Census Bureau and Statistics Canada publish geospatial boundaries data for their territories at multiple geographic levels. We added a table geography_characteristics
with the boundary coordinates from the most recent releases in both WKT and GeoJSON formats. The table is joinable at different levels using Cybersyn's GEO_ID
. This GEO_ID
is compatible with all Cybersyn listings that have geographic identifiers. Currently, the geographic levels covered include:
State (US and Canada)
County (US only)
Census Tract (US only)
ZIP Code (US only)
Dissemination Area and Aggregate Dissemination Area (Canada only)
Census Division and Census Subdivision (Canada only)
Census Agglomeration and Census Agglomeration Part (Canada only)
Census Metropolitan Division and Census Metropolitan Division Part (Canada only)
8/10/23: Financial & Economic Essentials - Added crosswalk to FRED series IDs & 107 new series from GDP, Employment Situation, Housing Starts, and Residential Construction reports
Added a new table, financial_fred_variable_series_id_crosswalk
, that enables a join between Cybersyn’s variable and FRED’s unique series ID
Added new series from the following four reports:
Gross Domestic Product (data produced by the US Bureau of Economic Analysis)
Employment Situation (US Bureau of Labor Statistics)
New Residential Construction (US Department of Housing and Urban Development)
Quarterly Starts and Completions by Purpose and Design (US Department of Housing and Urban Development)
8/7/23: Government Essentials - Added text-based US government contracts data from SAM.gov
The US government publishes contract opportunities and proposals to do business with the federal government via the System for Award Management (sam.gov) for contracts and awards with a value of at least $25,000. The data goes back to January 2002 and includes metadata providing descriptions of government contracts and the corresponding awards granted for those contracts.
July 2023
7/31/23: Financial & Economic Essentials - Added release_name
and release_source
for better discoverability
release_name
: The collection, group of data, or report from which a time series originates. This column can be used as a filter to find related series.release_source
: The organization (e.g. FDIC, Federal Reserve) that FRED collects the data from.
Two columns were added to financial_fred_attributes
to provide better categorization and discoverability:
7/28/23: Canada Government Essentials - Removed the age_group
measure
and labour_force_statistic
columns
Legacy columns from the July 16, 2023 release were removed.
age_group
measure
labour_force_statistic
7/25/23: Financial & Economic Essentials - Added 124 time series from FRED covering a variety of economic data
124 additional time series added from FRED to the financial_fred_timeseries
and financial_fred_attributes
.
7/16/23: Canada Government Essentials - Added prices, output, and household income; schema changes
New datasets:
Household income and finances: Household income and consumption, household credit liabilities, and household savings rates
Prices and output: Core consumer price index, gross domestic product by industry group, and new housing price indices
Updated datasets:
StatCan archived a number of retail trade series and published new series. Old series are marked “Archived…” in the
report
field. The latest series have been included to replace these.
Schema changes:
The following columns in the
canada_statcan_attributes
view were updated. The deprecated columns will be removed on 7/28/2023.age_group
will be folded intodemographic_group
, which applies more broadly to age ranges, income groups, and household makeups (e.g., 35 to 44 years, lowest income quintile, elderly persons not in an economic family)measure
will be renamedreport
. Thereport
column displays the StatCan dataset from which the data originates (e.g. Consumer Price Index, New Housing Price Index)labour_force_statistic
will be renamedstatistic
. The new statistic column will provide the label for the specific economic metric that is being reported (e.g. Median After Tax Income, Number of families)
7/3/23: US Insurance & Healthcare Provider Foundation - Added Form 5500 insurer information
Added information from US Department of Labor Form 5500 filings about company benefit providers and the insurance/benefit plans they offer. New tables include us_department_of_labor_form_5500_filing_index
and us_department_of_labor_form_5500_policy_index
.
June 2023
6/14/23: SEC Filings - Added full text 10-Qs and 10-Ks
Added the full text of 10-K/Q filings. These are contained in the sec_report_text_attributes
table.
6/13/23: Financial & Economic Essentials - Added Bureau of Labor Statistics datasets
Added Consumer Price Index (CPI), Average Prices (AP), Job Openings and Labor Turnover Survey (JOLTS), State and Metro Area Employment , Hours, & Earnings (SAE), Local Area Unemployment Statistics (LAUS) from the Bureau of Labor Statistics.
6/1/23: Government Essentials - Added 3,000 new US zip codes from USPS and US Census
Using USPS address change data, we added 3,000 zip codes (mostly PO Box) to the
dc_geo_index
.
Using both the USPS address change and US Census Bureau data, we increased the coverage in
geography_relationships
table with 6,500 new zip and city relationships. We now map 86% of zip codes to a city.
May 2023
5/26/23: Financial & Economic Essentials - Added 5 global central bank policy rates
Central bank policy rates from Brazil, Canada, England, Mexico, and Japan added to fred_timeseries
and fred_attributes
5/19/23: Government Essentials - Updated product name from Cybersyn Data Commons to Cybersyn Government Essentials
Rebranded Cybersyn Data Commons as Cybersyn Government Essentials. We updated the naming conventions for schemas, tables, and column names to make them consistent across all of Cybersyn’s existing and future data products.
Cybersyn will continue to support and update your older version of the Data Commons tables.
5/19/23: US Addresses & Geographic Areas, GitHub Events - Added source data from National Address Database (NAD)
Added the National Address Database (NAD) as a source to increase our US address coverage:
Increased the coverage from 140 million addresses to more than 188 million.
There is now at least one address in more than 85% of zip codes, up from 74% previously.
Increased the portion of cities that are mapped to distinct IDs joinable to our other data sets from 24% to over 77%
Last updated