Overview
Cybersyn Foundations is a suite of public domain datasets made available in one location on the Snowflake Marketplace. The product offers a foundational layer of data for any analysis.
Core Product Components
- Unified Schema: We create a single, unified schema across data products that balances the flexibility to accommodate arbitrarily shaped data with consistency in core tables. Any user who has used a Cybersyn Data Product before should feel oriented in a new dataset. Cybersyn Data Products are built around two concepts: entities and timeseries.
- Dataset Joinability: We create standardized index tables, including geography and company indices, that are joinable to all Cybersyn tables (where applicable).
- Data Release: Cybersyn consistently monitors each data source and updates the data product on Snowflake Marketplace automatically.
- Point-in-Time History: History tables create an auditable record of when and how data changed over time. Data can change because the underlying source restated historical values, Cybersyn backfilled older history, or a data vendor corrected a previously undetected error in its methodology. In each case, a full history of what data was available when allows for point-in-time backtesting and provides accountability and transparency that help clients understand changes in the underlying data.
Key Information
EAV Model: All Cybersyn products follow the EAV (entity, attribute, value) model with a unified schema. Entities are tangible objects (e.g. geography, company) that Cybersyn provides data on. All dates and values of the timeseries that refer to an entity are stored in a timeseries table, and descriptors of each timeseries are stored in an attributes table. Data is joinable across all Cybersyn products that have a GEO_ID. Refer to Cybersyn Concepts for more details.
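As a rough illustration of the EAV layout, the sketch below builds three tiny tables in an in-memory SQLite database and joins a timeseries table to its attributes table and to a geography index. It is not Cybersyn's actual schema: every table name, every column name other than GEO_ID, and all of the sample rows are made up for the example, and SQLite merely stands in for Snowflake.

```python
import sqlite3

# Hypothetical schema: only the GEO_ID join key comes from the documentation.
# Table names, other column names, and all rows are invented for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE geography_index (geo_id TEXT PRIMARY KEY, geo_name TEXT);
    CREATE TABLE example_attributes (
        variable TEXT PRIMARY KEY, variable_name TEXT, unit TEXT);
    CREATE TABLE example_timeseries (
        geo_id TEXT, variable TEXT, date TEXT, value REAL);

    INSERT INTO geography_index VALUES
        ('geo/A', 'Region A'), ('geo/B', 'Region B');
    INSERT INTO example_attributes VALUES
        ('pop_total', 'Total population', 'persons');
    INSERT INTO example_timeseries VALUES
        ('geo/A', 'pop_total', '2022-12-31', 100.0),
        ('geo/A', 'pop_total', '2023-12-31', 110.0),
        ('geo/B', 'pop_total', '2023-12-31', 50.0);
""")


def latest_values(conn, variable):
    """Return the most recent observation of `variable` for each entity.

    The timeseries table supplies dates and values, the attributes table
    describes the variable, and the geography index names the entity.
    """
    query = """
        SELECT g.geo_name, a.variable_name, a.unit, t.date, t.value
        FROM example_timeseries AS t
        JOIN example_attributes AS a ON a.variable = t.variable
        JOIN geography_index AS g ON g.geo_id = t.geo_id
        WHERE t.variable = ?
          AND t.date = (
              SELECT MAX(t2.date)  -- ISO dates sort correctly as text
              FROM example_timeseries AS t2
              WHERE t2.geo_id = t.geo_id AND t2.variable = t.variable)
        ORDER BY g.geo_name
    """
    return conn.execute(query, (variable,)).fetchall()


rows = latest_values(conn, "pop_total")
for row in rows:
    print(row)
```

Because every entity-level table in this sketch carries the same `geo_id` key, a second hypothetical product could be joined to the same geography index without any remapping, which is the property the unified schema is meant to provide.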
Restatements: Timeseries tables, by default, contain the latest published version of all releases. Point-in-time tables maintain pre-restatement values.
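The difference between the latest published value and a point-in-time value can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the layout of Cybersyn's history tables: the `HistoryRow` fields (in particular `published_on`), the `value_as_of` helper, and the sample rows are all hypothetical.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass(frozen=True)
class HistoryRow:
    series_id: str      # hypothetical identifier for one timeseries
    obs_date: date      # the period the value describes
    value: float
    published_on: date  # hypothetical: when this version became available


# Invented sample history: the 2023-03-31 observation is later restated.
HISTORY = [
    HistoryRow("s1", date(2023, 3, 31), 2.0, date(2023, 4, 15)),
    HistoryRow("s1", date(2023, 3, 31), 2.4, date(2023, 7, 15)),  # restatement
    HistoryRow("s1", date(2023, 6, 30), 3.1, date(2023, 7, 15)),
]


def value_as_of(
    history: List[HistoryRow], series_id: str, obs_date: date, as_of: date
) -> Optional[float]:
    """Return the value for `obs_date` as it was known on `as_of`.

    Only versions published on or before `as_of` are considered; among
    those, the most recently published one wins. Returns None if nothing
    had been published yet.
    """
    candidates = [
        row for row in history
        if row.series_id == series_id
        and row.obs_date == obs_date
        and row.published_on <= as_of
    ]
    if not candidates:
        return None
    return max(candidates, key=lambda row: row.published_on).value


before = value_as_of(HISTORY, "s1", date(2023, 3, 31), date(2023, 5, 1))
after = value_as_of(HISTORY, "s1", date(2023, 3, 31), date(2023, 8, 1))
print(before, after)
```

A backtest that queries with an `as_of` date before the restatement sees the pre-restatement value, while a query with a later `as_of` date sees the restated one, which matches what a default timeseries table would show.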
As with all Public Domain datasets, Cybersyn aims to release data on Snowflake Marketplace as soon as the underlying source releases new data. We check periodically for changes to the underlying source and, upon detecting a change, propagate the data to Snowflake Marketplace immediately. See our release process for more details.
For information on key dataset-specific attributes, visit our Source-based documentation.
Primary Use Cases
The breadth of our data allows for wide-ranging applications. A few highlights include:
Macroeconomic Indicators & Trends
Cybersyn offers broad coverage of US & international financial and economic indicators, giving a real-time view into the world's economy.
Market and Competitor Analysis
Through key data sources like SEC filings and earnings transcripts, built on top of a standardized company index, Cybersyn data supports market and competitor analysis through the lens of publicly available filings.
Errata & Future Improvements
We note known issues and planned future improvements. If you would like to submit a bug report or feature request, email us at snowflake-public-data@snowflake.com.
Disclaimers
Details on the data sources for each table are available in our Source-based documentation pages as well as the Data Catalog. Links to provider license, terms and disclaimers are provided where appropriate.
Cybersyn is not endorsed by or affiliated with any of these providers. Contact snowflake-public-data@snowflake.com for questions.