SEC Filings

Financial statements, press releases and fiscal calendars for US public companies; fund manager investment holdings

Overview

A subset of SEC Filings submitted by corporations, funds, and individuals. The dataset includes raw (plain) text, parsed and unparsed XBRL, and HTML. SEC Filings are available in both the Free and Enterprise products. Coverage varies between tiers, with Enterprise having additional forms and features, such as 10-K/Qs parsed into sections for easier input into LLMs. Cybersyn's company reference spine of ~100K public and private companies and Cybersyn's OpenFIGI and PermID security master are also included.

Data Sources, Attributes, Sample Queries

A detailed description of the data is available by source. Source pages include key attributes (e.g. geographic coverage, time granularity, history, entity level), release frequency, notes & methodologies, sample queries, and disclaimers applicable to each data source.

All Cybersyn products follow the EAV (entity, attributes, value) model with a unified schema. Entities are tangible objects (e.g. geography, company) that Cybersyn provides data on. All timeseries' dates and values that refer to the entity are included in a timeseries table. Descriptors of the timeseries are included in an attributes table. Data is joinable across all Cybersyn products that have a GEO_ID. Refer to Cybersyn Concepts for more details.

As with all Public Domain datasets, Cybersyn aims to release data on Snowflake Marketplace as soon as the underlying source releases new data. We check periodically for changes to the underlying source and, upon detecting a change, propagate the data to Snowflake Marketplace immediately. See our release process for more details.

Data Dictionary

πŸ“–pageData Dictionary

Releases & Changelog

New releases are included in our Enterprise product. Click here for more details.

10/8/23 - Added PERMID_SECURITY_INDEX table to expand security coverage
  • Added one new table to our dataset, PERMID_SECURITY_INDEX, which includes security identifiers from Refinitiv’s PermID database. These are persistent identifiers for active and inactive securities across global asset classes. The table includes over 15K PermIDs for various securities.

  • This data can be merged to the 13F filings data (SEC_HOLDING_FILING_ATTRIBUTES) and can be mapped back to companies using the COMPANY_SECURITY_RELATIONSHIPS table.

9/17/23 - Added COMPANY_INDEX, COMPANY_CHARACTERISTICS, and COMPANY_SECURITY_RELATIONSHIPS tables; added PermIds

Added three new tables:

  • The COMPANY_INDEX table aggregates commonly used company identifiers (i.e. CIKs, EINs, and LEIs) into a single a single company_id, which can be used across Cybersyn’s datasets as a unique identifier for corporate entities.

  • The COMPANY_SECURITY_RELATIONSHIPS table maps OpenFIGI and PermID securities (i.e. securities with multiple "levels" such as OpenFIGI FIGI ID and OpenFIGI Share Class ID) to the Company.

  • TheCOMPANY_CHARACTERISTICS table includes categorical characteristics of a Company (e.g. industry, address, previous names). A characteristic may be temporal with start and end dates indicating the range for which the data is valid.

Added PermId securities published by Refinitiv.

9/7/23 - Added 13F filings to include data on quarterly investment fund managers’ holdings; added a securities index table based on OpenFIGI data

Expanded our dataset to include individual filings from 13F fund holding reports, which disclose the equity holdings of institutional investment managers. Added three new tables, SEC_HOLDING_FILING_INDEX, SEC_HOLDING_FILING_ATTRIBUTES and OPENFIGI_SECURITY_INDEX.

SEC_HOLDING_FILING_INDEX table contains metadata from individual 13F filings including filing date and filing organization. SEC_HOLDING_FILING_ATTRIBUTES table includes securities' names, market value, number of shares held, and OpenFIGI IDs, which facilitate easier mapping and analysis to outside data sources. Table OPENFIGI_SECURITY_INDEX contains an index of over 2M securities listed on OpenFIGI and can be joined with table SEC_HOLDING_FILING_ATTRIBUTES using TOP_LEVEL_OPENFIGI_ID - the unique identifier for each security in the two tables.

8/13/23 - Added 8-K filings and exhibits for 10-Qs and 10-Ks

Expanded our coverage of SEC documents to include the full text of 8-K filings and associated exhibits. 8-K filings include company press releases, earnings releases, and other major corporate events.

Added the full text of exhibits for 10-K and 10-Q filings. Exhibit types include lists of subsidiaries, merger agreements, and material changes in financial conditions. Exhibits are denoted in the variable and variable_name columns (e.g. 10-K EX-21 Filing Text).

Added the sec_document_id column. This field is a combination of the ADSH (accession number) and the document type (e.g. 10-K). This serves as a unique identifier for each individual component that makes up a filing in cases when one or more exhibits are included in a filing.

6/14/23 Added full text 10-Qs and 10-Ks

Added the full text of 10-K/Q filings. These are contained in the sec_report_text_attributes table.

Disclaimers

The data in this dataset is sourced on the individual source pages. Links to provider terms, license, and disclaimers are provided where appropriate.

Cybersyn is not endorsed by or affiliated with any of these providers. Contact support@cybersyn.com for questions.

Last updated

Copyright Β© 2024 Cybersyn