SEC EDGAR 13F Holdings Pipeline
Pure Python (stdlib only) pipeline for fetching, parsing, and storing SEC EDGAR 13F institutional holdings filings into a SQLite database. No third-party dependencies.
Overview
A zero-dependency Python pipeline for institutional holdings data from SEC EDGAR. Fetches 13F filings, parses XML, and stores structured holdings data in SQLite — using only Python’s standard library.
Features
- Fetches 13F-HR filings from SEC EDGAR full-text search API
- Parses XML holdings data (issuer, CUSIP, value, shares, investment discretion)
- Stores to SQLite with proper schema and indexes
- Rate-limit compliant with SEC EDGAR fair access policy
- No pip installs required — pure
urllib,xml.etree,sqlite3
Use Cases
- Quantitative research on institutional positioning
- Building signal features from smart money flows
- Portfolio overlap analysis
- Historical holdings trend analysis
Tech Stack
Python stdlib · SQLite · SEC EDGAR API · XML parsing