More package author’s introduction, please access this link
Instead of loading everything at once into your RAM, you divide your data into chunks.
To quote author of the disk.frame package: “we go from”R can only deal with data that fits in RAM"
Connect from R to Wharton Research Data Services
to set up connection from R to WRDS (here)
library(RPostgres)
library(tidyverse)
# I've set up wrds connection before hand. # Please use your username and password here.
Comprehensive patent data can be found here
United States
NBER patent data or link
Search link for individual patent: link
Patent API
USPTO - United States patent and Trademark Office
Patent ranking by orgs
Bulk Data Storage System: repository for raw public bulk data
For Researcher
Patent Assignment Dataset details information of patent assignment since 1970 with schema and description and code
Pre-Grant Publications Data Download Tables with example code note that organizaiton here is different from Compustat and CRSP, hard to match.
Information can be found in CRSP/COMPUSTAT MERGED DATABASE GUIDE
Change Identifiers:
Ticker: can be reassign to another company - abbreviation used to uniquely identify publicly-traded shares of a stock
CUSIP: A company can have multiple CUSIPS due to structural changes.