26 lines
1.1 KiB
Markdown
26 lines
1.1 KiB
Markdown
# PatentSlurp
|
|
Patent slurper for Dr Lars Hass, LUMS
|
|
|
|
# TODO
|
|
|
|
1) Add stripper for redundant xml tags
|
|
2) Harvest below data from Google dumps 2001-2015:
|
|
-------------------------------------------------------------------------------
|
|
storage display value
|
|
variable name type format label variable label
|
|
-------------------------------------------------------------------------------
|
|
sta str2 %2s assg/state
|
|
cnt str3 %3s assg/country
|
|
assgnum byte %8.0g assg/assignee seq. number (imc)
|
|
cty str72 %72s assg/city
|
|
pdpass long %12.0g Unique assignee number
|
|
ptype str1 %9s patent type
|
|
patnum long %12.0g patent number
|
|
-------------------------------------------------------------------------------
|
|
3) Compare data with NBER data (http://eml.berkeley.edu/~bhhall/NBER06.html)
|
|
4) ...
|
|
|
|
# Useful Tools
|
|
|
|
http://codebeautify.org/xmlviewer
|
|
http://www.regexr.com/
|