eric
92f333fc48
sort sitemaps
2019-01-18 12:02:45 -05:00
eric
ee03d2d434
add hosts
2018-07-12 12:56:09 -04:00
eric
8dd1fb1822
remove doab author loader
...
now uses oai functionality
2018-04-16 13:44:10 -04:00
eric
e03fa239b4
revamp doab loading
...
- doab loading now done primarily by oai, no processing of csv.
- added pyoai and updated lxml
- doab ids or urls in ebook submission now handled by oai scrape
- doab_load_books removed
- doab_utils moved from Gluejar/DOAB
- licenses now recognizes OpenEdition
- new ebook type "online" will implement in UI after mobile launch;
ebooks now creaded for html contenttype
2018-04-07 17:11:36 -04:00
eric
b88d678058
add loading from sitemap list
2018-02-20 13:35:00 -05:00
Raymond Yee
ae6e852c67
update doab.json
2017-01-12 14:31:47 -08:00
eric
9dfe1fb927
add new json file
...
from
https://raw.githubusercontent.com/Gluejar/regluit/7a8e34d9bb7b04476831df
16eb7ae701f9c00612/bookdata/doab.json?token=AA4jMVwm5w-4_QDYCt-gg3MqBNWm
2g9aks5YSdbLwA%3D%3D
2016-12-02 15:48:27 -05:00
eric
1c52c42e60
doab author parsing and loader command
2016-11-29 15:37:02 -05:00
eric
83d9c7574f
new data
2016-10-28 14:39:48 -04:00
Raymond Yee
39209c786e
use doab.json from 569a7f3557
2016-10-26 11:09:40 -07:00
Raymond Yee
5e82fa81f0
committing doab.json from 441c203baa/doab.json
2016-10-24 10:00:58 -07:00
Raymond Yee
967bd2dcae
code can now load description, subjects and covers for the pdf files
2014-07-24 16:29:28 -07:00
Raymond Yee
0fad9dd102
code that is basically working in IPython notebook for loading work.description, edition.publication_date, work.subjects, and edition.publisher_name
2014-07-23 16:09:18 -07:00
Raymond Yee
2fac485f08
the doab.json resulting from loading all the ISBNs, as well as the language of the works, produced by 57e54e0d22
2014-07-03 09:55:38 -07:00
Raymond Yee
265420dd74
some code to load DOAB records...no code here yet for how I processed the DOAB records into json format yet.
2014-06-04 15:23:47 -07:00