Commit Graph

7194 Commits (a6b02d387ecc14e467494589bedd49d7a2f54779)

Author SHA1 Message Date
Eric Hellman fc3037ec4a
Merge pull request #893 from Gluejar/maintenance2020
define default stapled
2020-07-30 19:58:45 -04:00
eric cc43bb0c51 define default stapled 2020-07-30 19:45:10 -04:00
Eric Hellman 8a87b00c08
Merge pull request #892 from Gluejar/maintenance2020
athabasca and usu
2020-07-30 18:06:09 -04:00
eric 2f9f6be0c6 add usu harvest 2020-07-30 17:52:08 -04:00
eric affecd28b0 add Athabasca 2020-07-30 17:34:45 -04:00
Eric Hellman d1f00cc546
Merge pull request #891 from Gluejar/maintenance2020
bloomsbury harvest
2020-07-30 17:00:05 -04:00
eric f793914279 add bloomsbury harvest 2020-07-30 16:48:44 -04:00
eric 9d98b2a0cb change user_agent for single dl, too 2020-07-30 16:47:17 -04:00
eric ec5996c0be add flage to strip intersitial header pages 2020-07-30 16:46:27 -04:00
Eric Hellman 5fd4c1d790
Merge pull request #890 from Gluejar/maintenance2020
add pulp harvest
2020-07-30 13:36:57 -04:00
eric c3d317da19 add pulp harvest 2020-07-30 13:22:23 -04:00
eric 462e097965 one time use 2020-07-30 13:21:44 -04:00
Eric Hellman 284effbb0e
Merge pull request #889 from Gluejar/maintenance2020
remove duplicate chaps from nomos
2020-07-30 11:31:49 -04:00
eric e4a34a0ba5 mgmt command to clear nomos 2020-07-30 11:21:22 -04:00
eric a4191a99d0 omit duplicates in nomos harvest 2020-07-30 10:36:53 -04:00
Eric Hellman 4280ffbe83
Merge pull request #888 from Gluejar/maintenance2020
misc cleanup
2020-07-29 20:00:17 -04:00
eric 64b03fd40f add OAPEN "harvest" 2020-07-29 19:52:32 -04:00
eric c0505f299b email-change reversion? 2020-07-29 15:57:07 -04:00
eric 8052fde357 some delinting 2020-07-29 15:38:58 -04:00
eric 7a6332d641 fix card-declined error for anon user 2020-07-29 15:26:35 -04:00
Eric Hellman 281a0e5848
Merge pull request #887 from Gluejar/maintenance2020
Exception handling, single ebook harvest
2020-07-29 14:17:41 -04:00
eric 6566afd92f support single ebook harvest 2020-07-29 13:34:11 -04:00
eric 73b863450e handle RecursionError 2020-07-29 12:51:14 -04:00
eric 1c380b0e9f add connection refused handling in get_soup 2020-07-29 11:52:45 -04:00
Eric Hellman e5371b5e21
Merge pull request #886 from Gluejar/maintenance2020
add harvests
2020-07-28 21:07:52 -04:00
eric b25b269a45 add springer harvest 2020-07-28 20:59:31 -04:00
eric 1f2b223c0f add frontiersin harves 2020-07-28 20:59:13 -04:00
eric 01f7273023 add nomos harvest 2020-07-28 20:58:45 -04:00
eric 066b81fb74 add digitalis harvest 2020-07-28 20:58:25 -04:00
eric 19d39cf4a6 add code to deal with ebooks already harvested from different source 2020-07-28 20:57:46 -04:00
Eric Hellman 68612ab55d
Merge pull request #885 from Gluejar/maintenance2020
add ksp.kit.edu harvest
2020-07-28 10:00:44 -04:00
eric b799b2a4c9 increase harvest limit to 500 2020-07-28 09:27:40 -04:00
eric bf73124250 deal with no head 2020-07-27 20:39:45 -04:00
eric 74005584d0 add kit.edu 2020-07-27 19:06:15 -04:00
Eric Hellman b2a8f9fc8c
Merge pull request #884 from Gluejar/maintenance2020
harvest for degruyter and transcript
2020-07-27 18:08:58 -04:00
eric a14a94aba1 fix jbe condition 2020-07-27 17:54:16 -04:00
eric e306f319ce add transcript verlag harvest 2020-07-27 17:53:59 -04:00
eric 5932bc09ed allow harvest to harvest multiple ebooks 2020-07-27 17:53:32 -04:00
eric 26e32a4738 sometimes there's no contenttype header! 2020-07-27 17:50:21 -04:00
eric 2f28b32fbf also use disposition from contenttyper 2020-07-27 17:49:04 -04:00
eric dd76c112e9 fix degruyter harvest 2020-07-27 17:47:53 -04:00
Eric Hellman 9fd50192e3
Merge pull request #883 from Gluejar/maintenance2020
add error handling for doab 404s
2020-07-26 16:15:26 -04:00
eric 15424eaf4d add error handling for doab 404s 2020-07-26 16:06:33 -04:00
Eric Hellman 0d447fe583
Merge pull request #882 from Gluejar/maintenance2020
improve handling when G doesn't return an item with same isbn
2020-07-24 13:38:19 -04:00
eric d86ce969b8 improve handling when G doesn't return an item with same isbn 2020-07-24 13:00:08 -04:00
Eric Hellman a2d295ce9b
Merge pull request #881 from Gluejar/maintenance2020
bugfix
2020-07-23 16:15:20 -04:00
eric 1a8813832e bugfix 2020-07-23 15:48:11 -04:00
Eric Hellman 57823a719b
Merge pull request #880 from Gluejar/maintenance2020
refactor harvest.py
2020-07-23 11:42:19 -04:00
eric eb6ca2d570 refactor harvest.py
also don't remake ebooks
2020-07-23 10:42:34 -04:00
Eric Hellman 9b871ae7ab
Merge pull request #879 from Gluejar/maintenance2020
doab and harvest
2020-07-22 19:47:21 -04:00