Commit graph

142 commits

Author SHA1 Message Date
Karl Voit
043c3ea3e8 fixed regex warnings by adding stringprefixes
according to https://docs.python.org/3/reference/lexical_analysis.html#string-and-bytes-literals
2020-06-17 19:24:28 +02:00
Karl Voit
3521a853c9 added Konica Minolta scan filename timestamp 2 2020-06-17 18:25:48 +02:00
Karl Voit
b8b7970327 simplified FH pattern 2020-06-05 11:52:17 +02:00
Karl Voit
3474ae846f simplified Presse pattern 2020-06-05 11:46:14 +02:00
Karl Voit
06bd08e10c added Emacs gif-screencast pattern 2020-06-05 11:44:59 +02:00
Karl Voit
71e327bcf6 added Konica Minolta scan filename timestamp 2020-05-29 16:28:44 +02:00
Karl Voit
d7fa57777b README: added organize as alternative 2020-05-27 22:23:20 +02:00
Karl Voit
9514933178 added BHAK Anwesenheitsbestaetigung 2020-03-05 15:12:36 +01:00
Karl Voit
db84ea05fd README: Example section 2020-03-04 16:46:58 +01:00
Karl Voit
7ea74a2e8e added newspaper bill 2020-03-04 16:24:49 +01:00
Karl Voit
644cdb9da4 README: Extending with your own regular expressions 2020-03-01 14:36:24 +01:00
Karl Voit
76998327f0 removed old comments + build_string_via_indexgroups() 2020-03-01 14:21:09 +01:00
Karl Voit
cb1f43bfd7 moving from hard coded RegEx index to named groups: finished but with outcommented old info 2020-03-01 14:15:09 +01:00
Karl Voit
ab16535991 moving from hard coded RegEx index to named groups (ongoing) 2020-02-29 23:40:17 +01:00
Karl Voit
9b88d4852b moving from hard coded RegEx index to named groups (ongoing) 2020-02-29 22:56:50 +01:00
Karl Voit
6d043c8d2e moving from hard coded RegEx index to named groups (ongoing) 2020-02-29 19:14:27 +01:00
Karl Voit
c9ffea1e64 moving from hard coded RegEx index to named groups (ongoing) 2020-02-29 17:15:19 +01:00
Karl Voit
2e531be5c2 test_get_datetime_string_from_named_groups() and test_get_datetime_description_extension_filename() 2020-02-29 17:14:01 +01:00
Karl Voit
209534a397 added patterns for smart recorder (Android) 2020-02-29 11:48:57 +01:00
Karl Voit
fc90b97c4e README: added link to fs-curator 2020-02-20 09:53:11 +01:00
Karl Voit
1299de9ae9 NEWSPAPER1 pattern
Die Presse
2019-12-04 10:49:48 +01:00
Karl Voit
5f6328e2c1 added signal attachment pattern 2019-11-23 16:12:05 +01:00
Karl Voit
cf4e7a171e "Could not read PDF content": warning->info 2019-11-23 16:11:42 +01:00
Karl Voit
af90a2a2d2 README: fixed GitHub bug with VERSE environments 2019-10-19 16:02:44 +02:00
Karl Voit
2525a0dbf2 bugfixes for info.json + its support for ORF TVthek 2019-10-19 15:27:36 +02:00
Karl Voit
21a505eee3 added derive_new_filename_from_json_metadata handling for YouTube-dl 2019-10-19 14:10:45 +02:00
Karl Voit
0dbdc168ca re-ordered function definitions 2019-10-19 12:53:09 +02:00
Karl Voit
207728809d fixed issue with manually entered URL parsing 2019-10-19 12:15:24 +02:00
Karl Voit
49b1b6aba1 git ignore virtualenv 2019-10-19 11:28:41 +02:00
Karl Voit
9a0499b90f added PDF file patterns for Boox Max 2 exports 2019-10-10 13:44:30 +02:00
Karl Voit
9c7ff7f86a removed pytest call shell script and added misc things to gitignore 2019-10-10 13:09:09 +02:00
Karl Voit
a5b9d45865 Appended ORF MediathekView pattern variant with _sd_ 2019-09-30 10:10:48 +02:00
Karl Voit
c566b1d9e6 tests: added test_film_url_regex 2019-09-30 10:10:29 +02:00
Karl Voit
aaff6f253f adapted changed FILM_URL_REGEX; improved debugging and help texts 2019-09-21 10:35:41 +02:00
Karl Voit
5fc36d3e69 updated MEDIATHEKVIEW_LONG_WITH_DETAILED_TIMESTAMPS_REGEX
which now may also contain characters (not just digits) in some parts I
don't understand yet.
2019-09-03 14:23:51 +02:00
Karl Voit
e86f33a98f disabled size plausibility unit tests
because feature was disables
2019-09-03 14:23:05 +02:00
Karl Voit
1029d17146 added minimum duration check for plausibility check 2019-08-26 10:48:45 +02:00
Karl Voit
530d945ce1 added workaround for salary PDF files
PyPDF2 doesn't support new PDF encryption id:2019-05-24-guessfilename-salary
2019-05-24 17:32:53 +02:00
Karl Voit
1c65c523eb added handling for oekostrom bills 2019-05-05 17:16:47 +02:00
Karl Voit
40a010f6a6 added support for Android Bokeh photographs to IMG_INDEXGROUPS 2019-03-10 12:18:26 +01:00
Karl Voit
7c411ba4e6 adding pattern for MediathekView v13 2018-11-14 10:56:36 +01:00
Karl Voit
f37855945d fix for previous pattern 2018-11-01 22:20:18 +01:00
Karl Voit
fabfc6d29a added ORF Mediathek pattern when original filename is missing 2018-11-01 11:27:25 +01:00
Karl Voit
9650e813c3 updated download URL format 2018-07-06 08:32:14 +02:00
Karl Voit
0ee2ebf32c also accept http URLs (instead of https only) 2018-06-16 11:42:33 +02:00
Karl Voit
09bcc1acb5 added MEDIATHEKVIEW_RAW_REGEX_STRING
for raw ORF MediathekView downloads as a fall-back when wget/curl
download has to replace malfunctioning MediathekView
2018-06-15 21:12:00 +02:00
Karl Voit
085cbe156e fixed issue with ORF MediathekView chunk that spans over midnight 2018-06-10 22:43:47 +02:00
Karl Voit
890e70785f added plausibility size checks for ORF 2018-06-09 18:08:36 +02:00
Karl Voit
f079077dc7 added MEDIATHEKVIEW_LONG_WITHOUT_DETAILED_TIMESTAMPS_REGEX
for ORF chunks without detailed time-stamps but with quality indicators
2018-06-09 16:02:46 +02:00
Karl Voit
9a9fac31d5 added MEDIATHEKVIEW_SHORT_REGEX and manual/interactive fall-back handling for truncated ORF file names 2018-06-09 14:56:20 +02:00