Converts BDFR output into pretty human-readable text files

DownrightNifty/bdfr2text

bdfr2text

Converts output files from Bulk Downloader for Reddit (BDFR) into pretty, human-readable text files.

Issues and PRs are welcome.

Usage

$ git clone https://github.com/DownrightNifty/bdfr2text.git
$ cd bdfr2text
$ python3 bdfr2text.py INPUT_DIR OUTPUT_DIR

INPUT_DIR is the output directory of bdfr archive. Run python3 bdfr2text.py -h for the full list of options.

Only JSON or YAML (not XML) output from BDFR is supported. Converting YAML files requires PyYAML, which BDFR should already have installed. Otherwise, there are no dependencies.

Parsable output

The --parsable-out (-p) option makes the output parsable by escaping any of bdfr2text's own delimiters that appear in the Reddit posts. It replaces [ and ] with look-alike bracket characters, and --- with ┄.
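A minimal sketch of this kind of delimiter escaping, to show why it keeps the output unambiguous. This is not bdfr2text's actual code: the function name escape_delimiters is hypothetical, and the two full-width brackets (U+FF3B, U+FF3D) are an assumption about which look-alike characters are used. Only the --- to ┄ mapping comes from the description above.

```python
# Assumed replacement table. Only "---" -> "┄" is stated in the README;
# the full-width brackets are a guess at the look-alike characters.
ESCAPES = {
    "[": "\uFF3B",    # full-width left square bracket (assumption)
    "]": "\uFF3D",    # full-width right square bracket (assumption)
    "---": "\u2504",  # ┄, box drawings light triple dash horizontal
}


def escape_delimiters(text: str) -> str:
    """Replace delimiter characters in post text with look-alikes."""
    for raw, safe in ESCAPES.items():
        text = text.replace(raw, safe)
    return text


if __name__ == "__main__":
    print(escape_delimiters("see [this link] --- it is great"))
```

After escaping, any ASCII [ , ] or --- left in a file can only have been written by the converter itself, which is what makes delimiter-based searching reliable.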

Searching through output

--parsable-out makes the output text files searchable with your favorite programs. Personally, I use Sublime Text, which can search entire folders and supports regex. For example, you could use the following regex to search for the string "query" within Reddit comments (excluding metadata blocks): query(?=[^\]]+\[)
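The same search can be run without an editor. The sketch below applies the regex above with Python's re module to every text file under an output directory. The function names and the .txt extension are assumptions, and the sample string in the usage example is a made-up layout, not real bdfr2text output.

```python
import re
from pathlib import Path

# Regex from the README: "query" followed by text that reaches a "["
# before any "]", so matches inside a [ ... ] metadata block are skipped.
PATTERN = re.compile(r"query(?=[^\]]+\[)")


def search_text(text: str) -> list[int]:
    """Return the character offset of every match in one file's text."""
    return [m.start() for m in PATTERN.finditer(text)]


def search_dir(output_dir: str) -> dict[str, list[int]]:
    """Search every .txt file under output_dir (extension is assumed)."""
    hits = {}
    for path in Path(output_dir).rglob("*.txt"):
        offsets = search_text(path.read_text(encoding="utf-8"))
        if offsets:
            hits[str(path)] = offsets
    return hits


if __name__ == "__main__":
    # Hypothetical layout: bracketed metadata followed by comment text.
    sample = "[meta query]\nbody query text\n[meta two]\nlast"
    print(search_text(sample))
```

Because the lookahead needs a later [ to succeed, an occurrence of "query" after the last metadata block in a file is not matched.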
