rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc.
You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
Go to file
phiresky d10113668a don't care about 32bit linux 5 years ago
.vscode pass around config object 5 years ago
ci remove pcre stuff 5 years ago
exampledir tesseract single threaded 5 years ago
src tesseract single threaded 5 years ago
.gitignore pass around config object 5 years ago
.travis.yml don't care about 32bit linux 5 years ago
Cargo.lock sqlite3 bundled 5 years ago
Cargo.toml sqlite3 bundled 5 years ago
README.md sqlite3 bundled 5 years ago
rustfmt.toml initial working version 5 years ago

README.md

rga - ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc

Linux build status Crates.io

similar:

todo

  • jpg adapter (based on object classification / detection (yolo?)) for fun
  • 7z adapter (couldn't find a nice to use rust library)

considerations

  • matching on mime (magic bytes) instead of filename
  • allow per-adapter configuration options

Setup

rga should compile with stable Rust. To install it, simply run

apt install build-essential pandoc poppler-utils
cargo install ripgrep_all

rga --help

Some rga adapters run external binaries

Development

To enable debug logging:

export RUST_LOG=debug
export RUST_BACKTRACE=1

Also rember to disable caching with --rga-no-cache or clear the cache in ~/.cache/rga to debug the adapters.