rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc.
You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
Go to file
phiresky d0a1f8962b doc 5 years ago
.vscode pass around config object 5 years ago
ci fixes 5 years ago
exampledir tesseract single threaded 5 years ago
src doc 5 years ago
.gitignore pass around config object 5 years ago
.travis.yml skip arm build for now 5 years ago
Cargo.lock sqlite3 bundled 5 years ago
Cargo.toml sqlite3 bundled 5 years ago
README.md doc 5 years ago
rustfmt.toml initial working version 5 years ago

README.md

rga - ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc

rga is a tool to recursively search for text in many different types of files. It is based on the awesome ripgrep.

Linux build status Crates.io

Future Work

  • photograph adapter (based on object classification / detection (yolo?)) for fun, based on something like this. Tried, but very hard to integrate (especially state of the art approaches).
  • 7z adapter (couldn't find a nice to use rust library)

considerations

  • matching on mime (magic bytes) instead of filename
  • allow per-adapter configuration options

Setup

rga should compile with stable Rust. To install it, simply run

apt install build-essential pandoc poppler-utils
cargo install ripgrep_all

rga --help

Some rga adapters run external binaries

Development

To enable debug logging:

export RUST_LOG=debug
export RUST_BACKTRACE=1

Also rember to disable caching with --rga-no-cache or clear the cache in ~/.cache/rga to debug the adapters.

Similar tools