2
0
mirror of https://github.com/qarmin/czkawka synced 2024-10-31 21:20:19 +00:00
Go to file
2020-12-27 12:11:49 +01:00
.github/workflows Fix appimage crash by adding PNG version of icon (#126) 2020-12-21 20:32:59 +01:00
czkawka_cli Release 2.0.0 2020-12-23 23:12:50 +01:00
czkawka_core Add cache support for similar images (#139) 2020-12-27 10:56:26 +01:00
czkawka_gui Add selecting images with it's size (#138) 2020-12-26 12:53:40 +01:00
data/icons Update Icon (#120) 2020-12-17 16:45:55 +01:00
instructions Add cache support for similar images (#139) 2020-12-27 10:56:26 +01:00
misc Move Snap to root folder 2020-12-27 12:11:49 +01:00
pkgs Remove GPL AUR package from repo 2020-12-23 21:23:55 +01:00
snap Move Snap to root folder 2020-12-27 12:11:49 +01:00
.gitignore Release version 1.5.1 2020-12-08 17:51:18 +01:00
.rustfmt.toml Removed almost all occurrences of println from core 2020-09-11 15:52:06 +02:00
Cargo.lock Add cache support for similar images (#139) 2020-12-27 10:56:26 +01:00
Cargo.toml Remove orbtk frontend (#119) 2020-12-16 11:20:42 +01:00
Changelog.md Release 2.0.0 2020-12-23 23:12:50 +01:00
LICENSE Add new windows dark theme (#125) 2020-12-21 18:22:59 +01:00
README.md Update instructions 2020-12-23 19:35:30 +01:00

com github qarmin czkawka

Czkawka is written in Rust, simple, fast and easy to use app to remove unnecessary files from your computer.

Features

  • Written in memory safe Rust
  • Amazingly fast - due using more or less advanced algorithms and multithreading support
  • Free, Open Source without ads
  • CLI frontend, very fast to automate tasks
  • GUI GTK frontend - uses modern GTK 3 and looks similar to FSlint
  • Light/Dark theme match the appearance of the system(Linux only)
  • Saving results to a file - allows reading entries found by the tool easily
  • Rich search option - allows setting absolute included and excluded directories, set of allowed file extensions or excluded items with * wildcard
  • Image previews to get quick view at the compared photos
  • Multiple tools to use:
    • Duplicates - Finds duplicates basing on size(fast), hash(accurate), first 1MB of hash(moderate)
    • Empty Folders - Finds empty folders with the help of advanced algorithm
    • Big Files - Finds provided number of the biggest files in given location
    • Empty Files - Looks for empty files across disk
    • Temporary Files - Allows finding temporary files
    • Similar Images - Finds images which are not exactly the same(different resolution, watermarks)
    • Zeroed Files - Find files which are filled with zeros(usually corrupted)
    • Same Music - Search for music with same artist, album etc.
    • Invalid Symbolic Links - Shows symbolic links which points to non-existent files/directories

Czkawka

Requirements

If you are using Windows or Mac binaries, there is no specific requirements.
Same with Appimage on Linux(except having system 18.04+ or similar).
But compiled GUI binaries on Linux or compiling it on your own os require to install this packages:

Ubuntu/Debian

sudo apt install cargo libgtk-dev

Fedora/CentOS

sudo yum install cargo gtk3-devel glib2-devel

Usage

Precompiled binaries

Precompiled binaries are available here - https://github.com/qarmin/czkawka/releases/.
If the app does not run when clicking at a launcher, run it through a terminal.

Appimage

Appimage files are available in release page - https://github.com/qarmin/czkawka/releases/

For now looks that there is a bug with this format, because it doesn't allow to open two images/files at once.

Cargo

The easiest method to install Czkawka is to use Cargo command(you must have installed GTK libraries in OS)

cargo install czkawka_gui
cargo install czkawka_cli

You can update package by typing same command.

Snap

Snap also are available, but there is no access to

sudo snap install czkawka

Flatpak

Maybe someday

Debian/Ubuntu repository and PPA

Tried to set up it, but for now I have problems described in this issue

https://salsa.debian.org/rust-team/debcargo-conf/-/issues/21

AUR - Arch Linux Package (unofficial)

Czkawka is also available in Arch Linux's AUR from which it can be easily downloaded and installed on the system.

yay -Syu czkawka-git

This is unofficial package, so new versions will not be always available.

Devel versions

Artifacts from each commit you can also download here - https://github.com/qarmin/czkawka/actions

Compilation

Requirements

Rust 1.46 - probably lower also works fine(1.40 is needed by GTK)
GTK 3.22 - for GTK backend

For now only Linux (and maybe also macOS) is supported

  • Install requirements for GTK
apt install -y libgtk-3-dev

Compilation from source

  • Download the source
git clone https://github.com/qarmin/czkawka.git
cd czkawka
  • Run GTK GUI
cargo run --bin czkawka_gui

For Linux-to-Windows cross-building instruction look at the CI. GUI GTK

  • Run CLI(this will print help with a lot of examples)
cargo run --bin czkawka_cli

CLI

Benchmarks

Since Czkawka is written in Rust and aims to be a faster alternative to FSlint (written in Python), we need to compare the speed of these tools.

I tested it on SSD Disk 256GB GoodRam and i7 4770 CPU.

I prepared a directory and performed a test without any folder exceptions(I removed all directories from FSlint and Czkawka from other tabs than Include Directory) which contained 229868 files which took 203,7 GB and 13708 duplicates files in 9117 groups which took 7.90 GB.

Minimum file size to check I set to 1 KB on all programs

App Executing Time
FSlint 2.4.7 (Second Run) 86s
Czkawka 1.4.0 (Second Run) 12s
DupeGuru 4.0.4 (Second Run) 28s

I used Mprof for checking memory usage FSlint and Dupeguru, for Czkawka I used Heaptrack. To not get Dupeguru crash I checked smaller directory with 217986 files and 41883 folders.

App Idle Ram Max Operational Ram Usage Stabilized after search
FSlint 2.4.7 62 MB 84 MB 84 MB
Czkawka 1.4.0 9 MB 66 MB 32 MB
DupeGuru 4.0.4 80 MB 210 MB 155 MB

Similar Images which check 332 files which takes 1,7GB

App Scan time
Czkawka 1.4.0 58s
DupeGuru 4.0.4 51s

Similar Images which check 1421 image files which takes 110,1MB

App Scan time
Czkawka 1.4.0 25s
DupeGuru 4.0.4 92s

So still is a big room for improvements.

Comparsion other tools

Czkawka FSlint DupeGuru
Language Rust Python Python/Objective C
OS Linux, Windows, Mac(only CLI) Linux Linux, Windows, Mac
Framework GTK 3 (Gtk-rs) GTK 2 (PyGTK) Qt 5 (PyQt)/Cocoa
Ram Usage Low Medium Very High
Duplicate finder X X X
Empty files X X
Empty folders X X
Temporary files X X
Big files X
Similar images X X
Zeroed Files X
Music duplicates(tags) X X
Invalid symlinks X X
Installed packages X
Invalid names X
Names conflict X
Bad ID X
Non stripped binaries X
Redundant whitespace X
Multiple languages(po) X X
Project Activity High Very Low High

Contributions

Contributions to this repository are welcome.

You can help by creating:

  • Bug report - memory leaks, unexpected behavior, crashes
  • Feature proposals - proposal to change/add/delete some features
  • Pull Requests - implementing a new feature yourself or fixing bugs, but you have to pay attention to code quality. If the change is bigger, then it's a good idea to open a new issue to discuss changes.
  • Documentation - There is insruction which you can improve.

The code should be clean and well formatted (Clippy and fmt are required in each PR).

Name

Czkawka is a Polish word which means hiccup.

I chose this name because I wanted to hear people speaking other languages pronounce it.

This name is not as bad as it seems, because I was also thinking about using words like żółć, gżegżółka or żołądź, but I gave up on these ideas because they contained Polish characters, which would cause difficulty in searching for the project.

At the beginning of the program creation, if the response concerning the name was unanimously negative, I prepared myself for a possible change of the name of the program, but the opinions were extremely mixed.

License

Code is distributed under MIT license.

Icon is created by jannuary and licensed CC-BY-4.0.

Windows dark theme is used from AdMin repo - https://github.com/nrhodes91/AdMin with MIT license

Program is completely free to use.

"Gratis to uczciwa cena" - "Free is a fair price"