mirror of https://github.com/Y2Z/monolith synced 2024-11-17 03:25:39 +00:00


 _____     ______________    __________      ___________________    ___
|     \   /              \  |          |    |                   |  |   |
|      \_/       __       \_|    __    |    |    ___     ___    |__|   |
|               |  |            |  |   |    |   |   |   |   |          |
|   |\     /|   |__|    _       |__|   |____|   |   |   |   |    __    |
|   | \___/ |          | \                      |   |   |   |   |  |   |
|___|       |__________|  \_____________________|   |___|   |___|  |___|

A data hoarder's dream come true: bundle any web page into a single HTML file. You can finally replace that gazillion of open tabs with a gazillion of .html files stored somewhere on your precious little drive.

Unlike the conventional “Save page as”, monolith not only saves the target document, it embeds CSS, image, and JavaScript assets all at once, producing a single HTML5 document that is a joy to store and share.

Compared to saving websites with wget -mpk, this tool embeds all assets as data URLs, which lets browsers render the saved page the way it appeared on the Internet, even when no network connection is available.


Installation

Using Cargo (cross-platform)

cargo install monolith

Via Homebrew (macOS and GNU/Linux)

brew install monolith

Via Chocolatey (Windows)

choco install monolith

Via Scoop (Windows)

scoop install main/monolith

Via MacPorts (macOS)

sudo port install monolith

Using Snapcraft (GNU/Linux)

snap install monolith

Using Guix (GNU/Linux)

guix install monolith

Using AUR (Arch Linux)

yay monolith

Using aports (Alpine Linux)

apk add monolith

Using FreeBSD packages (FreeBSD)

pkg install monolith

Using FreeBSD ports (FreeBSD)

cd /usr/ports/www/monolith/
make install clean

Using pkgsrc (NetBSD, OpenBSD, Haiku, etc.)

cd /usr/pkgsrc/www/monolith
make install clean

Using containers

docker build -t y2z/monolith .
sudo install -b dist/run-in-container.sh /usr/local/bin/monolith

From source

Dependency: libssl

git clone https://github.com/Y2Z/monolith.git
cd monolith
make install

Using pre-built binaries (Windows, ARM-based devices, etc.)

Every release contains pre-built binaries for Windows, GNU/Linux, as well as platforms with non-standard CPU architecture.


Usage

monolith https://lyrics.github.io/db/P/Portishead/Dummy/Roads/ -o portishead-roads-lyrics.html
cat index.html | monolith -aIiFfcMv -b https://original.site/ - > result.html

Options

  • -a: Exclude audio sources
  • -b: Use custom base URL
  • -B: Forbid retrieving assets from specified domain(s)
  • -c: Exclude CSS
  • -C: Read cookies from file
  • -d: Allow retrieving assets only from specified domain(s)
  • -e: Ignore network errors
  • -E: Save document using custom encoding
  • -f: Omit frames
  • -F: Exclude web fonts
  • -h: Print help information
  • -i: Remove images
  • -I: Isolate the document
  • -j: Exclude JavaScript
  • -k: Accept invalid X.509 (TLS) certificates
  • -M: Don't add timestamp and URL information
  • -n: Extract contents of NOSCRIPT elements
  • -o: Write output to file (use “-” for STDOUT)
  • -s: Be quiet
  • -t: Adjust network request timeout
  • -u: Provide custom User-Agent
  • -v: Exclude videos
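
The single-letter options can be combined in one invocation. As a minimal sketch, the helper below saves a page without audio, images, JavaScript, videos, or web fonts, using only flags from the list above. The names save_text_only and MONOLITH are hypothetical and not part of monolith; they exist only for illustration.

```shell
# Hypothetical helper, not part of monolith.
# Set MONOLITH=echo to print the arguments instead of running the tool.
save_text_only() {
  url="$1"
  output="$2"
  # -a audio, -i images, -j JavaScript, -v videos, -F web fonts: all left out
  "${MONOLITH:-monolith}" -a -i -j -v -F -o "$output" "$url"
}
```

For example, `save_text_only https://example.com example.html` would run monolith with those five exclusions and write the result to example.html.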

Whitelisting and blacklisting domains

Options -d and -B control which domains assets may be retrieved from, e.g.:

monolith -I -d example.com -d www.example.com https://example.com -o example-only.html
monolith -I -B -d .googleusercontent.com -d googleanalytics.com -d .google.com https://example.com -o example-no-ads.html

Dynamic content

monolith doesn't include a JavaScript engine, so websites that retrieve and display data after the initial load may require additional tools.

For example, Chromium (Chrome) can be used to act as a pre-processor for such pages:

chromium --headless --incognito --dump-dom https://github.com | monolith - -I -b https://github.com -o github.html
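
The pipeline above uses the page URL twice: once for Chromium and once as the base URL for monolith. A small wrapper can keep the two in sync. This is a sketch only; save_rendered, CHROMIUM, and MONOLITH are hypothetical names, and the Chromium binary may be called something else on your system.

```shell
# Hypothetical wrapper around the pipeline above; not part of monolith.
# CHROMIUM and MONOLITH can be overridden to point at differently named binaries.
save_rendered() {
  url="$1"
  output="$2"
  # Chromium prints the DOM after scripts have run; monolith reads it from
  # STDIN ("-") and receives the same URL as its custom base URL (-b).
  "${CHROMIUM:-chromium}" --headless --incognito --dump-dom "$url" |
    "${MONOLITH:-monolith}" - -I -b "$url" -o "$output"
}
```

Calling `save_rendered https://github.com github.html` is then equivalent to the one-liner shown above.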

Proxies

To use a proxy, set the https_proxy, http_proxy, and no_proxy environment variables.
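
As a minimal sketch, the three variables can be set for a single command rather than for the whole shell session. The helper name with_proxy, the proxy address used in the example call, and the no_proxy value are all assumptions made for illustration; substitute your own.

```shell
# Hypothetical helper, not part of monolith.
# Runs the given command with the proxy variables set only for that command.
with_proxy() {
  proxy_url="$1"
  shift
  # no_proxy here is an example value: hosts that should bypass the proxy
  https_proxy="$proxy_url" http_proxy="$proxy_url" \
    no_proxy="localhost,127.0.0.1" "$@"
}
```

For example, `with_proxy http://127.0.0.1:8080 monolith https://example.com -o example.html` would run monolith with all three variables set for that one invocation.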


Contributing

Please open an issue if something is wrong; it helps make this project better.



License

To the extent possible under law, the author(s) have dedicated all copyright related and neighboring rights to this software to the public domain worldwide. This software is distributed without any warranty.


Keep in mind that monolith is not aware of your browser's session.