Extract HTML using CSS selectors in the command-line.
https://crates.io/crates/select-html
Bauke 29dd410b3b | ||
---|---|---|
source | ||
tests | ||
.envrc | ||
.gitignore | ||
Cargo.lock | ||
Cargo.toml | ||
LICENSE | ||
Makefile.toml | ||
README.md | ||
flake.lock | ||
flake.nix | ||
rustfmt.toml | ||
rustup-toolchain.toml | ||
shell.nix |
README.md
Select 👁 HTML
Extract HTML using CSS selectors in the command-line.
Features
- Input HTML using stdin or a file.
- Output full elements, inner text or attribute values.
- Trim excess whitespace from output.
Installation
Cargo
With a working Rust and Cargo installation, you can install select-html
from Crates.io.
cargo install select-html
Binaries
Precompiled x86_64-unknown-linux-gnu
binaries are available on the Releases page.
Examples
# Use stdin to pipe HTML to select-html.
curl -s https://example.org | select-html
# Or use `--file`.
select-html --file 'example.html'
# Select and output HTML.
select-html 'title'
# Output only the inner text of selected elements.
select-html '.example' --text
# Output attribute values.
select-html '[property^="og:"]' -a 'property' -a 'content'
Usage
USAGE:
select-html [OPTIONS] <SELECTOR>
ARGS:
<SELECTOR> The CSS selector to use
OPTIONS:
-a, --attribute <ATTRIBUTE> Output the attribute's value from the selected element, can be
used multiple times
--file <FILE> A HTML file to read, if not specified stdin will be used instead
-h, --help Print help information
-t, --text Output inner text of the selected elements
--trim Trim whitespace from selected items
-V, --version Print version information
Development
With Nix flakes and direnv installed and enabled, all the required dependencies are automatically loaded from shell.nix
. Then cargo-make can be used to build, deploy and lint the code. The available tasks are all described in the Makefile.toml
configuration.
Feedback
Found a problem or want to request a new feature? Email me@bauke.xyz and I'll see what I can do for you.
License
Distributed under the AGPL-3.0-or-later license, see LICENSE for more information.