MyNixOS website logo
Description

Efficient and simple HTML/XML parsing library.

taggy is a simple package for parsing HTML (and should work with XML) written on top of the attoparsec library, which makes it one of the most efficient (space and time consumption wise) on hackage.

This is the root module of taggy. It reexports everything from the package. See each module's docs for details about the functions and types involved in taggy.

While we've been testing the parser on many pages, it may still be a bit rough around the edges. Let us know on github if you have any problem.

If you like to look at your HTML through various optical instruments, feel free to take a look at the companion taggy-lens package we've put up together. It makes HTML parsing a piece of cake.

If you want to parse a document as list of tags and go through it as some kind of stream by just picking what you need, head to Text.Taggy.Parser and take a look at Text.Taggy.Parser.taggyWith and Text.Taggy.Parser.run.

If you want to parse the document as a DOM tree and traverse it to find the information you need, use Text.Taggy.DOM.parseDOM. This is especially useful when used in conjunction with taggy-lens.

If you build some HTML manually or just transform some existing DOM tree and want to turn it into a Data.Text.Lazy.Text head to Text.Taggy.Renderer and look at Text.Taggy.Renderer.render.

Metadata

Version

0.2.1

Executables (1)

  • bin/taggy

Platforms (75)

    Darwin
    FreeBSD
    Genode
    GHCJS
    Linux
    MMIXware
    NetBSD
    none
    OpenBSD
    Redox
    Solaris
    WASI
    Windows
Show all
  • aarch64-darwin
  • aarch64-genode
  • aarch64-linux
  • aarch64-netbsd
  • aarch64-none
  • aarch64_be-none
  • arm-none
  • armv5tel-linux
  • armv6l-linux
  • armv6l-netbsd
  • armv6l-none
  • armv7a-darwin
  • armv7a-linux
  • armv7a-netbsd
  • armv7l-linux
  • armv7l-netbsd
  • avr-none
  • i686-cygwin
  • i686-darwin
  • i686-freebsd
  • i686-genode
  • i686-linux
  • i686-netbsd
  • i686-none
  • i686-openbsd
  • i686-windows
  • javascript-ghcjs
  • loongarch64-linux
  • m68k-linux
  • m68k-netbsd
  • m68k-none
  • microblaze-linux
  • microblaze-none
  • microblazeel-linux
  • microblazeel-none
  • mips-linux
  • mips-none
  • mips64-linux
  • mips64-none
  • mips64el-linux
  • mipsel-linux
  • mipsel-netbsd
  • mmix-mmixware
  • msp430-none
  • or1k-none
  • powerpc-netbsd
  • powerpc-none
  • powerpc64-linux
  • powerpc64le-linux
  • powerpcle-none
  • riscv32-linux
  • riscv32-netbsd
  • riscv32-none
  • riscv64-linux
  • riscv64-netbsd
  • riscv64-none
  • rx-none
  • s390-linux
  • s390-none
  • s390x-linux
  • s390x-none
  • vc4-none
  • wasm32-wasi
  • wasm64-wasi
  • x86_64-cygwin
  • x86_64-darwin
  • x86_64-freebsd
  • x86_64-genode
  • x86_64-linux
  • x86_64-netbsd
  • x86_64-none
  • x86_64-openbsd
  • x86_64-redox
  • x86_64-solaris
  • x86_64-windows