Ansifilter converts ANSI terminal escape sequences to HTML, RTF, BBCode, Pango Markup, LaTeX and Plain TeX. It also converts ANSI art files (CP437, BIN, XBIN, TND) to HTML or RTF.
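To illustrate the kind of conversion described above, here is a minimal Python sketch that turns ANSI colour sequences into HTML spans. It is not Ansifilter's implementation: the function name `ansi_to_html` and the colour table are assumptions made for this example, and only the reset code and the eight basic foreground colours are handled.

```python
import html
import re

# Matches SGR ("Select Graphic Rendition") sequences such as ESC[31m or ESC[0m.
SGR = re.compile(r"\x1b\[([0-9;]*)m")

# Assumed mapping of the eight basic foreground codes to CSS colour names.
COLORS = {30: "black", 31: "red", 32: "green", 33: "olive",
          34: "blue", 35: "purple", 36: "teal", 37: "silver"}

def ansi_to_html(text: str) -> str:
    """Convert basic ANSI foreground colours to <span> elements (sketch)."""
    out = []
    open_span = False
    pos = 0
    for match in SGR.finditer(text):
        # Emit the literal text before this escape sequence, HTML-escaped.
        out.append(html.escape(text[pos:match.start()]))
        pos = match.end()
        # An empty parameter list (ESC[m) means reset, the same as code 0.
        codes = [int(c) for c in match.group(1).split(";") if c] or [0]
        for code in codes:
            # A reset or a new colour closes the currently open span.
            if (code == 0 or code in COLORS) and open_span:
                out.append("</span>")
                open_span = False
            if code in COLORS:
                out.append(f'<span style="color:{COLORS[code]}">')
                open_span = True
            # All other codes (bold, background, ...) are ignored here.
    out.append(html.escape(text[pos:]))
    if open_span:
        out.append("</span>")
    return "".join(out)
```

For example, `ansi_to_html("\x1b[31merr\x1b[0m <ok>")` wraps `err` in a red span and escapes the angle brackets in the remaining text.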
Some tasks are too menial for a dedicated script but still too cumbersome even with the many neat one-liner options of perl -E. This small script fills the gap: various one-letter commands & magic variables (with meaningful aliases too) and more nifty loop options take Perl programming to the command line. Fully imports List::Util. With no program on the command line, starts a pl Shell.
TEA is a powerful text editor that provides hundreds of text processing functions. It supports QML plugins and external scripts. TEA can open plain text files, FB2, ODT, RTF, DOCX, Abiword, KWord KWD, SWX, PDF, DJVU. Other features: Built-in MC-like file manager. Spellchecker (using Aspell and/or Hunspell). Tabbed layout engine. Syntax highlighting for C, C++, Bash script, BASIC, C#, D, Fortran, Java, LilyPond, Lout, Lua, NASM, NSIS, Pascal, Perl, PHP, PO (gettext), Python, Seed7, TeX/LaTeX,
µBlock Origin is an efficient, general-purpose web filter plugin for browsers, used primarily for ad blocking. Runs in Firefox, Chrome, Opera Blink, Safari, Fennec, and other Chromium-based browsers. It automatically uses EasyList, EasyPrivacy and Malware Domain filter lists, but also supports hpHosts, MVPS HOSTS, Spam404 and many other curated block lists. µBlock is faster and uses less memory than other ad blockers, and is completely open source, does not undermine
mlbfilt can remove one or more multi-line text blocks from the output of a command or file. It can be conveniently used as an error filter. mlbfilt is geared towards usage within interactive shell sessions by handling colorized output sensibly, flushing as soon as possible and being able to handle multiple large patterns efficiently.
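The core idea of removing multi-line blocks can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions, not how mlbfilt works internally: the helper `remove_blocks` is hypothetical, it operates on a complete list of lines rather than a stream, and it ignores colour codes.

```python
def remove_blocks(lines: list[str], blocks: list[list[str]]) -> list[str]:
    """Drop every occurrence of any multi-line block from lines (sketch)."""
    out = []
    i = 0
    while i < len(lines):
        for block in blocks:
            # A block matches only if all of its lines appear consecutively.
            if block and lines[i:i + len(block)] == block:
                i += len(block)  # skip the whole matched block
                break
        else:
            # No block starts at this line, so keep it.
            out.append(lines[i])
            i += 1
    return out
```

Note that a partial match is kept: if only the first line of a two-line block appears, that line stays in the output.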
Paperwork is a GUI to make papers easily searchable using OCR. The basic idea behind Paperwork is "scan & forget": you should be able to just scan a new document and forget about it until the day you need it again.
A collection of audio plugins.
tlgu will convert an input_file from Thesaurus Linguae Graecae (TLG) and Packard Humanities Institute (PHI) representation to a Unicode (UTF-8) output_file which can then be read or searched using available pattern matching tools, like grep and awk. The TLG/PHI representation consists of "beta-code" text and citation information. The TLG/PHI and Epigraphical corpora include the majority of classical Hellenic and Latin works and inscriptions. Several options are available, including splitting
orcad_convert will dump the contents of an OrCAD L.P. SDT III/IV schematic (.sch) input_file or will convert it to a KiCad eeschema or gEDA gschem text-based schematic format in output_file. orcad_lib_convert will dump the contents of an OrCAD L.P. SDT III/IV library source (.src) input_file or will convert it to a KiCad component library (.lib) output_file or to gEDA symbol files (.sym) in directories having the same name as the input library file.
htmLawed is a PHP script to process text with HTML markup to make it more compliant with HTML standards and administrative policies. It works by making HTML well-formed with balanced and properly nested tags, neutralizing code that may be used for cross-site scripting (XSS) attacks, allowing only specified HTML tags and attributes and URL protocols through black- or white-lists. It can also tidy/pretty-print HTML, make relative URLs absolute, check for spam, etc. It is small (single file of ~50
ppip is a reimplementation of the venerable CP/M utility PIP (Peripheral Interchange Program).
A small Unix command line tool that can be used to extract data from tables in an HTML-encoded text file. Outputs the data (stripped of other HTML tags and, possibly, whitespace) in CSV format to a file or to stdout. Should handle nested tables and the most common HTML errors (missing </td>, </th> or </tr> tags).
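A rough Python sketch of this kind of extraction, using the standard library's html.parser, is given here for illustration. It is not the tool's own code: the class `TableToCsv` and the function `html_tables_to_csv` are hypothetical names, and unlike the tool described above this sketch does not treat nested tables specially. It does tolerate missing </td>, </th> and </tr> tags by closing the open cell or row whenever a new one starts.

```python
import csv
import io
from html.parser import HTMLParser

class TableToCsv(HTMLParser):
    """Collect table cells as rows of plain text (illustrative sketch)."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.rows = []
        self.row = None   # cells of the row being read, or None
        self.cell = None  # text fragments of the cell being read, or None

    def _close_cell(self):
        if self.cell is not None:
            # Join the fragments and collapse runs of whitespace.
            self.row.append(" ".join("".join(self.cell).split()))
            self.cell = None

    def _close_row(self):
        self._close_cell()
        if self.row is not None:
            self.rows.append(self.row)
            self.row = None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._close_row()   # tolerate a missing </tr>
            self.row = []
        elif tag in ("td", "th"):
            self._close_cell()  # tolerate a missing </td> or </th>
            if self.row is None:
                self.row = []   # cell without an enclosing <tr>
            self.cell = []
        # Any other tag is dropped; only its text content is kept.

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._close_cell()
        elif tag in ("tr", "table"):
            self._close_row()

    def handle_data(self, data):
        if self.cell is not None:
            self.cell.append(data)

def html_tables_to_csv(html_text: str) -> str:
    parser = TableToCsv()
    parser.feed(html_text)
    parser.close()
    parser._close_row()  # flush a row left open at end of input
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(parser.rows)
    return out.getvalue()
```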
tblutils complements the usual Unix tools like cut and paste by providing enhanced versions that support column labels throughout, so that you can extract columns by name (tblcut), filter data using a mathematical expression (tblfilter), re-order columns without caring about the column index (tblcsort), join multiple files on a common index without having to pre-sort them (tblmerge), and much more.
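Selecting columns by label rather than by index can be sketched in Python with the standard csv module. This is an illustration of the idea only, not tblcut itself: the function `cut_columns` is a hypothetical helper, and tab-separated input with a header row is an assumption.

```python
import csv
import io

def cut_columns(text: str, names: list[str], delimiter: str = "\t") -> str:
    """Return only the named columns, in the order requested (sketch)."""
    # DictReader uses the first row as the column labels.
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    out = io.StringIO()
    writer = csv.writer(out, delimiter=delimiter, lineterminator="\n")
    writer.writerow(names)
    for row in reader:
        # Raises KeyError if a requested label is not in the header.
        writer.writerow([row[name] for name in names])
    return out.getvalue()
```

Because columns are looked up by label, the same call also re-orders them: asking for `["age", "id"]` puts `age` first regardless of its position in the input.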
cencode encodes a stream or file into quoted C-string escape sequences, suitable for use directly in C code. It is a universal alternative to dedicated export formats like XPM, unportable resource forks, linker flags, and Windows resources where dynamic loading of external data is not possible or undesired.
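The encoding step can be approximated with a short Python sketch. The function `c_escape` is a hypothetical helper, not cencode's code, and the choice of three-digit octal escapes for non-printable bytes is an assumption of this example.

```python
def c_escape(data: bytes) -> str:
    """Return data as a double-quoted C string literal (sketch)."""
    # Bytes that have a short, standard C escape sequence.
    simple = {0x07: "\\a", 0x08: "\\b", 0x09: "\\t", 0x0A: "\\n",
              0x0B: "\\v", 0x0C: "\\f", 0x0D: "\\r",
              0x22: '\\"', 0x5C: "\\\\"}
    out = []
    for byte in data:
        if byte in simple:
            out.append(simple[byte])
        elif 0x20 <= byte < 0x7F:
            out.append(chr(byte))  # printable ASCII is copied as-is
        else:
            # Octal escapes take at most three digits, so a digit that
            # follows cannot be absorbed into the escape (unlike \x).
            out.append(f"\\{byte:03o}")
    return '"' + "".join(out) + '"'
```

Since the literal may contain escaped NUL bytes, C code that embeds it would typically take the length from sizeof rather than strlen.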