tlgu converts an input_file from Thesaurus Linguae Graecae (TLG) and Packard Humanities Institute (PHI) representation to a Unicode (UTF-8) output_file, which can then be read or searched using available pattern matching tools such as grep and awk. The TLG/PHI representation consists of "beta-code" text and citation information. The TLG/PHI and epigraphical corpora include the majority of classical Hellenic and Latin works and inscriptions. Several options are available, including splitting
µBlock Origin is an efficient, general-purpose web filter plugin for browsers, used primarily for ad blocking. It runs in Firefox, Chrome, Opera Blink, Safari, Fennec, and other Chromium-based browsers. It automatically uses the EasyList, EasyPrivacy and Malware Domain filter lists, and also supports hpHosts, MVPS HOSTS, Spam404 and many other curated block lists. µBlock is faster and more memory-efficient than other ad blockers, and is completely open source, does not undermine
Ansifilter converts ANSI terminal escape sequences to HTML, RTF, BBCode, Pango Markup, LaTeX and Plain TeX. It also converts ANSI art files (CP437, BIN, XBIN, TND) to HTML or RTF.
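As an illustration of the kind of conversion described above, here is a minimal Python sketch that turns a few ANSI SGR escape sequences (reset, bold, and the eight basic foreground colors) into HTML spans. It is not Ansifilter's implementation; the function name `ansi_to_html` and the color mapping are assumptions made for this example, and all other escape sequences are ignored.

```python
import html
import re

# Matches SGR ("Select Graphic Rendition") sequences such as "\x1b[1;31m".
SGR = re.compile(r"\x1b\[([0-9;]*)m")

# Assumed mapping of the eight basic foreground codes to CSS color names.
COLORS = {30: "black", 31: "red", 32: "green", 33: "olive",
          34: "blue", 35: "purple", 36: "teal", 37: "silver"}

def ansi_to_html(text):
    """Convert basic SGR sequences in text to HTML <span> elements."""
    out = []
    styles = {}          # currently active CSS properties
    open_span = False
    pos = 0
    for m in SGR.finditer(text):
        # Emit the literal text before this escape sequence, HTML-escaped.
        out.append(html.escape(text[pos:m.start()]))
        pos = m.end()
        codes = [int(c) for c in m.group(1).split(";") if c] or [0]
        for code in codes:
            if code == 0:
                styles.clear()                    # reset all attributes
            elif code == 1:
                styles["font-weight"] = "bold"
            elif code in COLORS:
                styles["color"] = COLORS[code]
        # Close the previous span and reopen one if any style is active.
        if open_span:
            out.append("</span>")
            open_span = False
        if styles:
            css = ";".join(f"{k}:{v}" for k, v in styles.items())
            out.append(f'<span style="{css}">')
            open_span = True
    out.append(html.escape(text[pos:]))
    if open_span:
        out.append("</span>")
    return "".join(out)
```

For example, `ansi_to_html("\x1b[1;31mERR\x1b[0m <ok>")` wraps `ERR` in a bold red span and escapes the angle brackets in the remaining text.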
TEA is a powerful text editor that provides hundreds of text processing functions. It supports QML plugins and external scripts. TEA can open plain text files, FB2, Ebook, ODT, DOCX, Abiword, KWord KWD, SWX, PDF, DJVU. Other features: Built-in MC-like file manager. Speech Dispatcher support. Spellchecker (using aspell and/or Hunspell). Tabbed layout engine. Syntax highlighting for C, C++, Bash script, BASIC, C#, D, Fortran, Java, LilyPond, Lout, Lua, NASM, NSIS, Pascal, Perl, PHP, PO (gettext
A collection of Audio plugins
Supports complete timeline editing functions, including move, crop, cut, thumbnail preview, scale and delete. Supports a more flexible and easy-to-use blueprint system. A blueprint is represented as nodes, and complex functions are handled through nodes and flows. Supports 45+ built-in media filters and 70+ built-in media transitions. Supports about 10 video and audio analysis tools. Supports multiple audio and video codecs, including ProRes, H.264, H.265, VP9, etc. Supports import and edit
jQuery UI widget for structured queries like "Contacts where Firstname starts with A and Birthday before 1/1/2000 and State in (CA, NY, FL)"...
mlbfilt can remove one or more multi-line text blocks from the output of a command or file. It can be conveniently used as an error filter. mlbfilt is geared towards usage within interactive shell sessions by handling colorized output sensibly, flushing as soon as possible and being able to handle multiple large patterns efficiently.
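The core idea of removing multi-line blocks can be sketched in a few lines of Python. This is only an illustration of the technique, not mlbfilt's code or interface: the function name `remove_blocks` is hypothetical, matching is exact and line-based, and the colorized-output handling and streaming behaviour mentioned above are not modelled.

```python
def remove_blocks(lines, blocks):
    """Return lines with every contiguous occurrence of any block removed.

    lines  -- list of strings (the command output, one entry per line)
    blocks -- list of blocks, each block being a list of lines to drop
    """
    out = []
    i = 0
    while i < len(lines):
        for block in blocks:
            n = len(block)
            # A block matches only if all of its lines appear consecutively.
            if n and lines[i:i + n] == block:
                i += n          # skip the whole block
                break
        else:
            # No block starts at this position: keep the line.
            out.append(lines[i])
            i += 1
    return out
```

A partial match is left untouched: with the block `["warn 1", "warn 2"]`, a lone `"warn 1"` line stays in the output.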
Paperwork is a GUI to make papers easily searchable using OCR. The basic idea behind Paperwork is "scan & forget": you should be able to just scan a new document and forget about it until the day you need it again.
orcad_convert will dump the contents of an OrCAD L.P. SDT III/IV schematic (.sch) input_file or will convert it to a KiCad eeschema or gEDA gschem text-based schematic format in output_file. orcad_lib_convert will dump the contents of an OrCAD L.P. SDT III/IV library source (.src) input_file or will convert it to a KiCad component library (.lib) output_file or to gEDA symbol files (.sym) in directories having the same name as the input library file.
htmLawed is a PHP script to process text with HTML markup to make it more compliant with HTML standards and administrative policies. It works by making HTML well-formed with balanced and properly nested tags, neutralizing code that may be used for cross-site scripting (XSS) attacks, allowing only specified HTML tags and attributes and URL protocols through black- or white-lists. It can also tidy/pretty-print HTML, make relative URLs absolute, check for spam, etc. It is small (single file of ~50
ppip is a reimplementation of the venerable CP/M utility PIP - Peripheral Interchange Program
A small Unix command line tool that can be used to extract data from tables in an HTML-encoded text file. Outputs the data (stripped of other HTML tags and, possibly, whitespace) as a CSV-formatted file or on stdout. Should handle recursive tables and the most common HTML errors (missing tags).
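A rough Python sketch of this kind of table extraction, using only the standard library, is shown below. It is not the tool's own code: the names `TableExtractor` and `html_tables_to_csv` are hypothetical, missing closing cell and row tags are tolerated by closing the open cell or row whenever a new one starts, and nested tables are simply flattened into the same row stream rather than handled specially.

```python
import csv
import io
from html.parser import HTMLParser

class TableExtractor(HTMLParser):
    """Collect table cell text as rows, tolerating missing closing tags."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None     # cells of the row being read, or None
        self._cell = None    # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._flush_row()            # implicit end of the previous row
            self._row = []
        elif tag in ("td", "th"):
            self._flush_cell()           # implicit end of the previous cell
            if self._row is None:        # cell without an enclosing <tr>
                self._row = []
            self._cell = []

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._flush_cell()
        elif tag in ("tr", "table"):
            self._flush_row()

    def handle_data(self, data):
        # Text inside other tags (e.g. <b>) is kept; the tags are dropped.
        if self._cell is not None:
            self._cell.append(data)

    def _flush_cell(self):
        if self._cell is not None:
            # Collapse runs of whitespace and trim the cell text.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None

    def _flush_row(self):
        self._flush_cell()
        if self._row:
            self.rows.append(self._row)
        self._row = None

def html_tables_to_csv(markup):
    """Return the table data found in markup as CSV text."""
    parser = TableExtractor()
    parser.feed(markup)
    parser.close()
    parser._flush_row()                  # input may end mid-row
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(parser.rows)
    return buf.getvalue()
```

Calling `html_tables_to_csv` on a table whose first row lacks its closing cell and row tags still yields one CSV line per row, with cells containing commas quoted by the `csv` module.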