Groonga is a fast and accurate full text search engine based on inverted index. One of the characteristics of Groonga is that a newly registered document instantly appears in search results. Also, Groonga allows updates without read locks. These characteristics result in superior performance on real-time applications. Groonga is also a column-oriented database management system (DBMS). Compared with well-known row-oriented systems, such as MySQL and PostgreSQL, column-oriented systems are more
Yioop software provides many of the same features of larger search portals: Search Results. Yioop comes with a crawler that can be used to crawl the open web or a selection of URLs of your choice. It can index popular archive formats like Wikipedia XML-dumps, arc, warc, Open Directory Project-RDF (now Curlie.org), as well as dumps of emails or databases. Once you have created Yioop indexes of your desired data sources, Yioop can serve as a search engine for your data. It supports "crawl mix
tlgu will convert an input_file from Thesaurus Linguae Graeca (TLG) and Packard Humanities Institute (PHI) representation to a Unicode (UTF-8) output_file which can then be read or searched using available pattern matching tools, like grep and awk. The TLG/PHI representation consists of "beta-code" text and citation information. The TLG / PHI and Epigraphical corpuses include the majority of classical Hellenic and Latin works and inscriptions. Several options are available, including splitting
A file searching tool, able to perform in-name and in-text matching, as well searching by file type and last modified time. It is also capable of performing indexing by using an mlocate database.
DocFetcher is an Open Source desktop search application: It allows you to search the contents of files on your computer. — You can think of it as Google for your local files. The application runs on Windows, Linux and Mac OS X, and is made available under the Eclipse Public License.
UniversalCodeGrep (ucg) is an extremely fast grep-like tool specialized for searching large bodies of source code. It is intended to be largely command-line compatible with Ack, to some extent with ag, and where appropriate with grep. Search patterns are specified as PCRE regexes.
mnoGoSearch is an open source full-featured web search engine software.
CloneManager allows to find duplicate files, and those which haven't been backed up. It may skip files based on size, and use directory structures for search grouping. It's implemented in Java and available in English and French.