DataparkSearch

A full-featured open source web-based search engine released under the GNU General Public License.

DataparkSearch Ranking & Summary

  • License: GPL
  • Price: FREE
  • Publisher Name: Maxim Zakharov
  • Publisher web site: http://www.dataparksearch.org/

DataparkSearch Description

DataparkSearch Engine is a full-featured, open source, web-based search engine released under the GNU General Public License. It is designed to organize search within a web site, a group of web sites, an intranet or a local system.

DataparkSearch consists of two parts. The first is the indexing mechanism (indexer), which walks over HTML hypertext references and stores the words and new references it finds in a database. The second is a web CGI front-end that provides search over the data collected by the indexer.

Here are some key features of "DataparkSearch":

· Support for http, https, ftp, nntp and news URL schemes.
· htdb virtual URL scheme support for indexing SQL databases.
· Built-in support for the text/html, text/xml, text/plain, audio/mpeg (MP3) and image/gif MIME types.
· External parser support for other document types.
· Ability to index multilingual sites using content negotiation.
· Searching all word forms using ispell affixes and dictionaries.
· Fuzzy searching based on acronyms and abbreviations.
· Stop-word, synonym and acronym lists.
· Boolean query language support.
· Neural-network-based Popularity Rank.
· Sorting of results by relevancy, popularity rank, last modified time and importance (the product of relevancy and popularity rank).
· Support for various character sets.
· Accent-insensitive search.
· Phrase segmentation for Chinese, Japanese, Korean and Thai.
· mod_dpsearch, a search module for the Apache web server.
· Internationalized Domain Names support.

What's New in This Release:

· The busy timeout has been increased for SQLite.
· Fixes for sub-document recoding and content-length calculation for a document with sub-documents.
· A fix for incomplete passing of text items from a sub-document to its parent document.
· The command parser has been fixed for the case when a section in the allin: operator contains the character '_' or '-'.
· SkipHrefIn command has been added. Use it to skip some HTML tags from new href lookup.
· SEASections command has been added. Use it to specify the list of sections which are used to construct the SEA summary.
· A possible trap on an empty document has been fixed.
· A Disallow command in robots.txt no longer leads to document removal from the database.
· An error in the decompression of big files has been fixed.
· Quffix command has been added.
· Searchd now cleans up the search cache on config loading/reloading.
· A bug in the stored check-up has been fixed.
· Time zone processing has been added for the Last-Modified header and meta.
· MakePrefixes command has been added. Use it to produce all prefixes for words in a document. This is suitable for making suggestions.
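
To illustrate why storing every prefix of a word is useful for suggestions, as the MakePrefixes note describes, here is a minimal Python sketch. It is not DataparkSearch's implementation: the function names (make_prefixes, build_prefix_index, suggest) and the in-memory dictionary are hypothetical and chosen only for this example. It shows that once all prefixes are indexed, looking up what the user has typed so far becomes a single key lookup.

```python
def make_prefixes(word, min_len=1):
    """Return every prefix of `word` (including the word itself), shortest first."""
    return [word[:i] for i in range(min_len, len(word) + 1)]


def build_prefix_index(words):
    """Map each prefix to the sorted list of indexed words that start with it.

    Hypothetical in-memory stand-in for an index; a real engine would
    store this in its database.
    """
    index = {}
    for word in set(words):
        lowered = word.lower()
        for prefix in make_prefixes(lowered):
            index.setdefault(prefix, set()).add(lowered)
    return {prefix: sorted(matches) for prefix, matches in index.items()}


def suggest(index, typed, limit=5):
    """Return up to `limit` indexed words that begin with the typed text."""
    return index.get(typed.lower(), [])[:limit]
```

For example, after building the index from the words "search", "searchd" and "section", calling suggest(index, "sea") would return the two words that begin with "sea". The trade-off is index size: each word of length n adds n entries, which is the price paid for constant-time suggestion lookups.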

