DataparkSearch versions


Previous years: 2003-2004, 2005, 2006, 2007.
Latset snapshot
A function of canonical lanuage name has been fixed. The languages with names below "ru" in lexical order were affected.
The maximim length of log records has been enlarged to 480 bytes, the MUST size for a syslog message.
MaxHrefsPerServer command has been added. Uset it to limit the maximum number of hrefs accepted per server during one indexer run.
Limit command has been extended to accept SQL-based limits.
25 Apr 2009: 4.52, 2,175,210 bytes, 25.04.2009, 16:06 MSK
Busy timemout has been increased for SQLite.
Fixes for sub-document recoding and content-length calculation for a document with sub-documents.
A fix for incomplete passing text items from a subdocument to parent document.
The command parser has been fixed for case when a section in allin<section>: operator contains character '_' or '-'.
SkipHrefIn command has been added. Use it to skip some HTML tags from new href lookup.
SEASections command has been added. Use it to specify the list of sections which are used to construct SEA summary.
A possible trap on an empty document has been fixed.
A Disallow command in robots.txt doesn't lead to document removal from database anymore.
An error has been fixed in uncompression of big files.
Quffix command has been added.
Searchd cleans-up now the search cache on config loading/reloading.
A bug in stored check-up has been fixed.
Time zone processing has been added for Last-Modified header and meta.
MakePrefixes command has been added. Use it to produce all prefixes for words in a document. This is suitable for making suggestions.
31 Dec 2008: 4.51, 2,159,219 bytes, 31.12.2008, 18:57 MSK
Exact as in query string matching has been added for relevance calculation.
CAS based synchronization has been implemented for i386/x86_64 platform.
The ActionSQL command has been added. Use it to execute SQL-queries with document related data while indexing.
The support for KOI8-C (an extension of KOI8-R with old-Russian letters) charset has been added.
FastHrefCheck command has been added. Use it to skip href checking against server list during parsing.
SubDocCnt command has been added. Use it to specity the maximal number of sub-documents indexed per one document.
SubDocLevel command has been added. Use it to specify maximal nesting level for sub-documents.
HrefSection proccessing has been fixed in XML parser.
$(url.directory) meta-variable has been added.
storedoc.cgi accepts now the name of template in &tmplt= CGI-parameter.
Accept: HTTP header has been fixed for case when pattern is used for Content-Type in MIME command.
A bug in result merging has been fixed for multi-dbaddr mode.
allin<section>: operator has been added to the search query language.
storedoc.cgi takes now document from remote host if it unable to fetch it from stored database.
26 Jul 2008: 4.50, 2,112,004 bytes, 27.07.2008, 22:13 MSK
Default value for PopRankSkipSameSite command has been changed to "yes".
Possible memory leak has been fixed for a sub-document indexed from stored database.
The strict option has been added for Section command.
A word break has been added for French-style contractions.
Big lists of Russian and English synonyms have been added.
MaxSiteLevel command accept now a negative argument to group URLs on subdirectory basis.
The SkipUnreferred command has been extended to delete unreferred documents if necessary.
Del log processing has been fixed in splitter for case when cache log is empty.
Some German letters automatically replace by bi-letter combinations in accent-free search mode. ß -> ss, ä -> ae, ö -> oe, ü -> ue.
SQLite3 support has been added. Use --with-sqlite3 option for configure to enable it.
Indexing has been fixed for documents with several versions in different languages. You need to execute "indexer -Erehashstored" command when upgrade.
HTML parser understands now <!-- google_ad_section_start -->, <!-- google_ad_section_start(weight=ignore) --> and <!-- google_ad_section_end --> comments as tags to include/exclude content for indexing.
Relevance calculation has been improved for case when acronyms and abbreviations are used.
12 Feb 2008: 4.49, 2,493,884 bytes, 13.02.2008, 13:21 MSK
String tokenization has been improved. For example, "c--" and "c#" are now cosidered as words.
A subdocument indexing technique has been implemented.
LongestTextItems command has been added. Use it to specify the number of longest text items to index.
The support has been added for georgian-academy and georgian-ps charsets.
URL data preloading has been fixed for multi-DBAddr configurations.
HTML parser is now skiping indexing within tags with visibility set to none or hidden in style attribute.
Subnet command has been fixed.
$*(x) type of template meta-variable has been added. Use it to HTML-escape value without search words highlighting.
$(np) and $(p) have been fixed in "resbot" and "bottom" sections of search template.
PagesInGroup command has been added. Use it to specify the number of additional pages from the same site when google-like groupping is enabled.
ServerWeight command has been fixed.


Geo Visitors Map who's online