SEARCH
NEW RPMS
DIRECTORIES
ABOUT
FAQ
VARIOUS
BLOG

 
 
Changelog for xapian-omega-1.4.21-lp155.1.1.x86_64.rpm :

* Fri Sep 30 2022 Antonio Larrosa - Update to 1.4.21:
* documentation: + Consistently say \"macOS\" not \"Mac OS X\", \"OS X\", etc.
* indexers / omindex: + Add support for gzip-compressed SVG files (.svgz). + Handle in SVG. Previously only <dc:title> inside <metadata> was considered. If both are present, <title> now takes precedence. <BR>* testsuite: + omegatest: Add skip-for-32-bit-time_t mechanism and use it to conditionally enable some testcases which fail on platforms with 32-bit time_t. <BR>* build system: + Update to use AX_CXX_COMPILE_STDCXX which is a replacement for AX_CXX_COMPILE_STDCXX_11 (which we were using) which also supports newer C++ standards versions which will be useful. For C++11 the only difference seems to be that the macro now checks for attribute support - we use C++11 attributes so that seems a good thing.<BR>* Sun Sep 04 2022 Carsten Ziepke <kieltuxAATTgmail.com>- Update to 1.4.20: <BR>* indexers: + OpenDocument: Previously we only inserted an implicit space before each paragraph. Now we insert them both before and after each paragraph and heading, and before forced each line-break and tab. + Add extension mapping for .awt (Abiword templates). + Index metadata from XPS files. + -G and -C short options were documented in --help but not previously actually handled. Reported by David Bremner. + Show --max-size required argument in --help output. + Remove lingering handling for database backends without slot bounds since all backends have been required to support these since 1.4.11. <BR>* scriptindex: + Process an incomplete final line from a dump file. Previously if the final line lacked a newline scriptindex would quietly ignore it (unless it was the only line). + The `unique` action now takes an optional `missing` parameter to specify what to do if a record doesn\'t trigger the unique action or triggers it with an empty value. The default is now to issue a warning and create a new document (the same as before, except that there was only previously a warning for the empty value case). In Omega 1.5.0 the default will change to an error as that seems a better default, but is less compatible with potential existing use. + Explicitly allow multiple blank lines in input files. Previously such extra blank lines were treated as empty records and in many cases these got quietly skipped, but e.g. with the new UNIQUE checks this could result in a warning or error. + If we hit an error while parsing the index script we used to exit right away, but now we finish parsing the index script since it\'s more helpful to report all the errors in an index script rather than the user having to fix them one by one. This requires us to sensibly recover after each index script parse error - if you find a case where this recovery triggers further bogus errors please report it and we\'ll try to improve the recovery. + In four cases while handling input data (two cases of bad hex data fed to `hextobin`, an input data line without a `=`, and `load` failing to load the specified file) we\'d emit a diagnostic that was labelled as an \"error\" but really it was handled as a warning as we kept reading input and the \"error\" didn\'t affect the exit status. It doesn\'t really make sense to continue in any of these cases so we now exit with non-zero status right away. + A parameter in the index script which should be an integer but isn\'t, or should be positive but isn\'t now gives an error rather than a warning since an error seems more helpful. + All diagnostics issued while parsing the index script now include column information. + Avoid forcibly flushing the output stream after every message. <BR>* portability: + Require PCRE2 instead of PCRE. The original PCRE is now EOL and unmaintained (last release was June 2021). In omega it\'s potentially used to process input from the internet, so security is a real concern hence we\'re switching to PCRE2.- Run spec-cleaner, BuildRequires pcre2-devel instead of pcre-devel<BR>* Thu Aug 27 2020 Antonio Larrosa <alarrosaAATTsuse.com>- Update to 1.4.17: + documentation: <BR>* Document comment format supported by scriptindex index scripts. We\'ve supported comments on a line by themselves and introduced with a # since scriptindex was first added back in 2002, but it seems have never actually been documented before now. + omega: <BR>* Check for SERVER_PROTOCOL=INCLUDED before anything which might throw an exception so that if it is set we suppress the Content-Type: when reporting such exceptions. Spotted by Gaurav Arora. <BR>* Report get_description() for Xapian::Error exceptions instead of get_msg(). This means we now report the exception\'s type, context (useful for network errors), and errno information. <BR>* Avoid leaking MyStopper object. The object essentially has the lifespan of omega itself, but becomes unreachable when the QueryParser object is destroyed. To make it easier to use leak-checking tools, hand ownership of this object to the QueryParser object. + testsuite: <BR>* omegatest: Tell leak sanitizer not to report leaks for allocations which aren\'t explicitly released on exit - the OS will reclaim all memory from the process at this point and explicitly releasing everything just takes time for no real benefit. We will still see leaks of objects which become unreachable during a run.- Update to 1.4.16: + indexers: <BR>* Fix handling of XML empty tag syntax when there\'s a quoted parameter right before the closing `/>`. This caused `<title xml:lang=\"en-US\"/>` to treat the body text as the document title. Spotted by Gaurav Arora. <BR>* omindex: Fix killing of filter child process if the parent process receives a signal. Spotted by Gaurav Arora. + omega: <BR>* Reject $setrelevant without an argument list. This has never been documented as allowed, and previously crashed with a segfault. Fixes #802, reported by Gaurav Arora. <BR>* If there\'s an error opening the databases we now close any we managed to open successfully before the error so that things like $dbsize can\'t end up reporting values for a subset of the specified databases. + portability: <BR>* Use our own autoconf cache variable namespace (xo_cv_ prefix instead of ac_cv_) to avoid colliding with standard autoconf macro use if config.site or a shared config.cache is used. The former case caused a build failure for the OpenBSD port with 1.4.15, reported by Lucas R.<BR>* Thu Apr 30 2020 Antonio Larrosa <alarrosaAATTsuse.com>- Update to 1.4.15: <BR>* documentation: + Update documentation about how to add a new format to omindex. <BR>* indexers: + Check for a BOM on HTML files, which for HTML5 should determine the encoding. <BR>* omega: + Allow $if{COND} without any actions which is useful as a way to evaluate something but ignore the result if you just want the side effects. Indeed we were already recommending to use it if you want to ignore the return value of $log. Fixes bug introduced in 1.4.14. + Add OmegaScript support for $jsonbool{COND} for encoding a boolean value for use in JSON. This is equivalent to $if{COND,true,false} but more readable. + Add OmegaScript support for $jsonobject{} which allows producing a JSON object from an OmegaScript map. + Allow specifying a format to $jsonarray{} so it is no longer restricted to producing an array of strings. + Add $keys{MAP} OmegaScript command which gives a sorted list of the keys from an OmegaScript map. <BR>* portability: + Simplify probes for snprintf. The broken snprintf in libbsd in Linux libc4 is from ~25 years ago so way too ancient to matter now, and all callers already handle the pre-ISO semantics of returning -1 for an undersize buffer so we don\'t need to run a test program to probe for this at configure time, which is more cross-compile friendly. + Avoid deprecation warning on recent Linux. We were including sys/sysctl.h if it existed, which it does on Linux but we don\'t actually use it there. Including it now warns that it is deprecated, so skip including it under Linux.<BR>* Wed Jan 15 2020 Antonio Larrosa <alarrosaAATTsuse.com>- Update to 1.4.14: <BR>* documentation: + Improve omindex --help docs for --duplicates. <BR>* indexers: + Add built-in support for iso-8859-15 so we can handle it without iconv. This charset is a variant of iso-8859-1 with 8 characters changed, most notably including the euro currency symbol. It\'s the most commonly seen charset we didn\'t have built-in support for. <BR>* omega: + Fix error handling in $lookup. We now check for errors from cdb_init() and cdb_get(). We\'ve never checked for errors from cdb_init(), while for cdb_get() this bug was introduced by a warning fix in 1.2.20.- Update to 1.4.13: <BR>* documentation: + Document that $log will start to return an error message in 1.5.0, and that one can wrap it using a $if with no action now to be future-proof. <BR>* indexers: + Optimise converting us-ascii to UTF-8 to do nothing, like we already do when converting UTF-8 to UTF-8. + scriptindex: - Add new \'gap\' action which provides a way to leave a gap in the term positions between fields to prevent phrases and positional operators from matching across fields. <BR>* templates: + Future-proof use of $log against changes in 1.5.0.- Update to 1.4.12: <BR>* documentation: + Improve docs for OmegaScript $hitlist{}. + Fix RST formatting errors in omega docs. + Clarify use of Q prefix for unique ID terms - it was described as \"reserved\", but the use of \"Q\" is really just a convention (and in fact omindex uses \"U\" not \"Q\"). + Clarify scriptindex\'s weight action takes parameter >= 0. + Correct typo in OmegaScript $add parameter documentation. <BR>* indexers: + omindex: - Fix typo in mimetypes used for Apple iWork documents (\"apply\" instead of \"apple\") which meant that these documents weren\'t actually being indexed. Patch from Bruno Baruffaldi. - Pipe input to ps2pdf as this accepts input on stdin. + scriptindex: - If parsedate action\'s format includes %z adjust for the timezone if possible (this requires the non-POSIX tm_gmtoff member of struct tm) and flag an error for other platforms. - If parsedate action\'s format include %Z flag an error as that doesn\'t seem to be usefully supported by strptime() anywhere. - Fix parsedate action to treat formats without a timezone as being UTC instead of localtime. - Add date=unixutc. The existing date=unix works in localtime which is unhelpful if you want to use it on the output of parsedate since that\'s in UTC; date=unixutc is just like date=unix except it always works in UTC. - The date action now emits a warning for invalid values. The documentation used to say \"invalid values are ignored at present\", but it\'s more helpful to flag bad data than quietly ignore it. - We now check the date action\'s parameter at script parse time and unknown values result in an error and nothing being indexed. Previously an unknown format uselessly resulted in the terms D, M and Y literally being added to every document. - The split action now supports a new \"prefixes\" split style. This gives all the prefixes from the split, so split=/,prefixes on a file path gives all parent directories. <BR>* omega: + Remove documented limitation of $subdb and $subid - the implementation assumed that each omega database name corresponded to a single Xapian database, and if a database name referred to a stub database file expanding to multiple Xapian databases then they would misbehave. Such cases are now handled properly as well. + Extend $addfilter to support adding negated filters via a new optional second argument which specifies the type of filter to add. + Stop $sort from needlessly ensuring the match has run. + Handle corner case of nested $hitlist gracefully instead of potentially entering an infinite loop. <BR>* testsuite: + omegatest: Avoid setting TZ globally during tests as that hides bugs where behaviour depends on the local timezone when it shouldn\'t. + omegatest: Support testing when built using LeakSanitizer by suppressing leak reports for cached compiled pcre regular expressions. These aren\'t released when the program exits but aren\'t memory leaks. <BR>* build system: + Remove outdated deprecation warning suppression which was there to support building from git in the run up to 1.3.2 - a development version which is nearly 5 years ago now. <BR>* portability: + Fix problems with fallback strptime() implementation which was being included in the wrong binary, and was lacking a required const_cast on the return value. + Rework setenv() compatibility handling. Now that Solaris 9 is dead we can assume setenv() is provided by Unix-like platforms (POSIX requires it). For other platforms, provide a compatibility implementation of setenv() so the compatibility code is encapsulated in one place rather than replicated at every use.- Update to 1.4.11: <BR>* indexers: + omindex: - outlookmsg2html: Handle Subject, Date, and From headers. <BR>* omega: + In $div and $mod we were converting a non-zero denominator from string to int twice for no good reason. <BR>* testsuite: + omegatest: Fix testcase which was failing if the local timezone was behind UTC. This testcase was added in 1.4.10. + omegatest: Tweak to not fail when $time not supported - it seems that the OS time functions we use report an error on GNU Hurd for unknown reasons. <BR>* build system: + Sync up probes for OS time functions in omega\'s configure with those in xapian-core which may solve $time not being supported on GNU Hurd. <BR>* portability: + Add missing includes of <cerrno>. + Stop using htonl()/ntohl() in a non-network context which should improve portability to platforms without a POSIX-like socket API.- Update to 1.4.10: <BR>* documentation: + Use https for URLs where supported. <BR>* indexers: + omindex: - Index .apxl and .kth files as Apple Keynote. The .apxl extension is used for the XML files inside .key bundles/directories which hold the text content of the presentation, and by handling them we can index .key directories more usefully. It seems they are also sometimes found by themselves. Keynote themes have a .kth extension, and key2text can also handle these. - Pipe input to pdftotext, pdfinto and dpkg. These tools all support piping an input file on stdin, which can be a little more efficient when we already have the file open (e.g. to determine its type using libmagic, or to calculate its checksum). - An empty string for the start directory is now flagged as an error. Previously `/` was used instead, which is unlikely to be what is wanted (and `/` can be explicitly specified if that really is what is wanted). - Fix emulation of stderr redirection when the indexer\'s stderr has been closed. We try to avoid using the shell when running external filters, and emulate 2>/dev/null in commands, but if the indexer\'s stderr was closed this emulation was buggy and would make give the filter a closed stderr instead of one redirected to /dev/null. - When emulating redirection to /dev/null, we now open /dev/null once and dup that fd each time which is a little more efficient and simplifies the code. + scriptindex: - date=unix is now a no-op for empty input - previously it would unhelpfully add boolean date terms for 1970-01-01. - Warn for empty filename in LOAD action. Previously this gave a slightly confusing error: \"Couldn\'t load file \'\': No such file or directory\" - Unknown command-line options now cause scriptindex to give a non-zero exit status. <BR>* testsuite: + omegatest: Add testcase for SPAN.n on different slots. + omegatest: Update expected QueryParser output for the xapian-core change to produce flatter Query trees. <BR>* build system: + Use AM_ICONV to detect iconv() which should handle non-system install of GNU libiconv properly. <BR>* portability: + Provide fall-back strptime() implementation for platforms which don\'t provide it, using the C++11 std::get_time() function. We use strptime() directly where it\'s available as some older C++11 compilers seem to lack std::get_time() (GCC 4.8 for example). This is used by the parsedate action, which was added in 1.4.6.<BR>* Tue Nov 20 2018 Antonio Larrosa <alarrosaAATTsuse.com>- Update to 1.4.9 <BR>* indexers: + omindex: + Try harder to avoid opening a file being indexed more than once by reusing the file descriptor in more cases. + Hint to the OS not to cache output from external filters which require using a temporary file. + scriptindex: + If the LOAD action successfully opens a file but hits a read error the error message now reports the file name correctly. Previously it would report the partial file contents read so far instead of the file name. <BR>* portability: + We no longer call posix_fadvise() with POSIX_FADV_NOREUSE under Linux, since it\'s still not implemented there. We also now only call posix_fadvise() with POSIX_FADV_DONTNEED right before we close the file descriptor under Linux.- Update to 1.4.8 <BR>* documentation: + Assorted minor documentation improvements. <BR>* indexers: + omindex: + Improve date handling in .eml files. We now handle a \"Date:\" header without the day of the week, which is allowed by RFC822 and RFC2822 (though seems rare in practice). If the date can\'t be parsed, we now just omit the date information rather than failing to process the file. + Add support for indexing Apple iWork documents (Keynote (.key), Numbers (.numbers) and Pages (.pages)) using libetonyek. Currently only the file variants are handled since omindex doesn\'t currently support indexing a directory as a document. + Index Visio files using vsd2xhtml. + Extend --filter to support filters which produce SVG as output. + Handle SVG embedded in XML with svg: namespace prefix. + Add --read-filters option to read a list of filters from a file, each line of which is a rule as passed to --filter. Based on a patch from Gaurav Arora. + Add new --mime-type-match option which allows specifying a MIME Content-Type for a given shell filename pattern pattern (with the special Content-Type values \"ignore\" and \"skip\" supported, as for --mime-type). + Adjust --mime-type to allow \':\' in the extension. A valid MIME Content-Type can\'t contain a colon, so if the argument to --mime-type contains more than one colon it makes more sense to split at the <BR>*last<BR>* colon (we used to split at the first), as an extension could conceivably contain a colon. Mostly this change is for consistency with the new - -mime-type-match option, where the leafname pattern could reasonably contain a colon. + Remove failed entries for ignored files. If a file is mapped to pseudo-mimetype \"ignore\" then remove any existing failure record for it so that ignored files so we don\'t potentially end up with a lot of cruft failure records for files we are no longer trying to index. + If a file fails to index due to failing to allocate enough memory we now try to flag it as failed to index so it will be skipped by default on future runs. This should help to avoid indexing getting stuck on problematic files. + Add a \"pages\" field with the number of pages in the document where we know how to determine this (currently only for PDF files for which pdfinfo reports this information). + Handle initially empty database exactly the same was as when --overwrite is specified. This probably has no user-visible consequences, but it\'s cleaner for the handling to be exactly the same. + scriptindex: + Improve scriptindex diagnostic messages. All diagnostics are now labelled as \"error\", \"warning\" or \"note\" as appropriate, and we now consistently report \"FILE:LINE:\" (and also \"COLUMN:\" in most cases) to make it clearer where the problem lies. + Add new \"split\" action which splits the text on a specified delimiter and executes the following actions for each piece. Based on a patch by Gaurav Arora. + Missing whitespace after the closing \" on an action argument is now flagged as an error. Previously scriptindex would attempt to parse the following characters as the next action. + Support C-like escapes for quoted parameter values. Notably this means it is now possible to include `\"` in quoted parameter values. <BR>* omega: + Value-based date range filters can now be specified via CGI parameters START.N, END.N and/or SPAN.N where N is a value slot number, allowing multiple concurrent filters on different slots to be specified. + Support YYYY and YYYYMM limits in term-based date ranges. Previously value-based date ranges supported these as limits, but term-based date ranges gave an error. + Add stem_strategy option and deprecate existing stem_all option in favour of this new more versatile option. + Support \"natural\" $sort option via new flag \"#\" which sorts embedded natural numbers in numerical order. + Support numeric $sort option via new flag \"n\", similar to GNU sort -n. + Rewrite field parsing to be more efficient, and store fields in an unordered_map for faster lookup. <BR>* testsuite: + htmlparsetest: Test whitespace collapsing. <BR>* portability: + omegatest: Avoid \"set -\". The autoconf manual notes that POSIX no longer requires this, and that with traditional shells it resets -v and -x which makes debugging harder. + omegatest: Fix shell printf quoting issues which were a latent bug on macOS. + Drop special handling for Compaq C++. We never actually achieved a working build using it, and I can find no evidence that this compiler still exists, let alone that it was updated for C++11 which we now require.<BR>* Wed Oct 24 2018 Antonio Larrosa <alarrosaAATTsuse.com>- Update to 1.4.7 <BR>* New OmegaScript $unique command. The existing $uniq only removes adjacent entries (like the Unix uniq command) so to fully remove duplicates you need a sorted input. Sometimes it is desirable to remove duplicates from an unsorted list without changing the order of the entries which are left, so add $unique to do that. If the list is sorted already, then $uniq is more efficient. <BR>* Fix $map to cleanly reject a single argument. <BR>* templates/query: Merge multiple entries in the term frequency information, which came from searching several prefixes by default. Reported by Alistair Buxton on #xapian-discuss. <BR>* When multiple words with the same stem are in the query string we now fully eliminate duplicates when showing term frequency information.<BR>* Tue Jul 03 2018 alarrosaAATTsuse.com- Update to 1.4.6 <BR>* Fix generate_sample() (used by OmegaScript $truncate and omindex) to return an empty sample instead of throwing an exception when the requested sample size is less than the size of the truncation indicator string. Fixes https://trac.xapian.org/ticket/754 . <BR>* Check for the HTML5 doctype or legacy doctype declaration and use default charset UTF-8 if either is present. Previously we always used ISO-8859-1, which is correct for older HTML versions, but not for HTML5. <BR>* See also http://xapian.org/docs/xapian-omega-1.4.6/NEWS<BR>* Thu Dec 14 2017 alarrosaAATTsuse.com- Update to 1.4.5 <BR>* See http://xapian.org/docs/xapian-omega-1.4.5/NEWS<BR>* Fri Jun 30 2017 alarrosaAATTsuse.com- Update to 1.4.4 <BR>* See http://xapian.org/docs/xapian-omega-1.4.4/NEWS<BR>* Wed Feb 01 2017 alarrosaAATTsuse.com- Update to 1.4.3 <BR>* See http://xapian.org/docs/xapian-omega-1.4.3/NEWS- Update to 1.4.2 <BR>* See http://xapian.org/docs/xapian-omega-1.4.2/NEWS<BR>* Fri Nov 11 2016 alarrosaAATTsuse.com- Update to 1.4.1 <BR>* See http://xapian.org/docs/xapian-omega-1.4.1/NEWS<BR>* Sat Jul 09 2016 tittiatcokeAATTgmail.com- Update to 1.4.0 <BR>* See http://xapian.org/docs/xapian-omega-1.4.0/NEWS<BR>* Wed Apr 13 2016 tittiatcokeAATTgmail.com- Update to 1.2.23 <BR>* http://xapian.org/docs/xapian-omega-1.2.23/NEWS<BR>* Thu May 21 2015 tittiatcokeAATTgmail.com- Update to 1.2.21 <BR>* http://xapian.org/docs/xapian-omega-1.2.21/NEWS<BR>* Thu Mar 12 2015 mpluskalAATTsuse.com- Update to 1.2.20 <BR>* See http://xapian.org/docs/xapian-omega-1.2.20/NEWS<BR>* Thu Feb 19 2015 mpluskalAATTsuse.com- Tiny spec file cleanups- Enable tests during build- Add gpg signature<BR>* Tue Oct 21 2014 tittiatcokeAATTgmail.com- Update to 1.2.19 <BR>* See http://xapian.org/docs/xapian-omega-1.2.19/NEWS<BR></DIV> </td> <td bgcolor="#F2F2F2"> </td> </tr> </table></td> </tr> </table> </td></tr> <tr><td> <table width="100%" border="0" align="center" cellpadding="0" cellspacing="0"> <tr align="left" valign="top"> <td height="10" bgcolor="#A00B0F" width="100%"><img src="/images/spacer.gif" width="759" height="10" alt=""></td> </tr> </table> </td></tr> <tr><td> <table width="100%" border="0" align="center" cellpadding="0" cellspacing="0"> <tr align="left" valign="top"> <td height="10" width="150"><img src="/images/index_30.jpg" width="130" height="47" alt=""> </td> <td align="left"><A HREF="https://www.icm.edu.pl"><img src="/images/logo_ICM_nazwa_EN_czarne.png" height="40" alt="ICM"></A> </td> <td width="30%" align="right"><FORM ACTION="/out.php3" METHOD="post" TARGET="_blank"><INPUT TYPE="hidden" NAME="station" VALUE="1"><INPUT TYPE="image" SRC="/images/index_32.jpg" name="station" alt="Designed by station75" width="209" height="47" border="0"></FORM></td> </tr> </table> </td></tr> </table> </body> </html>