Installing over an older version and other notes

Note
Ubuntu commands installed as snap packages can’t create arbitrary files under /tmp. This is for example the case by default for pdftk which is used by Recoll to extract PDF attachments. For best results, set TMPDIR to a location which belongs to you (e.g. inside your home, with something like export TMPDIR=~/tmp in your shell startup script). Recoll could conceivably work around the problem all by itself, but I find it in bad taste to create temporary files in an arbitrary location inside your home.

1.20-33 indexes are fully backward compatible. No need to reindex when upgrading.

Always reset the index if you do not know by which version it was created (e.g.: you’re not sure it’s at least 1.18). The best method is to quit all Recoll programs and delete the index directory (rm -rf ~/.recoll/xapiandb), then start recoll or recollindex.

recollindex -z will do the same in most, but not all, cases. It’s better to use the rm method, which will also ensure that no debris from older releases remain (e.g.: old stemming files which are not used any more).

On Windows, the index is located by default in C:/Users/[me]/AppData/Local/Recoll/xapiandb

Case/diacritics sensitivity is off by default. It can be turned on only by editing recoll.conf ( see the manual). If you do so, you must then reset the index.

Changes in Recoll 1.33

  • Query processing: simplify queries a bit more before sending to Xapian, was necessary to allow using OP_FILTER for path filtering. This is a relatively significant change with a potential for introducing issues, hence the medium version bump.

  • GUI: allow specifying a fixed geometry for the results list viewport by setting RECOLL_RESULTS_GEOMETRY=widthxheight . For people with fixed-width result formats CSS.

  • recollq: add option to extract a result document into a file.

  • Replace application/x-flac with audio/flac for FLAC audio files.

  • Fix web queue processing for non-default configuration directories.

  • Fix encoding issue in PDF attachment extraction.

  • GUI: result list: fix issue with webengine builds not displaying Icons. Paging still not working right with webengine (QTBUG-105842). Main builds revert/remain to webkit.

  • Misc. small adjustments.

Minor releases at a glance

  • 1.33.4

    • Minor change in rclaudio.py for better support of dsf files.

  • 1.33.3

    • New KDE Krunner plugin: get recoll search results in krunner. No changes needed to main recoll code, allowing this to appear in a minor release.

    • GUI: Fixed memory leak when opening / closing the side filter panel. Also, avoid rebuilding it in somes cases where it was not needed.

    • Extract comments from MS-office docx documents

    • Add recoll.ini to skippedNames to avoid hits on terms from previously failed searches.

  • 1.33.2

    • New x modifier to trigger term expansion in a phrase search. Previously, phrase searches were always verbatim, with no way to adjust.

    • New stemexpandphrases configuration parameter to default to expanding phrase terms.

    • Fix handling or file or directory names with white space at the end: an URL ending with such a name would be stripped of the space/tab characters, making it impossible to open the document.

    • Fixed document field aliasing not working if the alias name was not all lower case.

    • GUI: result list: do not display snippets without a highlight area. This could occur before when searching for a phrase or group, and the snippet contained the search terms, but not the phrase/proximity match.

    • GUI: fixed query processing losing some filtering information (e.g. dates, subspec) in some cases. Would esp. interfere with the use of the query fragments tool.

    • GUI: the preview window font was sometimes not adjusted in function of the result list one.

    • GUI: add thunderbird-open-message.sh script to help thunderbird open arbitrary message/rf822 files, even if their name extension is not .eml.

    • GUI: make Ctrl+Q work in the fragments window.

    • PDF: fixed XMP metadata extraction in some case where the XML structure was unexpected.

    • PDF: take advantage of newer poppler pdfinfo versions to extract custom properties (pdfinfo -custom).

    • rclaudio: allow outputting custom, non-standard tags. Better processing of multiple values.

    • Indexer: only update the status file if it actually changed: will avoid a lot of gratuitous disk writes.

  • 1.33.1

    • Fix the GUI simple search which was broken in 1.33.0 when switching filters on/off.

    • Windows GUI directory side filters: the computed paths were wrong on Windows.

    • Apply stemming to terms containing a single dash. These were not expanded before.

    • Make sure that a single double-quoted word is not stem-expanded (act as if it was capitalized). Expanding a quoted term is unexpected.

    • New textunknownasplain configuration variable to index all files with no known association but identified as text/xxx by the "file" or "xdg-mime" command.

    • Linux real time: fix monitoring under topdirs members which are symbolic links.

    • Exclude Tamil characters from unac processing (experimental for now).

    • Small updates to the build files to accommodate the new rclgrep utility.