International ILIAS Blog

English-language blog on ILIAS with news and background information

Expert Search Tips using Lucene

Kruse, Fabian [Fabian] - 12. Feb 2016, 11:52

In order to find content in an installation, ILIAS offer you three different search possibilities: a direct search, an index search and a Lucene search.[1] Although the Lucene search has to be configured additionally, it is nevertheless the first choice as a search tool for large installations as it delivers accurate search results even with large repositories. The Lucene search is also popular as it allows files to be searched for content irrespective of their format (PDF, Word, Excel...).  Less well known, however, is Lucene's extensive syntax that supports searching with placeholders and various operators. Here are some useful tricks for using a Lucene search!

The search box in ILIAS already offers not just the option of entering the term you are searching for, but also the possibility to restrict your search to a specific part of a repository, to search only within certain types of Object and, from ILIAS 5.1 onwards, to search only for Objects that were created before or after a certain date. On top of these features, the Lucene syntax also offers:

  • ILIAS owner:homer searches for content with the term ILIAS, within Objects belonging to the user homer.
  • tag:ILIAS shows all ILIAS Objects that have been tagged with the term ILIAS.

The Lucene search also supports wildcards. To do this, certain symbols are used to replace single or multiple characters (letters or numbers) within the search term, thereby increasing the number of hits.

  • If you enter the search term organi?ation you will get hits for both organisation and organization. The ? stands as a placeholder for any single letter or number.
  • If you search for Politi* you will get, among others, the results Politics, Political, Politician, etc... By entering Lea*ing, you would get, amongst others, the results Learning, Leaving etc... The symbol * stands as a placeholder for multiple, single or zero characters.

Lucene searches also support the use of Boolean Operators such as AND or NOT. Using such operators allows you to define requirements and connections for your search terms.

  • Searching for templates AND course only yields hits in which both the search terms templates and course are present.
  • Entering +competency mathematics would result in hits that must contain the term competency and can contain the term mathematics.
  • Searching for member NOT group show all hits that include the term member but not the term group.
  • Entering didactic -"didactic templates" leads to hits that contain the term didactic but excludes results that include the combination of terms didactic templates.

And now..... have fun searching!

Thanks go to Stefan Meyer who has not only been the developer of the search for many years now, but also reminded me that I had already written about the Lucene search syntax in my ILIAS-Praxisbuch 2011 and could therefore reread that for today's blog post.


[1] Lucene is a programme library for full text searches distributed by the Apache Software Foundation. The library is subject to the Apache licence and is delivered as part of the ILIAS software packet. For further information - https://lucene.apache.org/core/

No comment has been posted yet.