This content is archived. See latest version here.

Last updated: Feb 09 2016


Episerver Find is a powerful, scalable query platform that can index and query large amounts of structured or unstructured data of any type, create custom search functionality, and build advanced navigation for non-hierarchical content. This topic describes built-in functionality and the underlying technology, as well as supported languages in Find.

Functionality

The following functionality is included out-of-the-box:

  • Multi-language stemming
  • Phonetics
  • Best bets
  • Related queries
  • Highlighted summaries
  • Autocomplete
  • Search-as-you-type
  • Search in files/attachments
  • Custom weighting of results
  • Statistics and search optimization

Technical foundation


Episerver Find is built using ElasticSearch. The Episerver Find API enforces a number of conventions and restrictions to be aware of, such as adding mappings (which is automatically handled by the conventions) and adding and removing indexes.

Integration

Episerver Find can be easily extended and integrated with Episerver CMS and Commerce, using a .NET based client API. Interaction with the REST API through Javascript is done using JSON. Episerver Find is available as an on-premise solution or a software service. Episerver Find is not limited to being used from .NET. 

Architecture overview

Supported languages

Supported languages undergo an analysis process that breaks down text based on specific language characteristics. This process is designed to give better search results based on how a language's words are constructed. While content in additional languages may be added to the index, no analysis is done for unsupported languages. To learn more about the analysis process, see https://www.elastic.co/guide/en/elasticsearch/guide/current/analysis-intro.html.

Likewise, Find optimization only works with supported languages. So Find's advanced search capabilities are not applied to unsupported languages. However, content in unsupported languages is added to the index so is searchable.

The following indexing languages are supported by Episerver Find:

  • Arabic
  • Armenian
  • Basque
  • Brazilian
  • Bulgarian
  • Catalan
  • Chinese
  • Cjk (Chinese, Japanese, and Korean)
  • Czech
  • Danish
  • Dutch
  • English
  • Finnish
  • French
  • Galician
  • German
  • Greek
  • Hindi
  • Hungarian
  • Indonesian
  • Irish
  • Italian
  • Latvian
  • Norwegian
  • Persian
  • Polish
  • Portuguese
  • Romanian
  • Russian
  • Spanish
  • Swedish
  • Thai
  • Turkish

Compound splitting

​​​Many European languages contain compound words, such as the English term "steel thermos." In Swedish, that phrase is "ståltermos," one word. Compound words adversely affect relevancy for normal free-text search engines, especially for e-commerce, ​and can result in lower conversion rates.

Episerver Find uniquely includes a feature called compound splitting, which analyzes each word and discovers compound words. To continue the previous example, a visitor can search for "termos" and get a relevant match for "ståltermos". Most search solutions (including Elastic Search) do not include such functionality. And, solutions that do usually employ a less sophisticated approach that does not give the same high relevancy and associated conversion rates.

Compound splitting is available for Swedish and Norwegian. For English, compound splitting is not necessary.

Related topics

Comments