Episerver Find is built on Elasticsearch. The Episerver Find API enforces a number of conventions and restrictions that you should be aware of, such as how mappings are added (the conventions handle this automatically) and how indexes are added and removed.
Last updated: Dec 09 2016
Architecture and languages
Episerver Find is a powerful, scalable query platform that can index and query large amounts of structured or unstructured data of any type, create custom search functionality, and build advanced navigation for non-hierarchical content. This topic describes built-in functionality and the underlying technology, as well as supported languages in Find.
The following functionality is included out-of-the-box:
- Multi-language stemming
- Best bets
- Related queries
- Highlighted summaries
- Search in files/attachments
- Custom weighting of results
- Statistics and search optimization
Note: To ensure maximum availability and scaling flexibility, Episerver Find uses dynamic IP ranges. If, for example, you need to whitelist IP addresses in your firewall, make sure that the firewall supports domain-based whitelisting.
Note: Find does not support direct use of the JSON API, as there is no way to secure the connection without exposing the access key. In general, do not implement client-side requests directly to Find.
When Find is purchased, your organization orders an index and support for a specified set of languages. Search queries that find content in a supported language can deliver richer, more nuanced search results, because that content is run through a language analyzer, which breaks down text based on a language's characteristics. For example, the English analyzer might use stemming analysis to identify fish as the root word for fishing, fished, fishes, fisher, and fisherman. By understanding how a language's words are constructed, Find can recognize several versions of a word as the same term and thereby provide better search results. Likewise, Find optimization only works with supported languages.
While content in unsupported languages is added to the index, making it searchable, no analysis is done for such content. See Elasticsearch Analysis and Analyzers.
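The difference between analyzed and unanalyzed content can be illustrated with a toy model. The following is a minimal sketch, not Find's or Elasticsearch's analyzer: the class `StemmingSketch`, its suffix list, and the `Matches` helper are all hypothetical and exist only to show why stemming lets several word forms match the same term, while unanalyzed content matches only the exact word.

```csharp
using System;

public static class StemmingSketch
{
    // Hypothetical suffix list for illustration only.
    // A real language analyzer uses far richer rules than suffix stripping.
    static readonly string[] Suffixes = { "erman", "ing", "ed", "es", "er" };

    // Reduce a word to a crude root by removing the first matching suffix.
    public static string Stem(string word)
    {
        var lower = word.ToLowerInvariant();
        foreach (var suffix in Suffixes)
        {
            // Require a few characters to remain so short words are left alone.
            if (lower.Length > suffix.Length + 2 &&
                lower.EndsWith(suffix, StringComparison.Ordinal))
            {
                return lower.Substring(0, lower.Length - suffix.Length);
            }
        }
        return lower;
    }

    // analyzed == true models a supported language (roots are compared);
    // analyzed == false models an unsupported language (exact word only).
    public static bool Matches(string query, string indexedWord, bool analyzed)
    {
        return analyzed
            ? Stem(query) == Stem(indexedWord)
            : string.Equals(query, indexedWord, StringComparison.OrdinalIgnoreCase);
    }

    public static void Main()
    {
        foreach (var word in new[] { "fishing", "fished", "fishes", "fisher", "fisherman" })
        {
            Console.WriteLine($"{word}: analyzed={Matches("fish", word, true)}, " +
                              $"unanalyzed={Matches("fish", word, false)}");
        }
    }
}
```

In this sketch, a query for fish matches all five word forms when analysis is applied and none of them when it is not, which mirrors the richer results described for supported languages.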
Episerver Find's analysis and optimization features can work with these languages.
- Cjk (Chinese, Japanese, and Korean)
See also: Language-specific queries
Many European languages form compound words. For example, the English phrase "steel thermos" is a single word in Swedish: "ståltermos." Compound words adversely affect relevancy for normal free-text search engines, especially for e-commerce, and can result in lower conversion rates.
Episerver Find includes a feature called compound splitting, which analyzes each word and discovers compound words. To continue the previous example, a visitor can search for "termos" and get a relevant match for "ståltermos". Most search solutions (including Elasticsearch) do not include such functionality, and those that do usually employ a less sophisticated approach that does not give the same high relevancy and associated conversion rates.
Compound splitting is available for Swedish and Norwegian.
Turning decompounding on/off [New in Episerver.Find 12.3.0]
The default query setting is to not decompound the query string. To enable decompounding, use this syntax:
.For("query", x => x.Analyzer = Language.Swedish.Analyzer)
Here is how it works. If a user submits the search term fotbollsmatch, the query matches only fotbollsmatch/er/en/… and not (as it did previously) fotboll/ar/en/… and match/er/en/….
On the other hand, if a user submits the search term fotboll, the search matches fotboll/ar/en…, fotbollsmatch/er/en/…, and fotbollsplan/er/en.
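The matching behavior described above can be modeled with a small, self-contained sketch. This is not how Find implements decompounding: the class `DecompoundingSketch`, the splitting table `Parts`, and the `Search` helper are hypothetical, and inflected forms (the /er/en/… endings) are ignored. The sketch assumes that indexed compound words are always split into their parts, and that the `decompoundQuery` flag decides whether the query string is split as well.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class DecompoundingSketch
{
    // Hypothetical splitting table standing in for a real decompounder.
    static readonly Dictionary<string, string[]> Parts = new Dictionary<string, string[]>
    {
        ["fotbollsmatch"] = new[] { "fotboll", "match" },
        ["fotbollsplan"] = new[] { "fotboll", "plan" },
    };

    // Terms stored for an indexed word: the word itself plus its parts, if any.
    static IEnumerable<string> IndexTerms(string word)
    {
        return Parts.TryGetValue(word, out var parts)
            ? new[] { word }.Concat(parts)
            : new[] { word };
    }

    // Terms searched for: the query is split only when decompounding is enabled.
    static IEnumerable<string> QueryTerms(string query, bool decompoundQuery)
    {
        return decompoundQuery ? IndexTerms(query) : new[] { query };
    }

    // A document matches when it shares at least one term with the query.
    public static List<string> Search(IEnumerable<string> documents, string query, bool decompoundQuery)
    {
        var queryTerms = new HashSet<string>(QueryTerms(query, decompoundQuery));
        return documents
            .Where(doc => IndexTerms(doc).Any(term => queryTerms.Contains(term)))
            .ToList();
    }

    public static void Main()
    {
        var docs = new[] { "fotboll", "match", "fotbollsmatch", "fotbollsplan" };

        // Default (query not decompounded): prints "fotbollsmatch"
        Console.WriteLine(string.Join(", ", Search(docs, "fotbollsmatch", false)));

        // Default (query not decompounded): prints "fotboll, fotbollsmatch, fotbollsplan"
        Console.WriteLine(string.Join(", ", Search(docs, "fotboll", false)));

        // Query decompounded: the parts "fotboll" and "match" also become query terms.
        Console.WriteLine(string.Join(", ", Search(docs, "fotbollsmatch", true)));
    }
}
```

In this model, leaving the query undecompounded keeps a search for a specific compound narrow, while a search for one of its parts still reaches every compound that contains it.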