
EPiServer Find external crawlers bringing back too much content


EPiServer Find Version - 13.0.1.0

Hi There,

I am new to EPiServer, EPiServer Find and EPiServer external content crawlers so be gentle.

I am currently working on a new EPiServer project and one of our requirements is to show search results in our new site based on crawled content from the old site.

However, the content that is returned contains all the text from the page, including menu items, advertising text, hidden text, etc. Is there a way to tell the connector to exclude all content that is not in a

tag, or exclude all content not within a certain div?
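
To make the "only content within a certain div" idea concrete, here is a minimal sketch in plain Python using the standard library's `html.parser`. It is not EPiServer Find or connector configuration, and whether the connector offers an equivalent setting is exactly what is being asked. The container id `main`, the class `ContainerTextExtractor`, and the function `extract_container_text` are hypothetical names, and the sketch assumes the old site's markup has balanced open and close tags.

```python
from html.parser import HTMLParser

# Void elements never get a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}
# Elements whose text content should never end up in the search excerpt.
SKIP_TAGS = {"script", "style"}


class ContainerTextExtractor(HTMLParser):
    """Collects text found inside the element with a given id (hypothetical helper)."""

    def __init__(self, container_id):
        super().__init__(convert_charrefs=True)
        self.container_id = container_id
        self.depth = 0          # 0 means we are outside the container
        self.skipping = False   # True while inside <script> or <style>
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self.depth:
            self.depth += 1
            if tag in SKIP_TAGS:
                self.skipping = True
        elif dict(attrs).get("id") == self.container_id:
            self.depth = 1      # entered the container

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or not self.depth:
            return
        self.depth -= 1
        if tag in SKIP_TAGS:
            self.skipping = False

    def handle_data(self, data):
        text = data.strip()
        if self.depth and not self.skipping and text:
            self.chunks.append(text)


def extract_container_text(html, container_id):
    """Return the whitespace-joined text inside the element with id=container_id."""
    parser = ContainerTextExtractor(container_id)
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)
```

With something like this applied to each crawled page before indexing, menu, advertising, and footer text outside the chosen container would be dropped. Hidden text inside the container would still come through, so that part of the problem is not addressed by this sketch.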

Currently we retrieve everything, search the excerpt for a certain set of words, and then exclude all text before them. Unfortunately, not all pages contain that set of words, and the client may change the page layout in the future, which would break this solution.
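
For reference, the workaround described above amounts to roughly the following. This is a simplified Python sketch, not the project's actual code: `strip_before_marker` and the example marker phrase are hypothetical, and it assumes the marker words themselves are kept in the result.

```python
def strip_before_marker(excerpt, markers):
    """Drop everything before the earliest marker phrase.

    Returns None when no marker occurs in the excerpt, which is the
    failure case: the page cannot be cleaned up this way.
    """
    positions = [excerpt.find(marker) for marker in markers]
    found = [pos for pos in positions if pos != -1]
    if not found:
        return None
    return excerpt[min(found):]


# Example with a made-up excerpt and marker phrase.
cleaned = strip_before_marker(
    "Home About Contact Buy now! Article starts here with the real text",
    ["Article starts"],
)
```

The `None` branch is where it falls over today, and a layout change that removes or moves the marker phrase would send every page down that branch.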

Thank you in advance and feel free to ask any questions.

#196112
Aug 20, 2018 4:38