Show Menu
TOPICS×

About Words & Language

You can use Words & Language to determine how search terms are matched to the content of your web pages.

Using Words & Language

Before the effects of Words & Language settings are available to site visitors, including any changes you make to those settings, you must regenerate your site index. Regenerating, unlike indexing, does not involve crawling your web pages and takes only a few seconds.

Configuring how search terms are matched to your web content

You can use Words & Language to determine how site search/merchandising matches search terms to the content of your web pages.
To configure how search terms are matched to your web content
  1. On the product menu, click Linguistics > Words & Language .
  2. On the Words & Languages page, set the options that you want.
    Option
    Description
    Case Sensitivity
    Not selected by default.
    Determines whether uppercase letters are distinguished from lowercase letters. For example, when selected, "Succeed" is distinguished from "succeed", and the search results can vary between the two.
    Diacritic Sensitivity
    Selected by default.
    Determines whether words that contain diacritic characters are distinguished from words that do not. For example, when selected, "pagina" is distinguished from "página". Deselect this option if you have a website that uses non-English languages.
    Numbers
    Selected by default.
    Determines whether words that contain digits are indexed.
    Ignore Apostrophes
    Not selected by default.
    Apostrophes are removed from queries. For example, a search for "Tree's" would return the same results as a search for "Trees".
    Ignore Hyphens
    Not selected by default.
    Hyphens are removed from queries. For example, a search for "blue-bell" would return the same results as a search for "bluebell".
    Partial Alphanumeric Matching
    Not selected by default.
    When selected, this option lets you split tokens on alphabetical-numerical transitions to allow free-text matches on part or product tokens.
    For example, suppose that you have a product identifier of 910XT in the body content of one or more pages on a website. When this option is not selected, Adobe Search&Promote finds matches for this product identifier when searching for 910XT . And, with Search Concat-Div-Enable turned on, Adobe Search&Promote would also find 910 XT . However, it would not find instances of 910 or XT exclusively.
    When you select Partial Alphanumeric Matching , the indexer breaks these mixed alphanumeric tokens into multiple tokens. For example, a product identifier such as XYZ123 is indexed into three tokens: XYZ123 , XYZ , and 123 . Such functionality allows for search-time free-text matching on any of these variants.
    In another example, suppose that you have the product identifier AB910XT . If you select Partial Alphanumeric Matching and have Search Concat-Div-Enable turned on, Adobe Search&Promote indexes it as AB910XT , AB , 910 , and XT . Then, when a user searches for 910XT , for example, the search expands to also find instances of 910XT , 910 , or XT .
    Note: Search Concat-Div-Enable is not enabled by default. Contact Technical Support to activate the feature for your use.
    Note: Partial Alphanumeric Matching is applied globally to all indexed fields. However, it only affects free-text matching; it does not affect exact matching or range matching.
    Sound-Alike Matching
    Selected by default.
    Words that sound alike are matched such as "health" and "helth". This feature allows your customer to easily search despite misspelling a word.
    Alternate Word Forms
    Default is Default Alternate Word Forms .
    You can select from the following options in the Alternate Word Forms drop-down list:
    If phrase stemming is enabled in Adobe Search&Promote , be aware that alternate word forms also occur within phrases.
    Language
    Default is English (United States) .
    The selected language ensures that date and numeric values are parsed according to the conventions used in the selected part of the world.
    When Alternate Word Forms is set to Default Alternate Word Forms or to Domain Dictionary , word forms and word endings change according to the linguistic rules for the selected language.
    By default, the Language setting is not used to determine the language of pages read from your website. The language for a read page is determined from its HTTP headers or from metatags within the page itself. Your website could contain pages in many different languages. Each page is correctly read and indexed, regardless of the language that is selected here.
    If you use a Unicode character set encoding such as UTF-8 for some pages on your website, make sure that the language for each of those pages is correctly specified. If the appropriate HTTP headers or metatags do not exist for your Unicode documents, you can use Settings > Metadata > Injections to specify the appropriate language.
    Check Apply to documents with no specified language? to use the Language setting for pages read from your website that have no explicit setting. Use this setting when only some of your documents do not have language settings. Use Settings > Metadata > Injections if either none of your documents have language settings, or the set of affected documents is a well-known and manageably small list.
    Use Decompounder?
    Note: This feature is only used for Danish and German. Also, this feature is not enabled by default. Contact Technical Support to activate the feature for your use. After it is enabled, the Use Decompounder? option only appears in the user interface if you select Danish or German from the Language drop-down list described earlier in this table.
    When you select Use Decompounder? , the service breaks down Danish or German compound words, which allow the indexing of component words along with the original compound words.
    To see how this feature works, enter words into the text field, and then click Test .
  3. Click Save Settings .
  4. To preview the results of your changes, click regenerate your staged site index to rebuild your staged website index.
  5. (Optional) Do one of the following: