Regular expression for the header keys in a quoted e-mail

Detection of header fields found in contained emails. Detected subjects and sent dates are compared to the rm_subject or rm_sentdate crawler base properties of root emails.

Field names like Subject or From are added by email programs to contained emails. If a field name is not specified in this table, its content is added to the previous field. For example, if the subject entered by the user has several lines, only the first line is marked Subject. The following lines are automatically added to the line marked as Subject. Therefore, all fields that appear in an email header should be specified in this table, even if they are not used for computing the thread hash value.

Note: The default regular expressions cover common email header types. For non-latin character sets, such as arabic or CJK characters, regular expressions must be added.

Location: Document model: Categorization > Content analysis > Thread computation Pre-processing and Re-Hash
Base property (header field)

Header field names as used in contained emails.

Allowed values: any character
Default value:
  • These header fields are listed by default.
    • SUBJECT
    • SENTDATE
    • FROM
    • TO
    • CC
    • IMPORTANCEFLAG
Regular expression for this header field

Regular expression that corresponds with a header field in a contained email.

Allowed values: Any character, except | which is used as separator between regular expressions.
Default value:
  • None