Which of the following is *not* a mapping rule for token canonicalization?
1) Removing characters such as hyphens, periods, and accents.
2) Reducing all letters to lower case (case-folding).
3) Translating words from other languages to standard English.
4) Expanding abbreviations into their full form (In4matx → informatics).
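
For reference, the two character-level operations named in options 1 and 2 could be sketched in Python roughly as follows. This is only an illustration under assumptions: the function names `strip_marks` and `case_fold` are hypothetical, and the sketch says nothing about which option answers the question.

```python
import unicodedata


def strip_marks(token: str) -> str:
    """Drop hyphens, periods, and accent marks from a token (hypothetical helper)."""
    # NFKD splits an accented letter into its base letter plus a combining mark.
    decomposed = unicodedata.normalize("NFKD", token)
    return "".join(
        ch for ch in decomposed
        if not unicodedata.combining(ch) and ch not in "-."
    )


def case_fold(token: str) -> str:
    """Reduce all letters to lower case (hypothetical helper)."""
    return token.casefold()


print(strip_marks("U.S.A."))                 # USA
print(strip_marks("naïve"))                  # naive
print(case_fold("Windows"))                  # windows
print(case_fold(strip_marks("Co-Operate")))  # cooperate
```

Applying both functions to every token maps surface variants such as "U.S.A." and "usa" to the same string.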