Assigning a Parsing Culture to a Record

When you configure an Open Parser stage to use culture-specific parsing grammars, the parsing grammars for each culture are applied to each input record in the order the cultures are listed in the Open Parser stage. However, if you want to apply a specific culture's parsing grammar to a record, you can add a field named CultureCode. The field must contain one of the supported culture codes listed in the following table.

Culture Codes

Culture codes consist of a two-letter lowercase language code and a two-letter uppercase country or region code. For example, "es-MX" for Spanish (Mexico) and "en-US" for English (United States). In cases where a two-letter language code is not available, a three-letter code is used, for example "uz-Cyrl-UZ" for Uzbek (Uzbekistan, Cyrillic). A language is specified by only the two-digit lowercase language code. For example, "fr" specifies the neutral culture for French, and "de" specifies the neutral culture for German.

Note: There are two culture names that follow a different pattern. The cultures "zh-Hans" (Simplified Chinese) and "zh-Hant" (Traditional Chinese) are neutral cultures. The culture names represent the current standard and should be used unless you have a reason for using the older names "zh-CHS" and "zh-CHT".

The following table shows the supported culture codes.

Language (Culture/Region)

Culture Code

Global Culture

Global Culture

Afrikaans

af

Afrikaans (South Africa)

af-ZA

Albanian

sq

Albanian (Albania)

sq-AL

Arabic

ar

Arabic (Algeria)

ar-DZ

Arabic (Bahrain)

ar-BH

Arabic (Egypt)

ar-EG

Arabic (Iraq)

ar-IQ

Arabic (Jordan)

ar-JO

Arabic (Kuwait)

ar-KW

Arabic (Lebanon)

ar-LB

Arabic (Libya)

ar-LY

Arabic (Morocco)

ar-MA

Arabic (Oman)

ar-OM

Arabic (Qatar)

ar-QA

Arabic (Saudi Arabia)

ar-SA

Arabic (Syria)

ar-SY

Arabic (Tunisia)

ar-TN

Arabic (U.A.E.)

ar-AE

Arabic (Yemen)

ar-YE

Armenian

hy

Armenian (Armenia)

hy-AM

Azeri

az

Azeri (Azerbaijan, Cyrillic)

az-Cyrl-AZ

Azeri (Azerbaijan, Latin)

az-Latn-AZ

Basque

eu

Basque (Basque)

eu-ES

Belarusian

be

Belarusian (Belarus)

be-BY

Bulgarian

bg

Bulgarian (Bulgaria)

bg-BG

Catalan

ca

Catalan (Catalan)

ca-ES

Chinese

zh

Chinese (Hong Kong SAR, PRC)

zh-HK

Chinese (Macao SAR)

zh-MO

Chinese (PRC)

zh-CN

Chinese (Simplified)

zh-Hans

Chinese (Singapore)

zh-SG

Chinese (Taiwan)

zh-TW

Chinese (Traditional)

zh-Hant

Croatian

hr

Croatian (Croatia)

hr-HR

Czech

cs

Czech (Czech Republic)

cs-CZ

Danish

da

Danish (Denmark)

da-DK

Divehi

dv

Divehi (Maldives)

dv-MV

Dutch

nl

Dutch (Belgium)

nl-BE

Dutch (Netherlands)

nl-NL

English

en

English (Australia)

en-AU

English (Belize)

en-BZ

English (Canada)

en-CA

English (Caribbean)

en-029

English (Ireland)

en-IE

English (Jamaica)

en-JM

English (New Zealand)

en-NZ

English (Philippines)

en-PH

English (South Africa

en-ZA

English (Trinidad and Tobago)

en-TT

English (United Kingdom)

en-GB

English (United States)

en-US

English (Zimbabwe)

en-ZW

Estonian

et

Estonian (Estonia)

et-EE

Faroese

fo

Faroese (Faroe Islands)

fo-FO

Farsi

fa

Farsi (Iran)

fa-IR

Finnish

fi

Finnish (Finland)

fi-FI

French

fr

French (Belgium)

fr-BE

French (Canada)

fr-CA

French (France)

fr-FR

French (Luxembourg)

fr-LU

French (Monaco)

fr-MC

French (Switzerland)

fr-CH

Galician

gl

Galician (Spain)

gl-ES

Georgian

ka

Georgian (Georgia)

ka-GE

German

de

German (Austria)

de-AT

German (Germany)

de-DE

German (Liechtenstein)

de-LI

German (Luxembourg)

de-LU

German (Switzerland)

de-CH

Greek

el

Greek (Greece)

el-GR

Gujarati

gu

Gujarati (India)

gu-IN

Hebrew

he

Hebrew (Israel)

he-IL

Hindi

hi

Hindi (India)

hi-IN

Hungarian

hu

Hungarian (Hungary)

hu-HU

Icelandic

is

Icelandic (Iceland)

is-IS

Indonesian

id

Indonesian (Indonesia)

id-ID

Italian

it

Italian (Italy)

it-IT

Italian (Switzerland)

it-CH

Japanese

ja

Japanese (Japan)

ja-JP

Kannada

kn

Kannada (India)

kn-IN

Kazakh

kk

Kazakh (Kazakhstan)

kk-KZ

Konkani

kok

Konkani (India)

kok-IN

Korean

ko

Korean (Korea)

ko-KR

Kyrgyz

ky

Kyrgyz (Kyrgyzstan)

ky-KG

Latvian

lv

Latvian (Latvia)

lv-LV

Lithuanian

lt

Lithuanian (Lithuania)

lt-LT

Macedonian

mk

Macedonian (Macedonia, FYROM)

mk-MK

Malay

ms

Malay (Brunei Darussalam)

ms-BN

Malay (Malaysia)

ms-MY

Marathi

mr

Marathi (India)

mr-IN

Mongolian

mn

Mongolian (Mongolia)

mn-MN

Norwegian

no

Norwegian (Bokmål, Norway)

nb-NO

Norwegian (Nynorsk, Norway)

nn-NO

Polish

pl

Polish (Poland)

pl-PL

Portuguese

pt

Portuguese (Brazil)

pt-BR

Portuguese (Portugal)

pt-PT

Punjabi

pa

Punjabi (India)

pa-IN

Romanian

ro

Romanian (Romania)

ro-RO

Russian

ru

Russian (Russia)

ru-RU

Sanskrit

sa

Sanskrit (India)

sa-IN

Serbian

sr

Serbian (Serbia, Cyrillic)

sr-Cyrl-CS

Serbian (Serbia, Latin)

sr-Latn-CS

Slovak

sk

Slovak (Slovakia)

sk-SK

Slovenian

sl

Slovenian (Slovenia)

sl-SI

Spanish

es

Spanish (Argentina)

es-AR

Spanish (Bolivia)

es-BO

Spanish (Chile)

es-CL

Spanish (Colombia)

es-CO

Spanish (Costa Rica)

es-CR

Spanish (Dominican Republic)

es-DO

Spanish (Ecuador)

es-EC

Spanish (El Salvador)

es-SV

Spanish (Guatemala)

es-GT

Spanish (Honduras)

es-HN

Spanish (Mexico)

es-MX

Spanish (Nicaragua)

es-NI

Spanish (Panama)

es-PA

Spanish (Paraguay)

es-PY

Spanish (Peru)

es-PE

Spanish (Puerto Rico)

es-PR

Spanish (Spain)

es-ES

Spanish (Spain, Traditional Sort)

es-ES_tradnl

Spanish (Uruguay)

es-UY

Spanish (Venezuela)

es-VE

Swahili

sw

Swahili (Kenya)

sw-KE

Swedish

sv

Swedish (Finland)

sv-FI

Swedish (Sweden)

sv-SE

Syriac

syr

Syriac (Syria)

syr-SY

Tamil

ta

Tamil (India)

ta-IN

Tatar

tt

Tatar (Russia)

tt-RU

Telugu

te

Telugu (India)

te-IN

Thai

th

Thai (Thailand)

th-TH

Turkish

tr

Turkish (Turkey)

tr-TR

Ukrainian

uk

Ukrainian (Ukraine)

uk-UA

Urdu

ur

Urdu (Pakistan)

ur-PK

Uzbek

uz

Uzbek (Uzbekistan, Cyrillic)

uz-Cyrl-UZ

Uzbek (Uzbekistan, Latin)

uz-Latn-UZ

Vietnamese

vi

Vietnamese (Vietnam)

vi-VN