Street Rules Description
There are 17 predefined street parser rules, each of which is described in this topic. The rules are applied in the order listed, which means if you disable a rule, it may have an impact on the rules that follow.
Guidelines
■ The six STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULEx rules are applied prior to pattern look up.
■ The eleven STREET_LEVELx_RULEy rules are applied when a pattern is not found.
■ It is a good idea to familiarize yourself with the terms used in the intrinsic attributes and specific attributes topics before proceeding.
■ In this topic, horizontal lines are used to group related rules.
Allow only one COMPLEX-TYPE attribute to remain
(STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULE1)
This rule allows only one COMPLEX-TYPE attribute to remain in a street pattern. Any preceding COMPLEX-TYPE attributes revert back to their intrinsic attribute. For example:
GRAND ESTATE APARTMENTS
ALPHA COMPLEX-TYPE COMPLEX-TYPE
becomes
ALPHA ALPHA COMPLEX-TYPE
Force any HIGHWAY or ROUTE attribute without pairs to STREET-TYPE
(STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULE2)
This rule reassigns any HIGHWAY or ROUTE attribute without a paired number to STREET-TYPE. For example, in ROUTE 66, the pattern is ROUTE ROUTE-NUM. In 209 MIDDLESEX ROUTE, ROUTE is assigned a STREET-TYPE attribute.
Force any UNITS without pairs to COMPLEX-TYPE
(STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULE3)
The UNIT and UNIT-NUM attributes are usually paired as the name and number of a unit or building (for example, Building 12). This rule assigns the COMPLEX-TYPE attribute to any UNIT without a corresponding UNIT-NUM. For example:
JOHN HANCOCK BUILDING
ALPHA ALPHA UNIT
becomes
ALPHA ALPHA COMPLEX-TYPE
Activate processing of double DIRECTIONS
(STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULE4)
This rule combines two adjacent DIRECTION attributes into a single DIRECTION attribute. For example:
101 NORTH EAST MAIN STREET
HSNO DIR DIR ALPHA STREET-TYPE
becomes
HSNO DIR ALPHA STREET-TYPE
Split hyphens when processing houses (except in Hawaii, New York or valid range)
(STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULE5)
If a token contains a hyphenated HSNO, this rule splits the token into two separate tokens (except as noted above). For example, if the token contains 10-11, the rule splits it into 10 and 11, and assigns the HSNO attribute to each.
Combine multiple ALPHA tokens
(STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULE6)
This rule instructs the CDP to combine multiple ALPHA tokens into a single ALPHA token. For example:
GREAT QUEEN ROAD
ALPHA ALPHA TYPE
becomes
ALPHA TYPE
Note: The rule STREET_LEVEL6_RULE1 performs similarly to STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULE6, except that it is run only after five failed pattern look ups. Therefore, both rules must be set to ‘N’ to avoid combining adjacent ALPHA tokens into a single token.
__________________________________________________
Skip leading CARE-OF attribute
(STREET_LEVEL2_RULE1)
This rule instructs the CDP to skip a token with the CARE-OF attribute if it is the first token on the line. For example:
C/O 123 MAIN STREET
CARE-OF HSNO ALPHA STREET-TYPE
becomes
HSNO ALPHA STREET-TYPE
Skip IGNORE attribute and everything that follows it
(STREET_LEVEL2_RULE2)
The rule tells the CDP to ignore a token with an IGNORE attribute, and any tokens that follow it.
Remove pairs of two-part tokens
(STREET_LEVEL2_RULE3)
This rule removes the readily identified pairs of two-part tokens and stores the data in the pr_dwelling_name and pr_dwelling_number fields. For example:
23 Davis Rd Floor 2 Apt 35
HSNO ALPHA STREET-TYPE FLOOR FLOOR-NUM APT APT-NUM
The FLOOR and APT pairs are removed from the line and pattern and the data stored in pr_dwelling1 name and number and pr_dwelling2 name and number fields, respectively.
_________________________________________________
Force STREET-TYPE1 and SEC-STR-TYPE to STREET-TYPE
(STREET_LEVEL3_RULE1)
This rule reassigns tokens of STREET-TYPE1 (leading street type) and SEC-STR-TYPE to STREET-TYPE. For example:
123 Bowling Green St
HSNO ALPHA SEC-STR-TYPE STREET-TYPE
becomes
HSNO ALPHA STREET-TYPE STREET-TYPE
Force STREET to ALPHA
(STREET_LEVEL3_RULE2)
Certain words are assigned a STREET-NAME attribute in the CDP table. This rule reassigns any STREET-NAME attribute to an ALPHA attribute. For example:
101 Main St
HSNO STREET-NAME STREET-TYPE
becomes
HSNO ALPHA STREET-TYPE
Force NUMBER to ALPHA
(STREET_LEVEL3_RULE3)
The NUMBER attribute is a house number or street name expressed as a word. For example, in 25 Fifth Ave, FIFTH would have a NUMBER attribute. The initial pattern is HSNO NUMBER STREET-TYPE. This becomes HSNO ALPHA STREET-TYPE after the rule is applied.
Force ALPHA-1SPECIAL with hyphen (-) or quote (") to ALPHA
(STREET_LEVEL3_RULE4)
The ALPHA-1SPECIAL attribute represents characters that are all alphabetic except for one that can be any non-alphabetic and non-numeric character. This rule reassigns the ALPHA attribute to any token that contains a hyphen or quote mark and has the ALPHA-1SPECIAL attribute. For example:
Werner-vonBraun Str 25
ALPHA-1SPECIAL STREET-TYPE NUMERIC
becomes
ALPHA STREET-TYPE NUMERIC
__________________________________________________
Force S-DIRECTION to DIRECTION
(STREET_LEVEL4_RULE1)
The S-DIRECTION attribute is typically the abbreviated form (N, S, E, NE, SE, NW, SW) of a DIRECTION (North, South, East, West, and so on). This rule reassigns an S-DIRECTION attribute to a DIRECTION attribute. For example:
101 N Main St
HSNO S-DIRECTION ALPHA STR-TYPE
becomes
HSNO DIRECTION ALPHA STR-TYPE
__________________________________________________
Force non-last TYPE to ALPHA
(STREET_LEVEL5_RULE1)
This rule assigns any STREET-TYPE attribute that is not the last in the line to an ALPHA attribute. For example:
55 MAPLE STREET RD
HSNO ALPHA STREET-TYPE STREET-TYPE
becomes
HSNO ALPHA ALPHA STREET-TYPE
Force a single ALPHA preceded by HSNO to APARTMENT-NUM
(STREET_LEVEL5_RULE2)
This rule assigns the attribute APARTMENT-NUM to a token with a 1ALPHA intrinsic attribute that is preceded by an HSNO token. For example:
101 A Main St
HSNO 1ALPHA ALPHA STREET-TYPE
becomes
HSNO APARTMENT-NUM ALPHA STREET-TYPE
__________________________________________________
Combine adjacent ALPHA tokens into a single ALPHA
(STREET_LEVEL6_RULE1)
This rule combines adjacent ALPHA tokens into a single token with an ALPHA attribute. This feature greatly simplifies patterns for multi-word street or complex names. For example:
63 Martin Luther King Blvd
HSNO ALPHA ALPHA ALPHA STREET-TYPE
or
68 John Kennedy Street
HSNO ALPHA ALPHA STREET-TYPE
both become
HSNO ALPHA STREET-TYPE
where the ALPHA attribute logically encompasses all of the original ALPHA tokens.
Note: | The rule STREET_BEFORE_PATTERN_BUS_WORD_PROC_RULE6 performs similarly to STREET_LEVEL6_RULE1, except that it is run prior to pattern look up. Therefore, both rules must be set to ‘N’ to avoid combining adjacent ALPHA tokens into a single token. |