Overview of the N-FOCUS Soundex
Soundex is the phonetic algorithm used by N-FOCUS to encode last names by their sound when pronounced in English. The basic aim is for names with the same pronunciation to be encoded as the same phonetic value so that matching can occur despite minor differences in spelling.
The Soundex code for a name as used by N-FOCUS consists of a letter followed by up to seven numbers: the letter is the first letter of the name, and the numbers encode the remaining consonants. Similar sounding consonants share the same number. Vowels are coded only when they appear as the first letter of a name.
The exact algorithm is as follows:
- Retain the first letter of the name.
- Encode remaining letters as numbers in
the following manner:
- 1 is assigned to B, F, P, V
- 2 is assigned to C, G, J, K, Q, S, X, Z
- 3 is assigned to D, T
- 4 is assigned to L
- 5 is assigned to M, N
- 6 is assigned to R
- Double letters are encoded as one number
- Values
not encoded include:
- A, E, I, O, U, Y, H, W
- JR, SR, and I through VIII
Examples:
Name | Soundex Value |
---|---|
Bakename | B255 |
Buchanan | B255 |
Betty | B3 |
Booth | B3 |
Boyd | B3 |
Jansen | J525 |
Jensen | J525 |
Johnson | J525 |
(11-2010)