Usage of Soundex Codes for Surnames

The Soundex/Miracode system is an alphanumeric code designed to keep names that sound similar together, regardless of their spelling. Only certain key letters are coded; vowels are ignored. The result is a letter followed by exactly three digits.

Soundex codes are used to index the persons listed in the federal census records. The 1880 census was the first to have Soundex codes applied.

To determine the Soundex code for a given surname, use the following rules:

  1. Write down the first letter of the surname.
  2. Code the letters following the first letter into a three-digit number using their Soundex codes. As soon as three digits have been obtained, ignore all remaining letters.
  3. Code all double letters (e.g. the 'll' in William) as a single letter.
  4. If the initial letter and the second letter have the same code, ignore the second letter.
  5. If two different consecutive letters have the same code, ignore the second letter.
  6. If all of the letters are used up before three digits have been determined, use zero for the remaining digits.
  7. Ignore surname prefixes. These include: D', De, de, dela, della, Di, Du, La, Le, Van, Von.

Soundex is not perfect. Consider the following:

The Soundex codes are:

Code  Letters
0     A, E, H, I, O, U, W, Y
1     B, F, P, V
2     C, G, J, K, Q, S, X, Z
3     D, T
4     L
5     M, N
6     R
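
The rules and the code table above can be sketched in Python as follows. This is a minimal illustration of the procedure as this page describes it, not an official implementation. The names `soundex`, `strip_prefix`, `LETTER_CODE` and `PREFIXES` are made up for the example. One assumption is labeled in the code: a prefix is dropped only when it stands as a separate word or is the apostrophe form D', so a run-together name such as DuBois keeps its D.

```python
# Letter-to-digit table copied from the code table above; "0" marks letters
# that are never written into the code.
CODE_GROUPS = {
    "0": "AEHIOUWY",
    "1": "BFPV",
    "2": "CGJKQSXZ",
    "3": "DT",
    "4": "L",
    "5": "MN",
    "6": "R",
}
LETTER_CODE = {
    letter: digit for digit, letters in CODE_GROUPS.items() for letter in letters
}

# Prefixes listed in rule 7, lowercased for comparison.
PREFIXES = ("de", "dela", "della", "di", "du", "la", "le", "van", "von")


def strip_prefix(surname: str) -> str:
    """Rule 7. Assumption: only a separate-word prefix or a leading D' is dropped."""
    name = surname.strip()
    if name.lower().startswith("d'"):
        return name[2:]
    parts = name.split(None, 1)
    if len(parts) == 2 and parts[0].lower() in PREFIXES:
        return parts[1]
    return name


def soundex(surname: str) -> str:
    # Keep only the letters A-Z; apostrophes, spaces and hyphens are dropped.
    letters = [ch for ch in strip_prefix(surname).upper() if ch in LETTER_CODE]
    if not letters:
        raise ValueError("surname contains no codable letters")

    digits = []
    # Starting from the first letter's code implements rule 4.
    previous = LETTER_CODE[letters[0]]
    for letter in letters[1:]:
        code = LETTER_CODE[letter]
        # Rules 3 and 5: skip a letter whose code repeats the letter before it.
        if code != "0" and code != previous:
            digits.append(code)
            if len(digits) == 3:  # rule 2: stop once three digits are found
                break
        previous = code

    # Rule 6: pad with zeros up to three digits.
    return letters[0] + "".join(digits).ljust(3, "0")
```

With these assumptions, `soundex("Van Lind")` returns `"L530"` and `soundex("McGhee")` returns `"M200"`.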

Some examples are:

Name          Letters Coded  Soundex Code
Allricht      l,r,c          A462
de Mille      l              M400
DuBois        b,s            D120
Eberhard      b,r,r          E166
Engebrethson  n,g,b          E521
Heimbach      m,b,c          H512
Hanselmann    n,s,l          H524
Henzelmann    n,z,l          H524
Herman        r,m,n          H655
Hildebrand    l,d,b          H431
Kavanagh      v,n,g          K152
Lukaschowsky  k,s,s          L222
McDonnell     c,d,n          M235
McGee         c              M200
McGhee        c              M200
O'Brien       b,r,n          O165
Opnian        p,n,n          O155
Oppenheimer   p,n,m          O155
Riedemanas    d,m,n          R355
Scott         t              S300
Van Lind      n,d            L530
Waggoner      g,n,r          W256
Zita          t              Z300
Zitzmainn     t,z,m          Z325
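
To illustrate how the codes keep similar-sounding names together in an index, the short Python sketch below groups a few of the name and code pairs from the examples above and lists the codes shared by more than one surname. The names `EXAMPLES` and `group_by_code` are hypothetical and used only for this illustration; the data is copied from the examples, not computed.

```python
from collections import defaultdict

# (surname, Soundex code) pairs copied from the examples above.
EXAMPLES = [
    ("Hanselmann", "H524"),
    ("Henzelmann", "H524"),
    ("Herman", "H655"),
    ("McGee", "M200"),
    ("McGhee", "M200"),
    ("Opnian", "O155"),
    ("Oppenheimer", "O155"),
    ("Scott", "S300"),
]


def group_by_code(pairs):
    """Collect surnames under their Soundex code, as an index would file them."""
    groups = defaultdict(list)
    for name, code in pairs:
        groups[code].append(name)
    return dict(groups)


# Keep only the codes that more than one surname falls under.
shared = {
    code: names for code, names in group_by_code(EXAMPLES).items() if len(names) > 1
}

for code, names in sorted(shared.items()):
    print(code, ", ".join(names))
# H524 Hanselmann, Henzelmann
# M200 McGee, McGhee
# O155 Opnian, Oppenheimer
```

Spelling variants such as Hanselmann and Henzelmann land under one code, but so do Opnian and Oppenheimer, which are quite different names.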

Soundex codes apply to the following federal census records:

Census Year  States Applicable
1880         all states, but only for households with children age ten and under
1890         none; this census was lost in a fire
1900         all states
1910         twenty-one states: Alabama, Arkansas, California, Florida, Georgia, Illinois, Kansas, Kentucky, Louisiana, Michigan, Mississippi, Missouri, North Carolina, Ohio, Oklahoma, Pennsylvania, South Carolina, Tennessee, Texas, Virginia, West Virginia
1920         all states