What We Need to Give to Our Descendants
A person's ancestry is a personal and yet valuable treasure. And, it should be shared, freely, with everyone -- at least, for privcacy purposes, all family before the great grandparents. However, much genealogical materials are being lost. That means future generations will lose the benefit of having them. I am talking about
tombstone lettering becoming unreadable, old newspapers crumbling to dust, people pitching the family history materials of a deceased family member,
public records destroyed by fire or flood or tornado. family photographs fading, and the list goes on.
Once these things are gone, they are gone.
What should we do about this? There is only one answer: everything of genealogical value must be digitized into a computer. Then, it is all made available, FOR FREE, to everyone.
But putting all this material together this is a monsterously huge, expensive (billions?) and time-consuming (decades?) endeavor. But, like so many other big issues our society faces today, it needs to be done. And who can argue against preserving families for future generations?
A Proposal
- A non-profit organization needs to be established just for this purpose
- All information digitized by this organization should be available for free to the public (like the LDS church has done)
- A structured manner of the data must be established at the beginning. Here is one way:
- Every digitized document must be labeled to identify where the original document is located and when it was digitized
- Each document must have all of its genealogical information carefully extracted into a pre-determined format so that it can be put into a searchable database
- Similar to an Internet Wiki, the extracted information must allow anyone to add comments to it and/or link it to other related extracted information
- Two states should be the pilot for this endeavor, this allows the process to be learned and smoothed over before the other states are done
- Large central computers with sufficient capacity (it grows over time) will be linked to the Internet. Appropriate software for accessing the data is on these central computers.
- Because of the possibility of identity theft, only information before 1940 should be included except for such things as death records, obituaries, tombstones and the like
- It will take from 15 to 30 years to initially complete all states
- Over the years, information after 1940 should be digitized. By this point, the available information would be well documented as to its availability. And, the volume would be smaller than the initial round of digitizing. This is similar to how every ten years the U.S. Census Bureau releases another census.
How Would The Digitizing Process Work?
There would certainly be much behind-the-scenes tasks going on such as administrative, equipment maintenance, recruitment, trainers, oversight, etc. But here are the highights of the main tasks:
- A survey is needed of what materials are available in each state to digitized. Some will be in private hands and permission is needed from each of them.
- The appropriate equipment needs to be purchased and tested. This equipment would include laptops, scanners, digital cameras and a very large capacity central computer. Since this endeavor will last many years, equipment will become outdated over time and newer, better, faster equipment will become available.
- Procedures for digitizing needs to be established.
- An army of people would need to be gathered and trained on the procedures of how to properly use the equipment and perform the established procedures for digitizing.
- A schedule of which states will be done in which order needs to be determined.
- The trained people would digitize the identified materials per state.
- The digitized information is uploaded to the central computers.
- Quality control is done to assure:
- all available material was indeed digitized (that is, did anything get missed?)
- is everything legible?
- Highly trained certified persons would extract from the digitized records the individual data elements
- Anyone can specify that two or more data elements are for the same person or that two or more persons are related
What Happens to the Digitized Information Once it is Available?
Let's say that a town book of births, deaths and marriages has been digitized, page by page. Each page contains multiple names and dates and other information. It may be hard to read. This is where the highly trained certified people step in.
These certified people review everything and create (on the central computer) what are called Extractions. An extraction is one specific piece of information about one person in one event. For example, for the town book, here is what the extractions would be for a marriage, a birth and a death (actually, more detailed than this, but it gives the concept):
Source |
Volume:Page |
Event Type |
Date/Time |
Location |
Person |
Role |
Additional Information |
Original |
Smallville KS Town Records |
6:24 |
Marriage |
1881-09-18 |
Smallville, Johnson county, KS, Baptist Church |
John Smith, Jr. |
Groom |
minister Joe Jones |
Image Link |
Smallville KS Town Records |
6:24 |
Marriage |
1881-09-18 |
Smallville, Johnson county, KS, Baptist Church |
Jane Johnson |
Bride |
minister Joe Jones |
Image Link |
Smallville KS Town Records |
14:87 |
Birth |
1883-05-15 |
Smallville, Johnson county, KS |
John Smith, Jr. |
Newborn |
|
Image Link |
Smallville KS Town Records |
14:87 |
Birth |
1883-05-15 |
Smallville, Johnson county, KS |
John Smith, Sr. |
Father |
|
Image Link |
Smallville KS Town Records |
14:87 |
Birth |
1883-05-15 |
Smallville, Johnson county, KS |
Jane Smith |
Mother |
|
Image Link |
Smallville KS Town Records |
8:37 |
Death |
1889-02-25 |
Smallville, Johnson county, KS |
Jane Smith |
Deceased |
|
Image Link |
Smallville KS Town Records |
8:37 |
Burial |
1889-02-25 |
Smallville, Johnson county, KS, Oaklawn Cemetery |
Jane Smith |
Interred |
|
Image Link |
These Extractions are the information at its most basic lowest level. And, given this detail, anyone could easily go back to the original digitized record to verify it, if necessary (which might call for a correction to the extraction data).
After enough Extractions are available, anyone can begin to tie them together (on the central computer) using Assertions. Assertions are someone's declaration that two or more Extractions are for the same person. When an Assertion is made, the person must identify themself and give a reason WHY s/he believes that the Extractions are for the same person. That way, others can come along at a later time and agree or disagree and specify their reason WHY. In the example Extractions above, someone could say that the deceased Jane Smith is the same person as the Jane Johnson that was married to John Smith, Sr. But to do so, they need to specify some reasonable reason for making that Assertion. The reason could be that they are just guessing because of the timeframe and in the same town. Or, maybe there is another Extraction that says that John Smith Sr. was married in 1891.
After enough Assertions are available, a person can be indicated as having a Relationship to another person (e.g. spouse, cousin, sibling, parent/child, etc.). And, over time, enough Relationships indicate a Family.
What Information Should Be Digitized?
These types of information sources should be digitized. Note that the number after each type denotes primary (1) or secondary (2) information.
- Adoption/Guardianship [1]
- Association, certification [2]
- Association, membership [2]
- Association, position held [2]
- Birth [1]
- Census, federal [2]
- Census, mortality schedule [2]
- Census, state [2]
- Church, membership and/or positions held [2]
- Church, minutes [2]
- Church: baptism/christening [2]
- Church: marriage [2]
- Court Appearance [1]
- Death [1]
- Directory, city [2]
- Education (attended any grade level) [1]
- Education, graduation/degree/major/honors [1]
- Election Results [1]
- Estate, administration [1]
- Estate, inventory [1]
- Estate, settlement [1]
- Family bible [2]
- Freeman [2]
- Honor/Award [2]
- Immigration [1]
- Interment, funeral service [2]
- Interment, obituary [2]
- Interment, tombstone/records [2]
- Land, deed [1]
- Land, purchase/sale [1]
- Land, transaction [1]
- License, driver's [1]
- License, occupational [1]
- License, professional [1]
- Marriage [1]
- Marriage, announcement [2]
- Marriage, banns/bond [2]
- Marriage, divorce/annulment [1]
- Marriage, intentions [2]
- Meeting (of an organization) Minutes [2]
- Military (discharged/mustered out) [2]
- Military (enlistment/mustered in) [2]
- Military (promotion) [2]
- Military (war/battles) [2]
- Military pension [1]
- Naturalization of citizenship [1]
- Oath Taken [2]
- Occupation/Job,employee/pension [2]
- Photograph [2]
- Residence, lived at [2]
- Residence, ship passenger [2]
- Residence, warning out [2]
- Taxes, assessment/payment [1]
- Town Meeting Minutes [1]
- Will, inclusion/heir [1]
- Will, proven [1]
- Will, written [1]
- Witness [1]
Proposed Schedule of States by Year
The states should be grouped so as to divide resources (i.e. people and equipment). Thus, a state with a large collection of materials to digitize should be grouped with a state with a medium size collection of materials and a state with a small size collection of materials. In the case of some states with a very large collection of materials, only two states should be in that group.
Given that criteria, here is a proposed grouping of states, one group per year. Note that at this juncture it is quite difficult to say how long any of this will actually take, it is only a guess.
LARGE |
MEDIUM |
SMALL |
|
WA |
RI |
MA |
IA |
DE |
TN |
AZ |
AK |
FL |
CT |
NV |
UT |
WI |
|
IN |
MS |
KS |
MI |
SC |
OR |
TX |
MT |
|
NJ |
VT |
ID |
VA |
MO |
HI |
MD |
KY |
AL |
NY |
|
ND |
GA |
WV |
AR |
PA |
|
SD |
MN |
NH |
DC |
OH |
|
NE |
IL |
CO |
|
OK |
LA |
WY |
CA |
ME |
|
NC |
NM |
|
Copyright © 2004-
by Larry Wilson, all rights reserved