Main contributor: Natalie Webb
Working With GEDCOM Files

Genealogists benefit from understanding GEDCOM (Genealogical Data Communication) files, the established standard for sharing genealogical data across different platforms. GEDCOM files are essential for exchanging data with colleagues or moving it between genealogy software programs.

Understanding GEDCOM Files

What is GEDCOM?

The GEDCOM file format (.ged) is a widely adopted standard for storing and transferring family tree data between genealogy programs. Originally developed by FamilySearch, GEDCOM was designed to ensure compatibility across different genealogical software programs. Just as .docx and .jpg are standard formats for documents and images, GEDCOM is the standard format for genealogical data. The latest version, GEDCOM 7.0, was introduced in 2021, but version 5.5.1 remains in common use and is still supported by many genealogy programs.

Structure of a GEDCOM File

GEDCOM files are organized hierarchically. Each line begins with a level number (starting with 0), followed by a tag (such as INDI for individuals or FAM for families) and any data associated with that tag. Lines that start a new record also carry a cross-reference identifier, such as @I1@, between the level number and the tag. Here is a partial GEDCOM file downloaded from MyHeritage:

0 HEAD
1 GEDC
2 VERS 5.5.1
2 FORM LINEAGE-LINKED
1 CHAR UTF-8
1 LANG English
1 SOUR MYHERITAGE
2 NAME MyHeritage Family Tree Builder
2 VERS 5.5.1
2 _RTLSAVE RTL
2 CORP MyHeritage.com
1 DEST MYHERITAGE
1 DATE 05 SEP 2024
1 FILE Exported by MyHeritage.com from Legacy Demo 75 People in Webb Web Site on Thu, 05 Sep 2024
1 _PROJECT_GUID 6690277A1C323E610024280905725114
1 _EXPORTED_FROM_SITE_ID 565219041
0 @I1@ INDI
1 NAME James Henry /Heeney/
2 GIVN James Henry
2 SURN Heeney
1 SEX M
1 BIRT
2 DATE OCT 1848
2 PLAC New Jersey, USA
1 DEAT Y
2 DATE 13 JAN 1907
2 PLAC New York City, New York, USA
2 NOTE <p>!NAME:U.S., City Directories, 1822-1995, U.S., City Directories, 1822-1995, Ancestry.com, Ancestry.com Operations, Inc. Date:2011 Place:Lehi, UT, USA</p><p><p></p></p><p><p>!NAME:1880 United States

Note that the first name in the tree, James Henry Heeney, does not appear until almost halfway down the segment. The initial lines are reserved for information about the file itself, such as the export date and the software that produced it.
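The line structure described above can be illustrated with a short Python sketch. This is not how any particular genealogy program reads GEDCOM; the names `GedcomLine` and `parse_line` are hypothetical, and the pattern assumes the simple "level, optional identifier, tag, optional value" layout seen in the sample.

```python
import re
from typing import NamedTuple, Optional

# Assumed layout: level, optional @XREF@ identifier, tag, optional value.
LINE_RE = re.compile(r"^\s*(\d+)\s+(?:(@[^@]+@)\s+)?(\w+)(?:\s(.*))?$")


class GedcomLine(NamedTuple):
    level: int
    xref: Optional[str]  # e.g. "@I1@" on record lines, otherwise None
    tag: str
    value: str


def parse_line(line: str) -> GedcomLine:
    """Split one GEDCOM line into its parts (hypothetical helper)."""
    match = LINE_RE.match(line.rstrip("\r\n"))
    if match is None:
        raise ValueError(f"not a GEDCOM line: {line!r}")
    level, xref, tag, value = match.groups()
    return GedcomLine(int(level), xref, tag, value or "")


# Lines taken from the sample above.
sample = [
    "0 @I1@ INDI",
    "1 NAME James Henry /Heeney/",
    "1 BIRT",
    "2 DATE OCT 1848",
]
for raw in sample:
    print(parse_line(raw))
```

Each parsed line keeps its level number, so a program can rebuild the hierarchy by attaching every line to the most recent line one level above it.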

GEDCOM files are not easy to work with in text format. While a GEDCOM file can technically be opened in a text editor, as above, it is recommended to use genealogy-specific software, such as Legacy 10 or MyHeritage, which can properly display and manage the file’s structure.

Working With GEDCOM Files

What can you do with a GEDCOM file?

GEDCOM files are primarily used for saving and transferring genealogical data between different software platforms. A GEDCOM file exported from one program can be imported into another. It is also prudent to keep GEDCOM files as backup copies of your genealogical research.

Most genealogy software includes functionality to create GEDCOM files, typically found under the “File” or “Export” menu. Some programs provide options to export specific individuals or branches, while others require exporting the entire family tree. It is advisable to assign a clear, descriptive name to the GEDCOM file for easy identification.

To import a GEDCOM file into a different program, look for the import option, usually found under the “File” or “Import” menu. Some programs allow you to customize the import by choosing which data to include, such as living individuals, media links, or sources. Tailor these settings to your project or your recipient’s needs.

Upon importing the file, it is critical to verify that all data has been transferred correctly. Check that the number of individuals, relationships, dates, and generations matches the original data set to ensure the integrity of the import.
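One way to make this check less manual is to export a GEDCOM file from the receiving program after the import and compare record counts with the original. The sketch below is only an illustration: `count_records` is a hypothetical helper, the two line lists are made-up stand-ins for real files, and matching counts do not prove that dates or relationships survived intact.

```python
from collections import Counter


def count_records(lines):
    """Count top-level records by tag (INDI, FAM, SOUR, ...)."""
    counts = Counter()
    for line in lines:
        parts = line.strip().split(maxsplit=2)
        # Record lines look like "0 @I1@ INDI"; HEAD and TRLR carry no @id@.
        if len(parts) == 3 and parts[0] == "0" and parts[1].startswith("@"):
            counts[parts[2].split()[0]] += 1
    return counts


# Made-up data: the original export and a re-export after import.
original = [
    "0 HEAD",
    "0 @I1@ INDI",
    "1 NAME James Henry /Heeney/",
    "0 @I2@ INDI",
    "0 @F1@ FAM",
    "0 TRLR",
]
reimported = ["0 HEAD", "0 @I1@ INDI", "0 @F1@ FAM", "0 TRLR"]

before, after = count_records(original), count_records(reimported)
for tag in sorted(set(before) | set(after)):
    if before[tag] != after[tag]:
        print(f"{tag}: {before[tag]} before, {after[tag]} after")
```

For real files, the lists could be replaced with the lines read from each .ged file.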

Troubleshooting Common GEDCOM Import Issues

Data Corruption

If a GEDCOM file appears corrupted or fails to import, open the file in a text editor to locate and correct any structural issues. Ensure that the original GEDCOM formatting is maintained. Alternatively, downloading a fresh copy of the file may resolve the issue.
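Before editing by hand, a small script can point to the lines most likely to be at fault. The following sketch assumes two common kinds of damage: a line that does not start with a level number (for example, note text that wrapped onto a new line) and a level number that jumps by more than one. The function name `find_structure_problems` is hypothetical, and the checks are far from a full validation.

```python
def find_structure_problems(lines):
    """Return (line_number, message) pairs for basic structural errors."""
    problems = []
    previous_level = -1  # so that the first valid line must be level 0
    for number, line in enumerate(lines, start=1):
        text = line.strip().lstrip("\ufeff")  # ignore a leading byte-order mark
        if not text:
            continue
        first = text.split(maxsplit=1)[0]
        if not (first.isascii() and first.isdigit()):
            problems.append((number, "line does not start with a level number"))
            continue
        level = int(first)
        # A child line may be at most one level deeper than the line before it.
        if level > previous_level + 1:
            problems.append(
                (number, f"level jumps from {previous_level} to {level}")
            )
        previous_level = level
    return problems


# Made-up sample with two deliberate errors (lines 4 and 6).
sample = [
    "0 @I1@ INDI",
    "1 BIRT",
    "2 DATE OCT 1848",
    "continued note text without a level",
    "1 DEAT Y",
    "3 PLAC New York City, New York, USA",
]
problems = find_structure_problems(sample)
for number, message in problems:
    print(f"line {number}: {message}")
```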

Missing Data

Occasionally, certain data may not transfer properly. This issue often arises when custom fields are used that are not supported by the receiving software. In such cases, manual entry of the missing data may be necessary after import.
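In GEDCOM, custom (user-defined) tags begin with an underscore, as with _RTLSAVE and _PROJECT_GUID in the sample file. A rough way to see which custom fields a file contains before importing it is sketched below; `custom_tags` is a hypothetical helper, and whether a given tag is understood depends entirely on the receiving software.

```python
from collections import Counter


def custom_tags(lines):
    """Count tags that start with an underscore (user-defined tags)."""
    counts = Counter()
    for line in lines:
        parts = line.strip().split()
        if len(parts) < 2:
            continue
        # On record lines ("0 @I1@ INDI") the tag follows the @id@.
        if parts[1].startswith("@") and len(parts) > 2:
            tag = parts[2]
        else:
            tag = parts[1]
        if tag.startswith("_"):
            counts[tag] += 1
    return counts


# Header lines taken from the sample file above.
sample = [
    "0 HEAD",
    "1 SOUR MYHERITAGE",
    "2 _RTLSAVE RTL",
    "1 _PROJECT_GUID 6690277A1C323E610024280905725114",
    "1 _EXPORTED_FROM_SITE_ID 565219041",
    "0 @I1@ INDI",
]
found = custom_tags(sample)
for tag, count in sorted(found.items()):
    print(tag, count)
```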

Character Encoding Issues

GEDCOM files can encounter character encoding problems, particularly with special characters or non-Latin alphabets. To prevent this, ensure that both the exporting and importing programs are set to the same character encoding standard, typically UTF-8.
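A quick check, sketched here under simplifying assumptions, is to read the encoding declared on the header's "1 CHAR" line and test whether the file's bytes actually decode as UTF-8. The helper names are hypothetical, the names in the sample data are invented, and the sketch does not handle UTF-16 files.

```python
def declared_encoding(data: bytes):
    """Return the value of the header's '1 CHAR' line, or None if absent."""
    for raw in data.splitlines():
        parts = raw.split()
        if parts[:2] == [b"1", b"CHAR"] and len(parts) > 2:
            return parts[2].decode("ascii", "replace")
        # Stop at the first top-level record after the header.
        if parts[:1] == [b"0"] and parts[1:2] != [b"HEAD"]:
            break
    return None


def is_valid_utf8(data: bytes) -> bool:
    """Report whether the raw bytes decode cleanly as UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Invented sample text, encoded two different ways.
text = "0 HEAD\n1 CHAR UTF-8\n0 @I1@ INDI\n1 NAME Zoë /Müller/\n"
good = text.encode("utf-8")
bad = text.encode("latin-1")  # header says UTF-8, but the bytes are Latin-1

for label, data in (("good", good), ("bad", bad)):
    print(label, declared_encoding(data), is_valid_utf8(data))
```

A file whose header declares UTF-8 but whose bytes fail to decode was probably saved in a different encoding, which is a typical cause of garbled accented characters after import.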

Best Practices for Managing GEDCOM Files

Regular Backups

Always create backups of your original GEDCOM files before importing or editing them. This precaution helps avoid data loss in case of errors during the process. Regular backups of your research are also recommended as a safeguard against software or hardware failures.
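Backups can be made by hand, but a small script makes the habit easier to keep. The sketch below copies a GEDCOM file into a "backups" folder beside it and adds a timestamp to the name; the function name, folder name, and file name are illustrative assumptions, not a feature of any genealogy program.

```python
import shutil
import tempfile
from datetime import datetime
from pathlib import Path


def backup_gedcom(path, backup_dir="backups"):
    """Copy a GEDCOM file into a sibling folder with a timestamped name."""
    source = Path(path)
    target_dir = source.parent / backup_dir
    target_dir.mkdir(exist_ok=True)
    stamp = datetime.now().strftime("%Y%m%d_%H%M%S")
    target = target_dir / f"{source.stem}_{stamp}{source.suffix}"
    shutil.copy2(source, target)  # copy2 also preserves file timestamps
    return target


# Demonstration in a temporary folder with a tiny made-up file.
with tempfile.TemporaryDirectory() as folder:
    original = Path(folder) / "Smith_FamilyTree_2024.ged"
    original.write_text("0 HEAD\n0 TRLR\n", encoding="utf-8")
    copy = backup_gedcom(original)
    copied_ok = copy.read_text(encoding="utf-8") == original.read_text(encoding="utf-8")
    print(copy.name)
```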

File Naming Conventions

Use consistent, descriptive names for GEDCOM files, such as “Smith_FamilyTree_2024.ged”. When sharing GEDCOM files, it is helpful to include the original researcher’s name in the file name so that the source of the data is easy to track.

Documentation

Maintain thorough documentation of any modifications made to GEDCOM files after export. Record the software used, settings applied, and any challenges encountered during the process.

Other Uses for GEDCOM Files

Merging GEDCOM Files

When working with multiple GEDCOM files, many genealogy programs offer a merge function. It is crucial to carefully review duplicates and resolve conflicting information to ensure accuracy.
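To show what reviewing duplicates can involve, the following sketch groups individuals who share the same name and birth date. It is a deliberately naive illustration with made-up data and a hypothetical function name; merge tools in genealogy programs use their own matching rules, and a match found this way still needs human review.

```python
from collections import defaultdict


def possible_duplicates(lines):
    """Group INDI record ids that share a normalized name and birth date."""
    people = {}      # record id -> {"name": ..., "birth": ...}
    current = None   # id of the INDI record being read, if any
    in_birth = False
    for line in lines:
        parts = line.strip().split(maxsplit=2)
        if not parts:
            continue
        level = parts[0]
        tag = parts[1] if len(parts) > 1 else ""
        value = parts[2] if len(parts) > 2 else ""
        if level == "0":
            is_person = len(parts) == 3 and parts[2] == "INDI"
            current = parts[1] if is_person else None
            if is_person:
                people[current] = {"name": "", "birth": ""}
            in_birth = False
        elif current is not None and level == "1":
            in_birth = tag == "BIRT"
            if tag == "NAME" and not people[current]["name"]:
                # Drop the surname slashes and ignore case and extra spaces.
                cleaned = " ".join(value.replace("/", " ").split()).lower()
                people[current]["name"] = cleaned
        elif current is not None and level == "2" and in_birth and tag == "DATE":
            people[current]["birth"] = value.upper()
    groups = defaultdict(list)
    for xref, info in people.items():
        groups[(info["name"], info["birth"])].append(xref)
    return [ids for ids in groups.values() if len(ids) > 1]


# Made-up merged data: @I1@ and @I3@ describe the same person.
merged = [
    "0 @I1@ INDI", "1 NAME James Henry /Heeney/", "1 BIRT", "2 DATE OCT 1848",
    "0 @I2@ INDI", "1 NAME Mary /Heeney/", "1 BIRT", "2 DATE 1850",
    "0 @I3@ INDI", "1 NAME James Henry /HEENEY/", "1 BIRT", "2 DATE Oct 1848",
]
duplicates = possible_duplicates(merged)
print(duplicates)
```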

Data Analysis Using GEDCOM Files

Advanced tools and scripts can analyze GEDCOM files to identify trends in migration patterns, common ancestors, or other genealogical data. These analytical methods provide deeper insights into family histories and broader demographic shifts.
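As a small taste of this kind of analysis, the sketch below tallies birthplaces by reading the PLAC line under each BIRT event. The function name and the sample data are made up for illustration, and real files often need place names cleaned up before counts like these are meaningful.

```python
from collections import Counter


def birthplace_counts(lines):
    """Count the PLAC values that appear under BIRT events."""
    counts = Counter()
    in_birth = False
    for line in lines:
        parts = line.strip().split(maxsplit=2)
        if len(parts) < 2:
            continue
        level, tag = parts[0], parts[1]
        if level in ("0", "1"):
            # Only a level-1 BIRT line opens a birth event.
            in_birth = level == "1" and tag == "BIRT"
        elif level == "2" and in_birth and tag == "PLAC" and len(parts) == 3:
            counts[parts[2]] += 1
    return counts


# Made-up sample data.
tree = [
    "0 @I1@ INDI", "1 BIRT", "2 PLAC New Jersey, USA",
    "1 DEAT Y", "2 PLAC New York City, New York, USA",
    "0 @I2@ INDI", "1 BIRT", "2 PLAC Ireland",
    "0 @I3@ INDI", "1 BIRT", "2 DATE 1875", "2 PLAC New Jersey, USA",
]
places = birthplace_counts(tree)
for place, count in places.most_common():
    print(count, place)
```

Comparing such counts across generations is one simple way to spot where a family moved over time.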
