Main contributor: Thomas MacEntee
Image via www.vpnsrus.com

Artificial intelligence (AI) has emerged as a powerful tool for genealogists, facilitating the translation, transcription, and summarization of a wide array of genealogy records. From deciphering a handwritten baptismal record from the 1800s to interpreting a faded newspaper obituary, AI platforms offer invaluable assistance in generating useful content for genealogical research.

Understanding artificial intelligence

Artificial Intelligence (AI) refers to computer-based systems designed to replicate human intelligence, performing tasks that typically require human cognitive functions. Key aspects of AI include "deep learning" and "generative AI."

  • Deep Learning: This facet of AI emulates the human brain's ability to recognize patterns by analyzing vast amounts of data. It is particularly effective in interpreting photos, audio, and text.
  • Generative AI: This technology creates new content—photos, audio, and text—based on user inputs. It relies on extensive training data to understand patterns and generate outputs that align with user queries.

AI Platforms for genealogy

While many genealogy vendors are integrating AI translation and transcription features, several popular AI platforms are publicly accessible and worth considering:

  • ChatGPT: Meaning “Chat Generative Pre-trained Transformer,” ChatGPT is the most popular publicly accessible artificial intelligence platform.
  • Copilot: Copilot is an AI-powered intelligent assistant that helps you get answers and inspirations from across the web, supports creativity and collaboration, and helps you focus on the task at hand.
  • Claude: Claude is an artificial intelligence chatbot created by the company Anthropic that is designed to generate text content and engage in conversations with users using human-like responses.
  • Gemini: Developed by Google, Gemini describes itself as “a family of AI models developed by Google's AI research labs DeepMind and Google Research. Gemini is Google's largest and most flexible AI model, able to run on data centers and mobile devices.”

Transcription using AI

The transcription capabilities of AI platforms are exceptional, offering solutions for converting audio, video, and handwritten documents into text. This is particularly beneficial for genealogists who have previously struggled with manual transcription of older documents.

Despite its advanced capabilities, AI-generated transcriptions are not always flawless. The accuracy can be influenced by the quality of the training data and examples, especially when dealing with older handwriting styles or languages such as English "secretary hand" or Latin used in German records. It is essential to review AI-generated transcriptions to ensure their accuracy before incorporating them into genealogical research.

Case study: Translation of death notice of Robert Austin

Gemini transcription of Death of Robert Austin newspaper article dated 5 September 1924, The Lowville Journal-Republican, Lowville, New York
Gemini transcription of Death of Robert Austin newspaper article dated 5 September 1924, The Lowville Journal-Republican, Lowville, New York

Consider the transcription of a death notice for Robert Austin, published on September 5, 1924, in The Lowville Journal-Republican, Lowville, New York. Robert Austin, the brother of John Ralph Austin (1896-1976), tragically drowned while attempting to rescue a child at Long Beach, New York. The newspaper article detailing this incident was digitized and uploaded to Gemini with the prompt "Transcribe exactly."

This case illustrates the practical application of AI in genealogy, highlighting how AI platforms like Gemini can accurately transcribe historical documents, thereby preserving valuable genealogical information.

Here is the transcribed text:

DROWNED AT LONG BEACH.

Robert Austin, Formerly of Lowville,

Loses Life on Labor Day

While Bathing.

Robert Austin, 30, son of Mr. and

Mrs. William D. Austin, formerly of

Gemini transcription of Death of Robert Austin newspaper article dated 5 September 1924, The Lowville Journal-Republican, Lowville, New York
Gemini transcription of Death of Robert Austin newspaper article dated 5 September 1924, The Lowville Journal-Republican, Lowville, New York

Lowville, was drowned at Long Beach,

1.. 1.. Monday, but no particulars of

the accident are at hand. The Austin

family removed from this village to

New York several years ago. Robert

Austin was born here October 13.

1994. At the time of his death he held

the responsible position of office

manager for the D. W. Griffith Motion

Picture Corporation. being in charge

of the New York office. He was mar-

ried about two years ago. Besides his

wife and parents he leaves four broth-

ers, Maurice, Harry, Ralph and Law-

ton, all of New York city.

Source: Uploaded article “Drowned at Long Beach” with the query “Transcribe exactly.” Gemini, Gemini Advanced version, Google, 8 Feb 2024, https://gemini.google.com/.

Translation using AI

The capability of AI to translate documents and images containing text from one language to another is a remarkable and invaluable tool for genealogists. However, this technology is not without its limitations.

Before translating a document, it is crucial to review its origin and determine its creation date. For instance, a Dutch language document from the 17th century may not yield as accurate results as a more recent document in the same language.

For legal documents or those requiring high accuracy, it is advisable to employ the services of a certified translator. Conduct interviews with the translator or the translation service to ensure they have the expertise to handle older documents, particularly those from the 17th century.

Case Study: Translation of a page from Le troisième centenaire de l'Édit de Nantes en Amérique et en France (1889)

Le troisième centenaire de l'Édit de Nantes en Amérique et en France (1889) page 3
Le troisième centenaire de l'Édit de Nantes en Amérique et en France (1889) page 3

Consider the translation of a page from "Le troisième centenaire de l'Édit de Nantes en Amérique et en France" (1889). In this instance, ChatGPT was tasked with translating an uploaded image of page 3 from the book. The prompt used was "translate to English."

The resulting translation is as follows:

On Thursday, April 14, the second and last session took place at the Church of the Holy Spirit. Our kind English colleague, Mr. A. Giraud Browning, spoke to us about the hospital for poor Huguenot refugees in London, and about those who founded it. — A society that also deals a lot with the origins of the American people is the Holland Society. Among other things, we owe it the publication of the oldest register of baptisms and marriages of the Church of New Paltz, where we find many French names associated with the first refugees who were Walloons or Dutch. This society is represented by its secretary Mr. Theo. M. Banta who reads a few pages from Mr. G. Wildeman on the Walloon Church of Haarlem. — Reverend Vedder, from the French Church of Charleston in South Carolina, replaces Mr. T. W. Bacot who had been delegated by her. With great wit and good grace, Mr. Vedder speaks to us about the current descendants of the many Huguenots who once settled in Charleston. — Those from Virginia, equally numerous and often influential today, are represented by Colonel Richard L. Maury. — Finally, after a few remarks from Mr. A. T. Clearwater and J. C. Pumpelly on the influence of the Huguenots in New York and in the state of New Jersey, Mr. George T. Davis tells us the history of New Rochelle, the charming summer city founded in 1688 by the Huguenots of New York.

chatgpt 4O translate Le troisième centenaire de l'Édit de Nantes en Amérique et en France p3 20240717
chatgpt 4O translate Le troisième centenaire de l'Édit de Nantes en Amérique et en France p3 20240717

The Banquet. On Thursday evening, the official ceremonies ended, as they do every year, with a magnificent banquet served in the most beautiful room of the best restaurant in New York, Delmonico's, at the corner of 5th Avenue and 14th Street. The room, brilliantly illuminated, is adorned with Louis XV-style moldings. The wall behind the head table (covered with a flowered tablecloth) is decorated with American, English, French, and Italian flags, among others, in memory of the France of yesteryear. A beautiful white silk banner dotted with golden lilies is prominently displayed. The entrance is ceremonially made, in procession, with the new president offering his arm to the French delegate and taking his place at the center of the head table reserved for the Committee, foreign delegates, and various others.

Source: “Translate to English” prompt using digital image of Le troisième centenaire de l'Édit de Nantes en Amérique et en France, page 3, published 1989. ChatGPT, ChatGPT 40 version, OpenAI, 17 July 2024, https://chat.openai.com/

Improving accuracy

Accuracy is dependent upon a variety of factors:

  • Document/Image Quality: Ensure high-quality images or documents. If they are faded or damaged, use a photo editor to enhance clarity.
  • Document/Image Type: Not all platforms support PDF formats. Utilize JPG, PNG, or other widely accepted formats for better compatibility.
  • Document Length/Number of Images: For large documents or numerous images, process them in smaller batches to maintain accuracy and performance.
  • Print vs. Handwriting: Printed text is processed more quickly and accurately than handwriting. Handwritten documents, particularly those with complex forms (e.g., US World War I Draft Registration Cards), often pose greater challenges due to format and legibility issues.
  • Platform Variability: Different AI platforms vary in performance. Experiment with multiple platforms to find the one that delivers the best accuracy for your needs.

AI and Source Citations

Genealogy and family history research emphasize the importance of source citations for verifying relationships and facts about ancestors. Source citations document the origin and usage of records like census schedules, death certificates, and personal correspondence.

Creating source citations need not be daunting. Focus on these elements:

  • The information found.
  • The method of discovery.
  • The location of the information.
  • Locator data for other researchers.

For artificial intelligence content, here’s the formula to consider using as proposed by the Modern Language Association of America (MLA):

“[QUERY]” prompt. [NAME OF AI PLATFORM], [DATE OR VERSION OF PLATFORM], [NAME OF AI COMPANY], [DATE OF QUERY], [PLATFORM URL]

So, if ChatGPT were prompted to translate page 3 from the book Le troisième centenaire de l'Édit de Nantes en Amérique et en France, here is a source citation to use:

“Translate to English” prompt using digital image of Le troisième centenaire de l'Édit de Nantes en Amérique et en France, page 3, published 1989. ChatGPT, ChatGPT 3.5 version, OpenAI, 12 March 2024, https://chat.openai.com/.

Tips and tricks

  • Clear and Concise Instructions: Provide explicit prompts to achieve precise results. For instance, use “transcribe exactly” instead of “transcribe,” and “translate to English” rather than simply “translate.”
  • Feedback: Utilize feedback mechanisms provided by AI platforms. Feedback is crucial for improving future accuracy.
  • Task Breakdown: Divide complex tasks into simpler steps. For example, separate transcription and translation tasks to enhance accuracy.
  • Platform Recommendations: Be cautious of AI platforms suggested by search engines. Some may claim superior transcription or translation capabilities compared to established platforms.
  • Smaller Segments: When handling documents or images with handwritten text, upload single pages or smaller sections to boost accuracy.
  • Transcription of Audio and Video: Leverage AI to transcribe video or audio files. This can be especially useful for creating text records from multimedia sources.
  • Specific Prompts: Craft specific prompts such as “translate to English” when uploading documents or images, rather than generic instructions like “translate.”
  • Review Translated Text: Ensure the translated text aligns in length and context with the original. Some AI platforms may produce excessively long translations that are not practical.
  • Contextual Accuracy: While AI platforms generally translate text effectively, ensure the context is preserved, as many results may be literal rather than contextual.

Following these guidelines will significantly enhance the accuracy and reliability of your document processing and AI-assisted research endeavors.

Explore more about using AI to translate and transcribe genealogy documents

Retrieved from ""