One wrong ancestor in an online family tree gets copied into thousands of other trees, creating an unkillable misinformation cascade

devtools0 views
On Ancestry.com, when a user adds an ancestor to their tree, the platform suggests "hints" from other users' trees. If someone accepts a hint without verifying it, the error propagates. Then their tree becomes a hint source for the next person. Research has found single ancestors appearing in 46 different online family trees with wildly different birth dates, death dates, and parents — because each tree copied from a slightly different erroneous source. The most common version of this: someone attaches the wrong John Smith born in 1820 in Virginia to their tree, and within months, dozens of other trees have imported not just that wrong John Smith but his entire fabricated lineage going back generations. This matters because online family trees are not just hobby projects — they are the primary mechanism by which DNA matches are identified and relationships are determined. Ancestry's ThruLines feature uses public family trees to suggest how two DNA matches might be related. If the underlying trees contain errors, ThruLines will confidently suggest a wrong common ancestor, leading researchers down a false path that can waste months of effort. For adoptees using DNA to find biological family, a single tree error in a critical match's ancestry can send them to the wrong state, the wrong family, the wrong person — with real emotional consequences when they reach out to a stranger who turns out to be unrelated. The structural cause is that genealogy platforms have no quality control mechanism for user-submitted trees. There is no source-citation requirement for adding an ancestor. There is no flag when a tree contains logical impossibilities (a woman giving birth at age 5, a person dying before they were born). Ancestry's hint system actively incentivizes uncritical copying by making it easier to click "accept" than to verify. The platform benefits from larger, more connected trees because they drive engagement metrics and ThruLines suggestions, regardless of accuracy. The result is a massive, interconnected web of unverified genealogical claims where misinformation spreads faster than corrections.

Evidence

Analysis of how wrong information propagates: https://trentinogenealogy.com/2017/04/how-the-wrong-information-ends-up-in-your-family-tree/ | The 10 most common mistakes in online trees: https://www.thefhguide.com/blog/the-10-most-common-mistakes-in-online-family-trees/ | Copy-paste genealogy remedies: https://familytreewebinars.com/webinar/remedies-for-copy-paste-genealogy/ | Geneanet consistency checker for tree errors: https://en.geneanet.org/genealogyblog/post/2024/01/do-you-have-some-errors-in-your-family-tree-discover-our-consistency-checker-3

Comments