Do you want help finding doublets of profiles hidden in your Geni.com tree?

Started by Kenneth Ekman on Saturday, May 12, 2018
Problem with this page?

Participants:

Showing all 15 posts
5/12/2018 at 5:02 AM

Hi,

Geni.com itself searches for doublet profiles, but lately I have been disappointed since most proposed doublets are not really from the same person, but just matches with common names, and unclear birth and death dates.

Thus I have made a piece of software (FamilyStudio2), that I can use to find doublets in the geni.com tree.

I have improved my own tree by merging tens (probably over a hundred) of profiles in the last few weeks by using this tool to search for doublets.

The tool does NOT do any merging at all, it just searches for people with similar names AND birth and death dates.

Below is a sample list made yesterday (except the list includes links to the profiles too):

Name1 (Birth - Death) Name2 (Birth - Death)
Maria Olsdotter (1786 - 1786)
Maria Olsdotter (1786 - 1786)

Anna Elise Thorsell Svensson (1915-04-04 - 1915-04-04)
Anna Elise Thorsell Svensson (1915-04-04 - 1915-04-04)

Anna Elise Thorsell Svensson (1915-04-04 - 1915-04-04)
Anna Elise Thorsell Svensson (1915-04-04 - 1915-04-04)

Viktor Emanuel Linde (1900-03-09 - 1900-03-09)
Viktor Emanuel Linde (1900-03-09 - 1900-03-09)

Emanuel Stefan Söderberg (1876-02-29 - 1876-02-29)
Emanuel Stefan Söderberg (1876-02-29 - 1876-02-29)

Sigrid Kristina Jonsdotter (1837-10-24 - 1837-10-24)
Sigrid Kristina Jonsdotter (1837-10-24 - 1837-10-24)

Anders Sellén (1846-09-18 - 1846-09-18)
Anders Sellén (1846-09-18 - 1846-09-18)

Olof Olofsson Lidgren (1801-07-27 - 1801-07-27)
Olof Olofsson Lidgren (1801-07-27 - 1801-07-27)

Pehr Pehrsson (1738-12-31 - 1738-12-31)
Pehr Pehrsson (1739 - 1739)

Ingrid Abrahamsdotter (1748-10-07 - 1748-10-07)
Ingrid Abrahamsdotter (1748-10-07 - 1748-10-07)

Margareta Johanna Wikström (1815-11-20 - 1815-11-20)
Margareta Johanna Wikström (1815-11-20 - 1815-11-20)

Nils Erik Lind (1817-09-09 - 1817-09-09)
Nils Erik Lind (1817-09 - 1817-09)

Per Vilhelm Lindblom (1879-10-25 - 1879-10-25)
Per Vilhelm Lindblom (1879-10-25 - 1879-10-25)

Jonas Pettersson Rudgren (1793-01-24 - 1793-01-24)
Jonas Pettersson Rudgren (1793-01-24 - 1793-01-24)

Magnus Georgsson Schröder (1737-10-27 - 1737-10-27)
Magnus Georgsson Schröder (1737-10-27 - 1737-10-27)

Jordan Simonsson (1735 - 1735)
Jordan Simonsson (1735 - 1735)

If you want me to search your tree, write a note here or send me a private message, and I can make a check of your tree.

Note that I will not make any changes, yous create a list that you can use to merge profiles in cases when they are really matching.

This will be on a first-come first served basis. The analysis takes a couple of hours so I will only be able to handle a few per day, since I don't have more than one computer right now.

The service is completely free.

5/12/2018 at 5:38 AM

Disclaimer: The list I send is also likely to include false matches.
Please make sure every match is an actual duplicate before attempting a merger.
You should NEVER merge two profiles unless you are sure that they are the same person!

Just thought I should mention that...

5/12/2018 at 11:06 AM

I tested it on my tree and got 38 matches. About 60% of them were true matches. None of them were marked as blue mathces by Geni.

I suggest that the Geni development team should consult Kenneth on how to implement his great matching algorithm in the Geni software.

5/12/2018 at 11:46 AM

Thanks Magnus, though the algorithm is far from perfect. I have lots of ideas for improvements that might be tested in the future..

5/14/2018 at 4:22 AM

Why not link to the software that you have developed - if people wants to try it themselves?

See
* https://www.geni.com/projects/FamilyStudio2-Open-source-genealogy-s...
* https://github.com/endian02/FamilyStudio2/wiki
* https://www.facebook.com/groups/876501109184344/

6/6/2018 at 2:46 AM

This offer has now expired, but now you can use the above mentioned software to do it yourself. Just do a Completeness check, and make sure to enable the Duplicates checkbox.

6/7/2018 at 1:56 AM

I have sumarized the features of the tool, as I understand them, in point 10 at https://www.geni.com/projects/SmartCopy-Best-Practices/47044 . And also at https://www.geni.com/projects/Genealogy-Software-overview/18333 . And I removed the offer from these pages.

6/14/2018 at 2:05 AM

Thanks, Magnus.

My experience is however not that the sanity check is a "final check that everything is ok", but that it is most useful as an interactive tool to improve the tree quality. I have been using the latest features for a couple of months now and I (unfortunately) don't see a point where I will feel like im done updating the tree just by running the sanity check (I prefer sanity check before "insanity check") a couple of times per week.

12/7/2018 at 4:10 AM

I am now beta-testing a website based on the same code at http://improveyourtree.com . You are welcome to try it if you want!

1/12/2020 at 6:37 AM

The source code is now open source at https://github.com/endian02/FamilyTreeWebApp

Private User
1/16/2022 at 9:58 PM

Hi Kenneth. improveyourtree.com should be an https adddress by now. http is no longer the standard. Tried to login, but it throws back an error..

An error occurred while processing your request.
Request ID: 00-bec2839cbf562c649f1deff9f44896f0-c35ab3170a8acc5d-00

Is improveyourtree still active?

1/17/2022 at 3:44 AM

Yes, it's supposed to work ok, last I tried it. You can also use the corresponding https address.

1/17/2022 at 3:45 AM

But, yes I now also get an error! I'll look at it asap. Earliest tonight, though!

1/17/2022 at 9:23 AM

I restarted the app, and now it works for me again. Private User please let me know how it looks for you!

Showing all 15 posts

Create a free account or login to participate in this discussion