cross-posted from: https://lemmy.dbzer0.com/post/27579423
This is my first try at creating a map of lemmy. I based it on the overlap of commentors that visited certain communities.
I only used communities that were on the top 35 active instances for the past month and limited the comments to go back to a maximum of August 1 2024 (sometimes shorter if I got an invalid response.)
I scaled it so it was based on percentage of comments made by a commentor in that community.
Here is the code for the crawler and data that was used to make the map:
I pretty much only browse /all , so I’m throwing the numbers off! I don’t know myself with which communities i interact most.
Yeah I’ve noticed there aren’t many clusters that encode specific ideas (there are a few like the anime, nsfw, or sometimes instance level clusters). Most of it just seems to be a blend. Sorta disappointing.
Are they clustered based on shared userbase?
Yeah pretty much. There is also a weighting based on the percentage of comments in that community that come from that user.
Can anyone ELI5 what the axes mean?
Nothing. There were far more dimensions in the original data and the author asked the computer to squash that down into two axes in whatever way preserved groupings
One is labelled Y.
I’m assuming the other is X… but might be Z if they’re fun
This is cool, keep adding more features. Not sure if my comment wishing this existed inspired you but nice to see a proof of concept!
Actually it did so thx for that.