you are viewing a single comment's thread.

view the rest of the comments →

[–]jjdub7Gay Male Guest Commentator 8 insightful - 1 fun8 insightful - 0 fun9 insightful - 1 fun -  (2 children)

I mean, I've made a mySQL database that I use to track corporations, individuals, and news articles too.

The difference is that I use it to tell family where not to spend their money in a quarterly email, and I only track the online presence of individuals, as Twitter/Reddit handles make for a good individual unique identifier.

The reason I started tracking individuals in the first place has to do with what u/pancakewaffle mentions below - that there are collectivist lunatics in Big Tech who are itching to scour through and weaponize their employers' data against their political enemies, particularly in ways that even their oft-woke employers would fire them on the spot if detected.

Combine this with the fact that real estate titles are public records, and there are several functional projects already on Github that establish scraper APIs for a staggering number of US counties' public records (where the title deeds are most often recorded and held), unifying them into a single merged data source.

Its terrifying that between the two, such activist types essentially have the power to dox and harass people at scale, and with little interference from law enforcement, particularly if they use the Tor network to access to web or send encrypted communications.

[–]pancakewaffle 7 insightful - 1 fun7 insightful - 0 fun8 insightful - 1 fun -  (1 child)

This is both high level in description and low level in implementation so I won't dwell on it long but, crawl for author and user ids (probably in base), target those ids in intervals per api request limits, run them through a natural language set and train them. That way you can compare probabilities of a user against a suspected alt based on writing habits. It's also a bit overkill haha.

[–]jjdub7Gay Male Guest Commentator 2 insightful - 1 fun2 insightful - 0 fun3 insightful - 1 fun -  (0 children)

Oooh, interesting. Like a stylization attack, or whatever its called, as a user's fingerprint. Always on the hunt for new pet projects to expand on what I have, simply for skills development purposes - definitely adding a deep dive on this to my backlog.