Hatebase catalogues the world’s hate speech in real time so you don’t have to

INSUBCONTINENT EXCLUSIVE:
Policing hate speech is something nearly every online communication platform struggles with
Because to police it, you must detect it; and to detect it, you must understand it
increasingly valuable one.Essentially Hatebase analyzes language use on the web, structures and contextualizes the resulting data, and sells
a small but growing operation, emerged out of research at the Sentinel Project into predicting and preventing atrocities based on analyzing
What surprised us was that a lot of other NGOs [non-governmental organizations] started using our data for the same purpose
Then we started getting a lot of commercial entities using our data
that they know of
most mutable part of any language
voluminous, but it is ever-shifting
So the task of cataloguing it is a continuous one.Hatebase uses a combination of human and automated processes to scrape the public web for
uses of hate-related terms
False means no, of course
group and is attempting to reclaim it or rebuke others who use it
Those are the values that go out via the API, and users can choose to look up more information or context in the larger database, including
location, frequency, level of offensiveness, and so on
With that kind of data you can understand global trends, correlate activity with other events, or simply keep abreast of the fast-moving
bias floating in
human intelligence, in the form of a corps of volunteers and partners who authenticate, adjudicate, and aggregate the more ambiguous data
Quinn said
a word
He gave the example of a word in Nigeria, which when used between members of one group means friend, but when used by that group to refer to
someone else means uneducated
or phrases that are not offensive on their own but serve to indicate whether someone is emphasizing the slur or phrase
Other factors enter into it too, some of which a natural language engine may not be able to recognize because it has so little data
concerning them
So in addition to keeping definitions up to date, the team is also constantly working on improving the parameters used to categorize speech
Hatebrain encounters.Building a better database for science and profitThe system just ingested its millionth hate speech sighting (out of
perhaps tens times that many phrases evaluated), which sounds simultaneously like a lot and a little
A vetted, million-data-point set of words and phrases classified as hate speech or not hate speech is a valuable commodity all on its own
larger organizations looking to outsource hate speech detection for moderation purposes pay a license fee, which keeps the lights on and
That kind of threat really loosens the purse strings; If a fine could be in the tens of millions of dollars, paying a significant fraction
speech, their own AI
model, we call ourselves hate speech as a service
Relying on the kindness of rich strangers is no way to stay in business, after all
make sure the jobs that need doing have someone to do them.In the meantime it seems clear to Quinn and everyone else that this kind of
We always grapple with it, you know, in terms of, well, what role does hate speech play? What role does misinformation play? What role do
immigrants in Germany over, I want to say, 2015 to 2017
They graph it out
And its peak for peak, you know, valid for Valley
those kinds of those kinds of analyses