this post was submitted on 18 Sep 2025
151 points (100.0% liked)
Data is Beautiful
LLMs can’t deal with highly inflected languages at the moment. In English or Chinese a tokenizer can assign tokens to entire words without having to account for much word morphology (which is also why models fail at counting letters in words), but that falls apart quickly in Polish or Russian, where each lemma has many inflected surface forms. The way models like ChatGPT work now is that they do their “reasoning” in English first and translate back into the query language at the end.
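A minimal sketch of why morphology matters for a word-level vocabulary: a single Polish noun lemma produces far more distinct surface forms than its English counterpart (the word lists below are the standard declension of "kot"/"cat"; the dict names are just illustrative).

```python
# Sketch: a fusional language multiplies surface forms per lemma,
# so a word-level vocabulary grows much faster than in English.
english_forms = {"cat": ["cat", "cats"]}  # singular, plural
polish_forms = {
    # declension of "kot" across 7 cases x 2 numbers (some forms coincide)
    "kot": ["kot", "kota", "kotu", "kotem", "kocie",
            "koty", "kotów", "kotom", "kotami", "kotach"],
}

en_vocab = sum(len(v) for v in english_forms.values())
pl_vocab = sum(len(v) for v in polish_forms.values())
print(en_vocab, pl_vocab)  # 2 vs 10 entries for one lemma
```

This is why subword tokenizers split inflected words into pieces instead of keeping one token per word, which in turn means a single "word" in Polish or Russian is often several tokens.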