Maru 2022 WSD
Dataset Review
A lexicographer review of the Maru 2022 Word Sense Disambiguation dataset, reevaluating sense assignments against WordNet 3.0 across four benchmark corpora: SemEval-2013, SemEval-2015, Senseval-2, and Senseval-3.
Review Progress
Progress is saved automatically in this browser. Use "Export verdicts" to download your annotations.
Ring S1
55 instancesHigh-frequency, unambiguous instances — clearest sense assignments
Ring S2
138 instancesModerately frequent instances with some sense ambiguity
Ring S3
41 instancesLess frequent instances requiring careful disambiguation
Ring S4
75 instancesInfrequent senses with higher inter-annotator disagreement
Ring S5
48 instancesRare or specialized senses, often domain-specific
Ring S6
6 instancesExtremely rare senses — edge cases for WSD evaluation
About the Dataset
The Maru 2022 WSD dataset is a benchmark for Word Sense Disambiguation, providing gold-standard sense annotations from WordNet 3.0 across multiple evaluation corpora.
Review Methodology
Each instance is reviewed by a lexicographer who selects applicable senses, flags "neither of the above" cases, and adds free-form comments. Progress is saved locally in the browser.
Frequency Rings
Instances are grouped into six frequency rings (S1–S6), from the most common and unambiguous (S1) to the rarest and most challenging (S6) sense assignments.