Maru 2022 WSD
Dataset Reevaluation

Maru 2022 WSDDataset Review

A lexicographer review of the Maru 2022 Word Sense Disambiguation dataset, reevaluating sense assignments against WordNet 3.0 across four benchmark corpora: SemEval-2013, SemEval-2015, Senseval-2, and Senseval-3.

SemEval-2013SemEval-2015Senseval-2Senseval-3

Review Progress

0/37
Pages complete
0/363
Items reviewed
0
"Neither" verdicts
0
Comments left
Overall progress0%
Ring Distribution
S1: 55
S2: 138
S3: 41
S4: 75
S5: 48
S6: 6

Progress is saved automatically in this browser. Use "Export verdicts" to download your annotations.

Browse Review Pages
S1

Ring S1

55 instances

High-frequency, unambiguous instances — clearest sense assignments

S2

Ring S2

138 instances

Moderately frequent instances with some sense ambiguity

S3

Ring S3

41 instances

Less frequent instances requiring careful disambiguation

S4

Ring S4

75 instances

Infrequent senses with higher inter-annotator disagreement

S5

Ring S5

48 instances

Rare or specialized senses, often domain-specific

S6

Ring S6

6 instances

Extremely rare senses — edge cases for WSD evaluation

About the Dataset

The Maru 2022 WSD dataset is a benchmark for Word Sense Disambiguation, providing gold-standard sense annotations from WordNet 3.0 across multiple evaluation corpora.

Review Methodology

Each instance is reviewed by a lexicographer who selects applicable senses, flags "neither of the above" cases, and adds free-form comments. Progress is saved locally in the browser.

Frequency Rings

Instances are grouped into six frequency rings (S1–S6), from the most common and unambiguous (S1) to the rarest and most challenging (S6) sense assignments.