Dataset Reevaluation

Maru 2022 WSD Dataset Review

A lexicographer review of the Maru 2022 Word Sense Disambiguation dataset, reevaluating sense assignments against WordNet 3.0 across four benchmark corpora: SemEval-2013, SemEval-2015, Senseval-2, and Senseval-3.

Review Progress

Pages complete: 0/37

Items reviewed: 0/363

Overall progress: 0%

Also tracked: "Neither" verdicts, comments left

Ring Distribution

S1: 55

S2: 138

S3: 41

S4: 75

S5: 48

S6: 6

Progress is saved automatically in this browser. Use "Export verdicts" to download your annotations.

Browse Review Pages

Ring S1

55 instances

High-frequency, unambiguous instances with the clearest sense assignments

10 items: treasury, acceptance, issue, rule, discovery…
10 items: discontinue, abnormal, useful, useful, new…
10 items: solemn, draw, small, believe, same…
10 items: mute, involvement, involvement, thing, believe…
10 items: discourage, competitive, proportion, state, state…
S1 to S2 ring transition, 10 items: realize, drink, symptom, quiet, bundle…

Ring S2

138 instances

Moderately frequent instances with some sense ambiguity

10 items: policy, loss, field, field, u.s.…
10 items: question, policy, law, rule, principle…
10 items: capability, study, distinction, conservative, administration…
10 items: input, input, know, function, now…
10 items: study, industry, basis, overall, product…
10 items: call, remain, continental, stand, sound…
10 items: life, speak, call, hold, growth…
10 items: u.s., picture, development, same, growth…
10 items: authoritarian, yet, child, education, child…
10 items: bright, cold, have, cost, address…
10 items: vote, public, public, cost, benefit…
10 items: benefit, attendance, rate, devote, attendance…
10 items: difference, positive, function, severe, difference…
S2 to S3 ring transition, 10 items: explode, do, bounce, u.s., cycle…

Ring S3

41 instances

Less frequent instances requiring careful disambiguation

10 items: consider, say, say, say, say…
10 items: say, believe, need, loss, say…
10 items: wish, really, large, fact, ready…
S3 to S4 ring transition, 10 items: do, wild, refuge, thing, negotiation…

Ring S4

75 instances

Infrequent senses with higher inter-annotator disagreement

10 items: time, area, number, aspect, tone…
10 items: know, have, specify, country, organisation…
10 items: high, always, initiate, simply, make…
10 items: first, loss, begin, sign, development…
10 items: dominate, inherent, call, think, reform…
10 items: come, grievance, realize, encourage, bright…
S4 to S5 ring transition, 10 items: factor, advantage, factor, factor, comfort…

Ring S5

48 instances

Rare or specialized senses, often domain-specific

10 items: family, argument, life, country, country…
10 items: effect, best, function, public, public…
10 items: say, history, variation, call, come…
10 items: loss, think, problem, thing, time…
S5 to S6 ring transition, 10 items: regional, level, level, reveal, state…

Ring S6

6 instances

Extremely rare senses that serve as edge cases for WSD evaluation

3 items: fact, think_of, family

About the Dataset

The Maru 2022 WSD dataset is a benchmark for Word Sense Disambiguation, providing gold-standard sense annotations from WordNet 3.0 across multiple evaluation corpora.

Review Methodology

Each instance is reviewed by a lexicographer who selects applicable senses, flags "neither of the above" cases, and adds free-form comments. Progress is saved locally in the browser.
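
As a rough illustration of the review record described above, the sketch below models one verdict per instance and derives the counters shown under Review Progress. The schema is an assumption: the class name `Verdict`, its field names, the instance identifiers, and the sense-key strings are all hypothetical and are not taken from the tool's actual export format.

```python
import json
from dataclasses import dataclass, field, asdict


@dataclass
class Verdict:
    """One lexicographer verdict for one instance (hypothetical schema)."""
    instance_id: str                                           # assumed identifier
    selected_senses: list[str] = field(default_factory=list)   # chosen sense keys
    neither: bool = False                                      # "neither of the above"
    comment: str = ""                                          # free-form note


def summarize(verdicts: list[Verdict], total_items: int) -> dict:
    """Derive the progress counters from a list of verdicts."""
    # An item counts as reviewed once it has a sense selection or a "neither" flag.
    reviewed = [v for v in verdicts if v.selected_senses or v.neither]
    return {
        "items_reviewed": f"{len(reviewed)}/{total_items}",
        "neither_verdicts": sum(v.neither for v in verdicts),
        "comments_left": sum(bool(v.comment.strip()) for v in verdicts),
    }


def export_verdicts(verdicts: list[Verdict]) -> str:
    """Serialize verdicts to JSON, loosely mirroring an "Export verdicts" download."""
    return json.dumps([asdict(v) for v in verdicts], indent=2)


# Usage with placeholder identifiers and placeholder sense keys.
verdicts = [
    Verdict("item-001", selected_senses=["placeholder-sense-key-1"]),
    Verdict("item-002", neither=True, comment="No listed sense fits this context."),
    Verdict("item-003"),  # opened but not yet judged
]
print(summarize(verdicts, total_items=363))
print(export_verdicts(verdicts))
```

Allowing several entries in `selected_senses` reflects the statement that a reviewer selects "applicable senses" in the plural. Whether the real tool stores them this way is not stated.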

Frequency Rings

Instances are grouped into six frequency rings (S1–S6), from the most common and unambiguous (S1) to the rarest and most challenging (S6) sense assignments.
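
The page listing is consistent with a simple scheme: order the 363 instances by ring, then cut them into consecutive pages of 10, so that a page straddling two rings becomes a "ring transition" page. The sketch below checks that reading against the ring counts. The page size and the chunking rule are inferences from the listing, not documented behavior of the tool, and the function names are hypothetical.

```python
from collections import Counter

# Ring sizes as given under Ring Distribution.
RING_COUNTS = {"S1": 55, "S2": 138, "S3": 41, "S4": 75, "S5": 48, "S6": 6}
PAGE_SIZE = 10  # assumption, inferred from the "10 items" page labels


def paginate(ring_counts, page_size=PAGE_SIZE):
    """Chunk ring-ordered instances into pages; each page is a list of ring labels."""
    labels = [ring for ring, n in ring_counts.items() for _ in range(n)]
    return [labels[i:i + page_size] for i in range(0, len(labels), page_size)]


def describe(page):
    """Return (page kind, item count); pages spanning two rings are transitions."""
    rings = sorted(set(page))
    kind = "->".join(rings) + " transition" if len(rings) > 1 else rings[0]
    return kind, len(page)


pages = paginate(RING_COUNTS)
print(len(pages))                               # 37
print(describe(pages[5]))                       # ('S1->S2 transition', 10)
print(describe(pages[-1]))                      # ('S6', 3)
print(Counter(describe(p)[0] for p in pages))   # pages per ring and per transition
```

Under this reading, three of the six S6 instances fall on the S5 to S6 transition page, which would explain why the only pure S6 page holds 3 items.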
