docs: update README and CONTRIBUTING for dataset_id-based API and new tasks by Mattdl · Pull Request #44 · techwolf-ai/workrb

Mattdl · 2026-03-04T11:28:37Z

Summary

README.md: Rewrite the metrics & aggregation section to document the new dataset → language → task → task_group → task_type aggregation chain, LanguageAggregationMode (4 modes), ExecutionMode (lazy/all), available ranking/classification metrics, and evaluate_multiple_models(). Move the tasks & models tables above the Usage Guide for better discoverability and add class names. Fix the checkpointing code example (wrong variable name, formatting).
CONTRIBUTING.md: Update the "Adding a New Task" guide to use the new load_dataset(dataset_id, split) signature (replacing load_monolingual_data); document the cross-lingual/multi-dataset task overrides (languages_to_dataset_ids, get_dataset_languages); update test examples to use task.datasets[dataset_id] instead of task.lang_datasets[Language.EN]; and add sections on CI/CD workflows, conventional commit format, and keeping forks up-to-date with upstream.
Examples: Replace MELO/MELS TODO comments with actual task usage (MELORanking, MELSRanking) in all four benchmark example scripts.
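The dataset → language → task → task_group → task_type chain mentioned in the README summary can be pictured as repeated averaging, dropping one grouping key per step. The sketch below is a hypothetical illustration, not workrb's implementation: the `Result` record, the `aggregate_chain` helper, the sample names and scores, and the use of an unweighted mean at every level are all assumptions, and it does not model the four LanguageAggregationMode options.

```python
from collections import defaultdict
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class Result:
    """One metric value for one dataset (hypothetical record, not a workrb type)."""
    task_type: str
    task_group: str
    task: str
    language: str
    dataset_id: str
    score: float


def aggregate_chain(results):
    """Average dataset -> language -> task -> task_group -> task_type."""
    if not results:
        return {}
    # Start at the finest level: one score per fully qualified key.
    level = {
        (r.task_type, r.task_group, r.task, r.language, r.dataset_id): r.score
        for r in results
    }
    # Each pass drops the last key component and averages the scores
    # that share the remaining prefix (assumed: unweighted mean).
    while len(next(iter(level))) > 1:
        buckets = defaultdict(list)
        for key, score in level.items():
            buckets[key[:-1]].append(score)
        level = {key: mean(scores) for key, scores in buckets.items()}
    # Only the task_type component is left in each key.
    return {key[0]: score for key, score in level.items()}


# Made-up scores purely for illustration.
results = [
    Result("ranking", "group_a", "task_a", "en", "a_en_1", 0.8),
    Result("ranking", "group_a", "task_a", "en", "a_en_2", 0.6),
    Result("ranking", "group_a", "task_a", "de", "a_de_1", 0.5),
    Result("ranking", "group_b", "task_b", "en", "b_en_1", 0.9),
]
scores = aggregate_chain(results)
```

With these made-up inputs the chained score for "ranking" comes out near 0.75, whereas a flat mean over the four datasets would give 0.7. That gap is why the order of aggregation levels is worth documenting.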

Closes #43

Introduce the MELO (Multilingual Entity Linking of Occupations) and MELS (Multilingual Entity Linking of Skills) benchmarks as new ranking tasks. MELO provides 42 evaluation datasets spanning 21 languages for job title normalization into ESCO, built from crosswalks between national occupation taxonomies and ESCO published by official EU labor organizations. MELS follows the same methodology but targets skill normalization, covering 5 languages with 8 datasets.

- Add MELORanking task class with 42 datasets across 21 languages
- Add MELSRanking task class with 8 datasets across 5 languages
- Implement get_dataset_languages() for both tasks, supporting monolingual and cross-lingual dataset variants
- Add Austria and Belgium datasets to MELO (6 additional dataset IDs)
- Add unit tests for dataset ID filtering and language mapping
- Update README and example scripts with new tasks
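To make the monolingual versus cross-lingual distinction concrete, here is a minimal sketch of filtering dataset IDs by requested language. It is a stand-alone illustration under stated assumptions, not the MELORanking or MELSRanking code: the `DATASET_LANGUAGES` table, its dataset IDs, and the `filter_dataset_ids` helper are hypothetical, and only the name `get_dataset_languages` is taken from the description above (its real signature and return type may differ).

```python
from __future__ import annotations

# Hypothetical dataset IDs mapped to (query language, target language).
# Real MELO/MELS dataset IDs are not listed in this PR description.
DATASET_LANGUAGES = {
    "toy_nl_nl": ("nl", "nl"),  # monolingual: queries and targets share a language
    "toy_nl_en": ("nl", "en"),  # cross-lingual: queries in nl, targets in en
    "toy_de_de": ("de", "de"),
    "toy_de_en": ("de", "en"),
}


def get_dataset_languages(dataset_id: str) -> set[str]:
    """Return every language a dataset involves (one if monolingual, two if cross-lingual)."""
    try:
        return set(DATASET_LANGUAGES[dataset_id])
    except KeyError:
        raise ValueError(f"Unknown dataset_id: {dataset_id!r}") from None


def filter_dataset_ids(requested_languages) -> list[str]:
    """Keep only datasets whose languages are all among the requested ones (assumed policy)."""
    requested = set(requested_languages)
    return sorted(
        dataset_id
        for dataset_id in DATASET_LANGUAGES
        if get_dataset_languages(dataset_id) <= requested
    )


selected = filter_dataset_ids(["nl", "en"])
```

Requiring every language of a dataset to be requested is only one possible policy. A unit test for dataset ID filtering, in the spirit of the ones this change adds, could assert that a cross-lingual dataset is excluded when just one of its two languages is requested.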


federetyk and others added 11 commits February 26, 2026 18:04

chore: exclude CLAUDE.md from git

fe1f18a

docs: doc updates to README and CONTRIBUTING

abbcc03

docs: contributing guideline for keeping main up-to-date

81bbd0d

docs: README and CONTRIBUTING updates

57b7c32

docs: README update for all contributions, including task & models overview, metrics overview, and example of running multiple models

b4582fb

docs: CONTRIBUTING.md update on CICD, examples cross-lingual tasks, and commit formatting

e7196a0

docs: README header order switch fix

d5c4c49

chore: gitignore exclude .claude

7dbf9db

chore: combine all 4 aggregation examples in a single example

3375d0d

docs: example for running all tasks and models

b084851

Mattdl merged commit 1064cbb into main Mar 4, 2026
2 checks passed

Mattdl deleted the mdl-docs-refactor branch March 4, 2026 16:55

Mattdl merged 11 commits into main from mdl-docs-refactor
