Skip to content

Lexibank is a curated collection of lexical datasets.

The Lexibank infrastructure consists of

  • a dataset specification based on CLDF Wordlists
  • a registry of published Lexibank datasets on Zenodo
  • pylexibank - a cldfbench plugin to curate lexibank datasets

Candidates for new lexibank datasets are triaged as issues at https://github.com/lexibank/.github/issues

Pinned Loading

  1. pylexibank pylexibank Public

    The python curation library for lexibank

    Python 21 7

Repositories

Showing 10 of 188 repositories
  • tlopo Public

    The lexicon of Proto Oceanic

    lexibank/tlopo’s past year of commit activity
    TeX 0 CC-BY-4.0 0 3 0 Updated Mar 3, 2026
  • lexibank/amazonianvoices’s past year of commit activity
    Python 0 0 0 0 Updated Feb 27, 2026
  • khalidasur Public

    CLDF dataset derived from Khalid's "Grammatical Sketch of Asur" from 2020

    lexibank/khalidasur’s past year of commit activity
    Python 0 CC-BY-4.0 0 0 0 Updated Feb 25, 2026
  • chacontukanoan Public

    CLDF dataset derived from Chacon's "Revised Proposal of Proto-Tukanoan consonants" from 2014

    lexibank/chacontukanoan’s past year of commit activity
    Python 1 CC-BY-4.0 0 2 0 Updated Feb 17, 2026
  • pylexibank Public

    The python curation library for lexibank

    lexibank/pylexibank’s past year of commit activity
    Python 21 Apache-2.0 7 4 0 Updated Feb 12, 2026
  • uralex Public

    UraLex basic vocabulary dataset

    lexibank/uralex’s past year of commit activity
    TeX 4 CC-BY-4.0 5 5 0 Updated Feb 10, 2026
  • ideobank Public
    lexibank/ideobank’s past year of commit activity
    TeX 0 CC-BY-4.0 0 1 0 Updated Feb 10, 2026
  • walkerarawakan Public

    Walker and Ribeiro (2011) Arawakan dataset

    lexibank/walkerarawakan’s past year of commit activity
    Python 0 CC-BY-4.0 0 0 0 Updated Jan 6, 2026
  • northperulex Public

    CLDF dataset derived from Ugarte et al.'s "NorthPeruLex - A Lexical Dataset of Small Language Families and Isolates from Northern Peru (forthcoming)

    lexibank/northperulex’s past year of commit activity
    Python 0 CC-BY-4.0 0 3 0 Updated Dec 30, 2025
  • lundgrenomagoa Public

    CLDF dataset derived from Lundgren's "Phonological Reconstruction of Proto-Omagua–Kokama–Tupinambá" from 2020

    lexibank/lundgrenomagoa’s past year of commit activity
    Python 0 CC-BY-4.0 0 1 0 Updated Dec 22, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…