View on GitHub

openlexicon

Access to lexical databases

OpenLexicon: Easy access to lexical databases

At OpenLexicon, you will find:

  1. a directory of lexical databases
  2. scripts to query, manipulate or create lexical databases.
  3. apps to create dynamic graphical interfaces to the lexical databases. Some of the apps provide the infrastructure of http://www.lexique.org where many of lexical databases are hosted and can be queried on-line — check out, for example, http://www.lexique.org/shiny/openlexicon.
  4. documents of interest related to the processing of lexical resources.

Downloading a database to work locally (on your computer)

You can download datasets directly from the links provided in the README files. Then you would open them in Excel, in R with readr.read_csv or in Python with pandas.read_csv.

However, in R, there is a much better way: the R dataset fetcher.

It provides several advantages:

For example, to download the table of Lexique383:

    require(tidyverse)
    require(rjson)
    source('https://raw.githubusercontent.com/chrplr/openlexicon/master/datasets-info/fetch_datasets.R')
    lexique383 <- get_lexique383()

For maintainers

Contributing

Everybody is warmly encouraged to contribute, by adding new databases (To add a new dataset, you must create a .json file in datasets-info/_json. Check out How-to-install-a-new-dataset), an app, or a script.

First, check out Lexique’s google-group at https://groups.google.com/forum/#!forum/lexiqueorg. You can then contact christophe@pallier.org and boris.new@gmail.com privately, or, much better, fork the openlexicon repository (see How to fork a repo) and issue a pull request (see Creating a pull request from a fork).

TODO

Check out the TODO list

Crediting

Most databases have associated publications listed in their respective README files. They must be cited in any derivative work! The same goes for some of the scripts (read their documentation for instructions).

If needed, you can cite this repository as:

License

The files in this repository are distributed under a CC BY-SA 4.0 LICENSE (see https://creativecommons.org/licenses/by-sa/4.0/).


Time-stamp: <2019-10-05 09:48:32 christophe@pallier.org>