Attributions | Elephant

Elephant builds on open data and open-source work. This page credits the sources we depend on.

Korean Learners Dictionary (한국어기초사전)

The vocabulary data in Elephant — including word entries, definitions, bilingual translations, and thematic and semantic categories — is derived from the Korean Learners Dictionary (한국어기초사전), published by the National Institute of Korean Language (국립국어원, NIKL).

This data is used under the Creative Commons Attribution-ShareAlike 2.0 Korea (CC BY-SA 2.0 KR) license.

The original XML data was processed and imported into a relational database schema. The processing scripts are available at github.com/ketzu/krdict-to-sqlite.

What is covered

Word entries: Korean headwords, part of speech, level, pronunciation, conjugations
Definitions: Korean-language definitions and annotations
Bilingual translations: lemmas and definitions in English, Spanish, Arabic, Chinese, French, Japanese, Mongolian, Russian, Thai, Vietnamese, and Indonesian
Thematic categories (e.g. Animals, Food, Daily Life)
Semantic categories

What is not covered

Audio pronunciation files in Elephant are generated independently via Murf.ai and are not derived from NIKL audio assets.

Tatoeba

Some example sentences in Elephant are sourced from Tatoeba, a collaborative sentence corpus created by its contributor community.

This sentence data is used under the Creative Commons Attribution 2.0 France (CC BY 2.0 FR) license.

Original sentence authors are credited by Tatoeba and retain copyright over their contributions.

FSRS Spaced Repetition Algorithm

Vocabulary scheduling uses the FSRS (Free Spaced Repetition Scheduler) algorithm, developed by the open-spaced-repetition community and widely used in the Anki ecosystem. FSRS is open source and freely available.