Attributions
Elephant builds on open data and open-source work. This page credits the sources we depend on.
Korean Learners Dictionary (한국어기초사전)
The vocabulary data in Elephant — including word entries, definitions, bilingual translations, and thematic and semantic categories — is derived from the Korean Learners Dictionary (한국어기초사전), published by the National Institute of Korean Language (국립국어원, NIKL).
This data is used under the Creative Commons Attribution-ShareAlike 2.0 Korea (CC BY-SA 2.0 KR) license.
The original XML data was processed and imported into a relational database schema. The processing scripts are available at github.com/ketzu/krdict-to-sqlite.
What is covered
- Word entries: Korean headwords, part of speech, level, pronunciation, conjugations
- Definitions: Korean-language definitions and annotations
- Bilingual translations: lemmas and definitions in English, Spanish, Arabic, Chinese, French, Japanese, Mongolian, Russian, Thai, Vietnamese, and Indonesian
- Thematic categories (e.g. Animals, Food, Daily Life)
- Semantic categories
What is not covered
Audio pronunciation files in Elephant are generated independently via Murf.ai and are not derived from NIKL audio assets.
FSRS Spaced Repetition Algorithm
Vocabulary scheduling uses the FSRS (Free Spaced Repetition Scheduler) algorithm, developed by the open-spaced-repetition community and widely used in the Anki ecosystem. FSRS is open source and freely available.