Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 3 additions & 2 deletions concepticondata/conceptlists.tsv
Original file line number Diff line number Diff line change
Expand Up @@ -179,8 +179,8 @@ Lee-2011-211 Lee, S. and Hasegawa, Toshiharu 2011 211 basic English Japonic Le
Nagaraja-2013-200 Nagaraja, K. S. and Sidwell, Paul and Greenhill, Simon J. 2013 200 areal English Khasian Nagaraja2013 Nagaraja2013 This list was used for a lexicostatistical investigation of four Khasian languages. Judging from the concept labels, it was directly derived from [Gudschinsky's list of 200 items](:ref:Gudschinsky-1956-200) and is therefore not completely identical with [Swadesh's list of 200 items](:ref:Swadesh-1952-200). Furthermore, the compilers read the concept label «lie» as «lie, deceive», while most other concept lists read it as «lie, rest». 07-11
Dixon-1919-175 Dixon, Roland B. and Kroeber, Alfred L. 1919 173 specific English languages of California Dixon1919 Dixon1919 This is an early example of a concept list which was employed for the purpose of language comparison. Strictly speaking, the list itself contains much less conceptual items than 173, since the authors list items which are expressed by multiple cognate sets in the languages of California separately. In order to get these cognate sets, they used a list of 225 English words which is, unfortunately, not reported in the study. According to the authors, the list of cognate sets contains 175 items, but in the source, only 171 items are listed, with 2 further being labeled as secondary in the source. In order to keep close to the original, we decided to add their concept list in unmodified form to the concepticon. 56-61
Dyen-1964-196 Dyen, Isidore 1964 196 ranked English Malayo-Polynesian Dyen1964 Dyen1964 This list represents the original [list of 200 items by Swadesh](:ref:Swadesh-1952-200) which was ranked by comparing the amount of cognate pairs found in 89 Malayo-Polynesian languages, with items of low ranks representing items which have a high amount of cognate pairs when comparing all languages in a pairwise fashion. The author gives also a [ranked list for Indo-European languages](:ref:Dyen-1964-154). 244-246
Tryon-1983-200 Tryon, Darrel T. and Hackman, Brian D. 1983 200 areal English Melanesian Tryon1983 Tryon1983 This list is based on the two lists published in [Samarin (1967)](:bib:Samarin1967), and was used to carry out a lexicostatistic subgrouping of the languages of the Solomon islands. The compilers note that they replaced forty items which were found unsuitable for their comparison with alternative concepts. In the list, those concepts which, according to the compilers, stem from [Samarin's list of 100 items](:ref:Samarin-1967-100) are marked with one asterisk, those items which stem from [the list of 200 items](:ref:Samarin-1967-218) are marked with two asterisks, and the 40 additional items are left unmarked. This list is a subset of a larger [list of 324 items](:ref:Tryon-1983-324) that was collected and published along with the source and used for broader investigations, including the determination of regular sound correspondences. 28f
Tryon-1983-324 Tryon, Darrel T. and Hackman, Brian D. 1983 324 questionnaire English Melanesian Tryon1983 Tryon1983 This list was used to carry out comparative investigations of the languages of the Solomon islands. The compilers also provide a shorter [list of 200 items](:ref:Tryon-1983-200) which they used to carry out a lexicostatistic subgrouping of the languages. 115-118
Tryon-1983-200 Tryon, Darrell T. and Hackman, Brian D. 1983 200 areal English Melanesian Tryon1983 Tryon1983 This list is based on the two lists published in [Samarin (1967)](:bib:Samarin1967), and was used to carry out a lexicostatistic subgrouping of the languages of the Solomon islands. The compilers note that they replaced forty items which were found unsuitable for their comparison with alternative concepts. In the list, those concepts which, according to the compilers, stem from [Samarin's list of 100 items](:ref:Samarin-1967-100) are marked with one asterisk, those items which stem from [the list of 200 items](:ref:Samarin-1967-218) are marked with two asterisks, and the 40 additional items are left unmarked. This list is a subset of a larger [list of 324 items](:ref:Tryon-1983-324) that was collected and published along with the source and used for broader investigations, including the determination of regular sound correspondences. 28f
Tryon-1983-324 Tryon, Darrell T. and Hackman, Brian D. 1983 324 questionnaire English Melanesian Tryon1983 Tryon1983 This list was used to carry out comparative investigations of the languages of the Solomon islands. The compilers also provide a shorter [list of 200 items](:ref:Tryon-1983-200) which they used to carry out a lexicostatistic subgrouping of the languages. 115-118
Gabelentz-1861-24 Gabelentz, Hans Georg Conon von der 1861 24 historical German Melanesian languages Gabelentz1861 Gabelentz1861 This list is only a brief representative of the large collections of words from various languages which scholars carried out in the 18th and 19th century, starting with scholars like Peter Simon Pallas. 5-8
Miller-1994-281 Miller, John D. 1994 281 areal English Mon-Khmer Miller1994 Miller1994 In this list, the author tested the suitability of the [SIL list of 281 items](:ref:SIL-1980-281) to be applied to Mon-Khmer languages in Thailand. The author excludes several items from the original list and retains 203 words in the end. In our representation, these decisions are marked in the column STATUS, with *i* indicating retained elements, *e* excluded elements, and *p* problematic elements. There occurs also one *u*, but this is apparently a spelling error, since the author does not give any further description for that. 72-81
Thomas-1960-168 Thomas, David 1960 168 ranked English Mon-Khmer Thomas1960 Thomas1960 This list provides stability ranks for four Mon-Khmer languages and compares them with the ranks presented in the [215 item list](:ref:Swadesh-1955-215) from [Swadesh (1955)](:bib:Swadesh1955). The author does not provide all of the 215 items presented in the study by Swadesh, but only a subset of them. The 215 items themselves were first presented in [Swadesh (1950)](:ref:Swadesh-1950-215) where they formed the basis of all initial calculations of Swadesh's lexicostatistics and glottochronology. 8-10
Expand Down Expand Up @@ -465,6 +465,7 @@ Ramanujan-2019-158 Ramanujan, Keerthi and Weekes, Brendan S. 2019 158 naming te
Dunabeitia-2022-500 Duñabeitia, Jon Andoni and Baciero, Ana and Antoniou, Kyriakos and Antoniou, Mark and Ataman, Esra and Baus, Cristina and Ben-Shachar, Michal and Çağlar, Ozan Can and Chromý, Jan and Comesaña, Montserrat and Filip, Maroš and Filipović Đurđević, Dušica and Gillon Dowens, Margaret and Hatzidaki, Anna and Januška, Jiří and Jusoh, Zuraini and Kanj, Rama and Kim, Say Young and Kırkıcı, Bilal and Leminen, Alina and Lohndal, Terje and Yap, Ngee Thai and Renvall, Hanna and Rothman, Jason and Royle, Phaedra and Santesteban, Mikel and Sevilla, Yamila and Slioussar, Natalia and Vaughan-Evans, Awel and Wodniecka, Zofia and Wulff, Stefanie and Pliatsikas, Christos 2022 500 naming test, basic English American English, Australian English, Basque, British English, Catalan, Cypriot Greek, Czech, Finnish, French, German, Greek, Hebrew, Hungarian, Italian, Korean, Lebanese Arabic, Malay, Malaysian English, Mandarin Chinese, Belgium Dutch, Netherlands Dutch, Norwegian, Polish, Portuguese, Quebec French, Rioplatense Spanish, Russian, Serbian, Slovak, Spanish, Turkish, Welsh, Cantonese, Galician, Ukrainian global https://figshare.com/articles/dataset/Untitled_Item/19328939 Dunabeitia2022 This is a continuation of the [Dunabeitia et al. (2018)](:bib:Dunabeitia2018)dataset, now with 35 language varieties (v.8) (American English, Australian English, Basque, Belgium Dutch, British English, Catalan, Cypriot Greek, Czech, Finnish, French, German, Greek, Hebrew, Hungarian, Italian, Korean, Lebanese Arabic, Malay, Malaysian English, Mandarin Chinese, Netherlands Dutch, Norwegian, Polish, Portuguese, Quebec French, Rioplatense Spanish, Russian, Serbian, Slovak, Spanish, Turkish, Welsh, Cantonese, Galician and Ukrainian) in total and with familiarity ratings for 500 chosen picture stimuli of the original 750 (currently not available for all languages). Apart from Familiarity, we provide the following variables: Shannon Diversity Scores, Modal Response Percentage, 'I don't know' Response Percentage, Idiosyncratic Response Percentage.
DellaRosa-2010-417 Della Rosa, Pasquale A. and Catricà, Eleonora and Vigliocco, Gabriella and Cappa, Stefano F. 2010 417 ratings English Italian https://doi.org/10.3758/BRM.42.4.1042 DellaRosa2010 This list of 417 Italian nouns was rated concerning mode of acquisition, concreteness, imageability, familiarity, age of acquisition, context availability, and abstractness. Ratings were given by 250 Italian native speakers.
Ray-1889-78 Ray, Sidney H. 1889 78 English Oceanic languages https://zenodo.org/records/2403292 Ray1889 This list of 78 words was chosen to show the 'variation' in some Oceanic languages compared to Baki. 302-303
Tryon-1995-1310 Tryon, Darrell T. 1995 1310 questionnaire English Austronesian languages https://doi.org/10.1515/9783110884012 Tryon1995 This list comprises 1310 basic vocabulary items, modifying and extending the impactful [1110-items list by Buck (1949)](:ref:Buck-1949-1110) by 200 items that are particularly relevant to Austronesian peoples.
Huisman-2025-255 Huisman, John L. A. and McLean, Bonnie and Wu, Chieh-Hsi 2025 255 English Japonic https://doi.org/10.1093/jole/lzaf002 Huisman2025 This is a new lexical dataset, NichiRyuuLex, using an expanded concept list that consolidates the [Austronesian Basic Vocabulary Database list](:ref:Blust-2008-210), as used by [Lee and Hasegawa 2011](:ref:Lee-2011-211), the [100 item](:ref:Swadesh-1955-100) and [200 item](:ref:Swadesh-1952-200) Swadesh lists, and the Leipzig-Jakarta List [Leipzig-Jakarta List](:ref:Tadmor-2009-100). The combined list comprised 266 concepts once cleaned, but was further adapted to better fit the Japonic lexicon, resulting in a total of 256 concepts. Note that although the paper says 256 concepts there are only 255 in the list. s4-8
Schauenburg-2015-858 Schauenburg, Gesche and Ambrasat, Jens and Schröder, Tobias and Von Scheve, Christian and Conrad, Markus 2015 858 ratings German German https://doi.org/10.3758/s13428-014-0494-7 Schauenburg2015 This list contains ratings of valence, arousal, potency, authority and community for 858 German words. Participants were native German speakers recruited through university mailing lists. 720-735
Janschewitz-2008-460 Janschewitz, Kristin 2008 460 ratings English English https://doi.org/10.3758/BRM.40.4.1065 Janschewitz2008 This list of 460 English taboo, emotionally valenced, and emotionally neutral words was rated for frequency, inappropriateness, valence, arousal, and imageability by 78 native-English-speaking college students. 1065-1074
Expand Down
Loading
Loading