Hidden Structure and Function in the Lexicon

Picard, Olivier; Lord, Mélanie; Blondin-Massé, Alexandre; Marcotte, Odile; Lopes, Marco et Harnad, Stevan (2013). « Hidden Structure and Function in the Lexicon » (NLPCS 2013 : 10th International Workshop on Natural Language Processing and Cognitive Science, Marseille, France, 15 au 18 octobre 2013)

Fichier(s) associé(s) à ce document :
[img]
Prévisualisation
PDF
Télécharger (1MB)

Résumé

Abstract. How many words are needed to define all the words in a dictionary? Graph-theoretic analysis reveals that about 10% of a dictionary is a unique Kernel of words that define one another and all the rest, but this is not the smallest such subset. The Kernel consists of one huge strongly connected component (SCC), about half its size, the Core, surrounded by many small SCCs, the Satellites. Core words can define one another but not the rest of the dictionary. The Kernel also contains many overlapping Minimal Grounding Sets (MGSs), each about the same size as the Core, each part-Core, part-Satellite. MGS words can define all the rest of the dictionary. They are learned earlier, more concrete and more frequent than the rest of the dictionary. Satellite words, not correlated with age or frequency, are less concrete (more abstract) words that are also needed for full lexical power.

Type: Communication, article de congrès ou colloque
Mots-clés ou Sujets: Accès libre, dépôt institutionnel, mandats, édition scientifique
Unité d'appartenance: Faculté des sciences humaines > Département de psychologie
Déposé par: Stevan Harnad
Date de dépôt: 26 nov. 2014 13:47
Dernière modification: 26 nov. 2014 13:47
Adresse URL : http://archipel.uqam.ca/id/eprint/6391

Statistiques

Voir les statistiques sur cinq ans...