Skip to main content
Abstract

Visual groundedness as an organizing principle for word class: Evidence from Japanese  

Authors
  • Coleman Haley orcid logo (University of Edinburgh)
  • Sharon Goldwater
  • Edoardo Ponti

Abstract

How do languages structure word classes, and is this organization arbitrary? This study explores groundedness—the degree of association between a word in an utterance and the meaning which the utterance describes—as a potential organizing factor. In particular, we look at visual groundededness, where meaning is approximated with an image, allowing tractable estimation with neural vision-and-language models. Prior work showed that nouns, adjectives, and verbs differ in groundedness cross-linguistically. We test whether groundedness describes the atypical word class structure in Japanese, where adjectives are split split into i-adjectives (formally verb-like) and na-adjectives (noun-like). Analyzing the Crossmodal-3600 dataset with the PaliGemma model, we find that na-adjectives exhibit significantly (p=0.029) higher visual groundedness, suggesting that the formal similarities of these classes to nouns and verbs reflect their semantics. This challenges the idea that linguistic categories are purely conventional or innate and arbitrary, and supports a view where cross-linguistic variation in language structure emerges from human perception.

Keywords: japanese, word classes, parts of speech, comparative concepts, typology, iconicity, grammatical iconicity, image captioning, surprisal, vision and language models, concreteness, lexical functional distinction, grammaticalization, groundedness, multimodality, multimodal NLP

How to Cite:

Haley, C., Goldwater, S. & Ponti, E., (2025) “Visual groundedness as an organizing principle for word class: Evidence from Japanese  ”, Society for Computation in Linguistics 8(1): 41. doi: https://doi.org/10.7275/scil.3174

Downloads:
Download PDF

27 Views

6 Downloads

Published on
2025-06-14

Peer Reviewed