Postdoc in psychology and cognitive neuroscience mainly interested in conceptual combination, semantic memory and computational modeling. https://marcociapparelli.github.io/
Compare concept representations across modalities in unimodal models, using the AlexNet convolutional neural network to represent images and an LLM to represent their captions
July 18, 2025 at 1:40 PM
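A minimal Python sketch of what this comparison could look like, assuming a handful of local image files with matching captions. The file names, the use of BERT as a stand-in text model, the fc7 layer choice, and the mean-pooling strategy are all illustrative assumptions, not a fixed recipe:

```python
import numpy as np
import torch
from PIL import Image
from scipy.spatial.distance import pdist
from scipy.stats import spearmanr
from torchvision import models
from transformers import AutoModel, AutoTokenizer

# Toy stimuli: image files and their captions, aligned by index (paths are placeholders)
image_paths = ["dog.jpg", "apple.jpg", "car.jpg", "tree.jpg"]
captions = ["a dog running on grass", "a red apple on a table",
            "a car parked on the street", "a tree in a field"]

# --- Visual representations: AlexNet fc7 activations ---
weights = models.AlexNet_Weights.DEFAULT
alexnet = models.alexnet(weights=weights).eval()
preprocess = weights.transforms()

def alexnet_fc7(path):
    img = preprocess(Image.open(path).convert("RGB")).unsqueeze(0)
    with torch.no_grad():
        x = alexnet.avgpool(alexnet.features(img)).flatten(1)
        x = alexnet.classifier[:6](x)   # stop after the second fully connected layer (fc7)
    return x.squeeze(0).numpy()

# --- Linguistic representations: mean-pooled hidden states of a text model ---
tok = AutoTokenizer.from_pretrained("bert-base-uncased")
lm = AutoModel.from_pretrained("bert-base-uncased").eval()

def caption_embedding(text):
    enc = tok(text, return_tensors="pt")
    with torch.no_grad():
        hidden = lm(**enc).last_hidden_state
    return hidden.mean(dim=1).squeeze(0).numpy()

vis = np.stack([alexnet_fc7(p) for p in image_paths])
lang = np.stack([caption_embedding(c) for c in captions])

# --- RSA: one RDM per modality, then rank-correlate their condensed forms ---
rdm_vis = pdist(vis, metric="correlation")
rdm_lang = pdist(lang, metric="correlation")
rho, p = spearmanr(rdm_vis, rdm_lang)
print(f"Cross-modal RSA: rho = {rho:.3f}, p = {p:.3f}")
```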
Perform representational similarity analysis to compare how the same concepts are represented across languages (in their corresponding monolingual models) and in different layers of LLMs
July 18, 2025 at 1:40 PM
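As a rough sketch of this idea: extract layer-wise embeddings for translation-equivalent words from two monolingual models, build one RDM per layer and language, and rank-correlate them layer by layer. The word lists and model names below are placeholders, and the two models are assumed to have the same number of layers:

```python
import torch
from scipy.spatial.distance import pdist
from scipy.stats import spearmanr
from transformers import AutoModel, AutoTokenizer

# Translation-equivalent concept words, aligned by index (illustrative)
words_en = ["dog", "apple", "car", "house", "tree"]
words_de = ["Hund", "Apfel", "Auto", "Haus", "Baum"]

def layerwise_embeddings(model_name, words):
    """Return a tensor of shape (n_layers + 1, n_words, hidden_size)."""
    tok = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name, output_hidden_states=True).eval()
    per_word = []
    for w in words:
        enc = tok(w, return_tensors="pt")
        with torch.no_grad():
            hidden = model(**enc).hidden_states   # tuple: embedding layer + each transformer layer
        # Average over (sub)word tokens, excluding the special tokens at the edges
        per_word.append(torch.stack([h[0, 1:-1].mean(dim=0) for h in hidden]))
    return torch.stack(per_word, dim=1)

emb_en = layerwise_embeddings("bert-base-uncased", words_en)
emb_de = layerwise_embeddings("bert-base-german-cased", words_de)

# Cross-lingual RSA, layer by layer: same concepts, different languages and models
for layer in range(emb_en.shape[0]):
    rdm_en = pdist(emb_en[layer].numpy(), metric="correlation")
    rdm_de = pdist(emb_de[layer].numpy(), metric="correlation")
    rho, _ = spearmanr(rdm_en, rdm_de)
    print(f"layer {layer:2d}: cross-lingual RSA rho = {rho:.3f}")
```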
Replace words with sense-appropriate and sense-inappropriate alternatives in the WiC annotated dataset and look at the effects of context-word interaction on embeddings and surprisal
July 18, 2025 at 1:40 PM
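A sketch of the surprisal side of this idea, with GPT-2 as a stand-in language model; the context sentence and the substitute words are invented for illustration rather than drawn from WiC:

```python
import math
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
lm = AutoModelForCausalLM.from_pretrained("gpt2").eval()

def word_surprisal(prefix, word):
    """Surprisal (in bits) of `word` given the left context `prefix`."""
    prefix_ids = tok(prefix, return_tensors="pt").input_ids
    word_ids = tok(" " + word, return_tensors="pt").input_ids   # leading space for GPT-2's BPE
    input_ids = torch.cat([prefix_ids, word_ids], dim=1)
    with torch.no_grad():
        log_probs = torch.log_softmax(lm(input_ids).logits, dim=-1)
    # Sum the log-probabilities of the word's tokens, each predicted from its left context
    offset = prefix_ids.shape[1]
    total = sum(log_probs[0, offset + i - 1, word_ids[0, i]].item()
                for i in range(word_ids.shape[1]))
    return -total / math.log(2)

# WiC-style context where the target has its financial sense; "lender" is a
# sense-appropriate substitute, "riverbank" a sense-inappropriate one (all invented)
context = "She deposited her paycheck at the"
for substitute in ["bank", "lender", "riverbank"]:
    print(f"{substitute:10s} surprisal: {word_surprisal(context, substitute):.2f} bits")
```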
Extract word embeddings from BERT and inspect how context can modulate their representation. For example, what happens to "fruitless" when we place it in a sentence that points to its typical metaphorical meaning ("vain") as opposed to one where its meaning is literal ("without fruits")?
July 18, 2025 at 1:40 PM
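A minimal sketch of that kind of probe, assuming bert-base-uncased and two invented example sentences; the same contextual vectors could also be compared against an embedding of "vain" or of "fruitless" out of context:

```python
import torch
from transformers import AutoModel, AutoTokenizer

tok = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased").eval()

def word_in_context(sentence, word):
    """Mean of the last-layer hidden states over the (sub)word tokens of `word`."""
    enc = tok(sentence, return_tensors="pt")
    word_ids = tok(word, add_special_tokens=False).input_ids
    ids = enc.input_ids[0].tolist()
    # Locate the target word's token span in the sentence (first occurrence)
    start = next(i for i in range(len(ids)) if ids[i:i + len(word_ids)] == word_ids)
    with torch.no_grad():
        hidden = model(**enc).last_hidden_state[0]
    return hidden[start:start + len(word_ids)].mean(dim=0)

metaphorical = word_in_context("All their efforts to fix the engine proved fruitless.", "fruitless")
literal = word_in_context("After the drought, the orchard was fruitless.", "fruitless")

cos = torch.nn.functional.cosine_similarity
print("metaphorical vs. literal context:", cos(metaphorical, literal, dim=0).item())
```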
7/7 Additional compositional representations emerge in left STS and semantic (but not compositional) representations in the left angular gyrus. Check out the preprint for more! Link to OSF project repo (includes code & masks used): osf.io/3dnqg/?view_...
6/7 We find evidence of compositional representations in left IFG (BA45), even when focusing on a data subset where the task didn't require semantic access. We take this to suggest that BA45 represents combinatorial information automatically, across task demands, and we characterize the combination it performs as feature intersection
April 28, 2025 at 12:33 PM
5/7 We conduct confirmatory RSA in four ROIs for which we have a priori hypotheses of ROI-model correspondence (based on what we know of composition in models and what has been claimed of composition in ROIs), and searchlight RSAs in the general semantic network.
April 28, 2025 at 12:33 PM
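For readers less familiar with RSA, the core computation is compact; below is a sketch with random arrays standing in for ROI activity patterns and model vectors (stimulus, voxel, and dimension counts are arbitrary):

```python
import numpy as np
from scipy.spatial.distance import pdist
from scipy.stats import spearmanr

rng = np.random.default_rng(0)

# Placeholders: one activity pattern per two-word stimulus (e.g., ROI voxels)
# and one model vector per stimulus (e.g., a composed embedding)
n_stimuli, n_voxels, n_dims = 40, 200, 300
neural_patterns = rng.normal(size=(n_stimuli, n_voxels))
model_vectors = rng.normal(size=(n_stimuli, n_dims))

# Representational dissimilarity matrices, kept in condensed (upper-triangle) form
neural_rdm = pdist(neural_patterns, metric="correlation")
model_rdm = pdist(model_vectors, metric="correlation")

# Confirmatory RSA: rank-correlate the neural and model RDMs
rho, p = spearmanr(neural_rdm, model_rdm)
print(f"RSA: rho = {rho:.3f}, p = {p:.3f}")
```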
4/7 To better target composition beyond specific task demands, we re-analyze fMRI data aggregated from four published studies (N = 85), all employing two-word combinations but differing in task requirements.
April 28, 2025 at 12:33 PM
3/7 To do so, we use word embeddings to represent single words, multiple algebraic operations to combine word pairs, and RSA to compare representations in models and target regions of interest. Model performance is then related to the specific compositional operation implemented.
April 28, 2025 at 12:33 PM
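A rough sketch of what several algebraic composition operations can look like, with random vectors standing in for constituent embeddings; the operations and weights are illustrative, and each composed matrix yields a candidate model RDM that can be tested against ROI RDMs as in the RSA sketch above:

```python
import numpy as np
from scipy.spatial.distance import pdist

rng = np.random.default_rng(1)

# Placeholder embeddings for 40 two-word combinations (modifier, head), 300 dimensions each
n_pairs, dim = 40, 300
modifiers = rng.normal(size=(n_pairs, dim))
heads = rng.normal(size=(n_pairs, dim))

# A few candidate composition functions (operations and weights are illustrative)
compositions = {
    "addition":       modifiers + heads,
    "multiplication": modifiers * heads,            # element-wise; akin to feature intersection
    "weighted sum":   0.4 * modifiers + 0.6 * heads,
    "head only":      heads,                        # non-compositional baseline
}

# One candidate model RDM per composition function, ready for RSA against each ROI
model_rdms = {name: pdist(vecs, metric="correlation") for name, vecs in compositions.items()}
for name, rdm in model_rdms.items():
    print(f"{name:15s}: {rdm.shape[0]} pairwise dissimilarities")
```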
2/7 Most neuroimaging studies rely on high-level contrasts (e.g., complex vs. simple words), which are useful for identifying regions sensitive to composition, but less so for establishing *how* constituents are combined (i.e., which functions best describe the composition these regions carry out)
April 28, 2025 at 12:33 PM
13/n In this context, LLMs' flexibility allows them to generate representations of possible/implicit meanings, leading to representational drifts proportional to their plausibility.
12/n Overall, our approach is consistent with theoretical proposals positing that word (and compound word) meaning should be conceptualized as a set of possibilities that might or might not be realized in a given instance of language use.
March 19, 2025 at 2:07 PM
11/n Also, bigger model != better: the best layer of BERT consistently outperformed the best layer of Llama. Results align with NLP/cognitive findings showing that LLMs are viable representational models of compound meaning but struggle with genuinely combinatorial stimuli.
March 19, 2025 at 2:07 PM
10/n As expected, LLMs vastly outperform DSMs on familiar compounds. Yet, unlike DSMs, LLM performance drops considerably on novel compounds. In fact, on novel compounds some DSMs outperform the best layer of BERT and Llama! (The image shows model fit; the lower, the better.)
March 19, 2025 at 2:07 PM