Our findings show that state-of-the-art instruction-tuned models perform best, reaching macro F1 scores of nearly 0.8 for both traits.
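For readers unfamiliar with the metric, macro F1 is the unweighted mean of the per-class F1 scores, so a rare class counts as much as a frequent one. The following is a minimal sketch of that computation, not the evaluation code used in this work; the function name `macro_f1` and the toy labels are assumptions made for illustration.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Return the unweighted mean of per-class F1 scores."""
    labels = sorted(set(y_true) | set(y_pred))
    if not labels:
        return 0.0

    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative

    scores = []
    for label in labels:
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the denominator is 0.
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical labels for a single trait (illustrative only).
gold = ["high", "high", "low", "low"]
predicted = ["high", "low", "low", "low"]
score = macro_f1(gold, predicted)
```

In this toy case the per-class F1 scores are 2/3 for "high" and 4/5 for "low", and the macro score is their plain average.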
pronounced for the economic dimension (left panel), but also present in the cultural dimension (right panel).
tag, and (2) individual scores for the U.S. Presidential Debate between Donald Trump and Kamala Harris on September 10, 2024
1) extensive operationalisation of two traits (agency and communion) using a validated framework,
2) human labelling and validation, and
3) multiple methods (SVM, XLM-RoBERTa, GPT-4o, Llama-3-8B, DeepSeek-V3) and measurement strategies.