Open Mind
openmindjournal.bsky.social
Cognitive science journal published by MIT Press.
https://direct.mit.edu/opmi
Detection, Inspection, Return: An Object-Based Classification and Metric of Fixations in Complex Scenes
Abstract: Analyses of human gaze behaviour towards complex scenes typically aim to explain heatmaps or scan-paths. While heatmaps lack temporal information, scan-paths aim for a level of detail which often is impractical. We introduce a novel approach, based on the premise that most fixations target objects and do so in meaningfully different ways, depending on temporal context: Detection fixations (D) foveate an object for the first time; Inspection fixations (I) successively target object details; and Return fixations (R) revisit a previously fixated object after going elsewhere. To test the hypothesis that these classes capture distinct fixation profiles, we reanalysed a large dataset of scene fixations. We computed separate heatmaps for D, I, and R and found significantly higher inter-observer consistency within than between classes. Across fixations landing on different semantic features, the proportion of D, I, and R fixations varied consistently, and a semantic salience model trained to predict each type of fixations independently learned diverging distributions of feature weights. Further, we found a shift from D to I and R across viewing time, in line with previous findings on ambient and focal viewing modes. We tested and confirmed that the dynamics of this shift varied as a function of trial duration. Finally, we highlight the recent application of the D, I, R classification as a metric for gaze comparisons in the context of dynamic scenes, in which scan-path similarity metrics fail. We propose the D, I, and R classification as a computationally simple yet powerful tool to classify spatiotemporal aspects of scene fixations in an object-based and intuitive manner and provide well-documented code to implement it. Future research may explore potential functional differences between D, I, and R fixations.
February 2, 2026 at 8:45 PM
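The Detection / Inspection / Return rule described in the abstract above can be illustrated with a minimal sketch. This is not the authors' published code: the function name `classify_fixations`, the input format (one object label per fixation, `None` for fixations that hit no object), and the choice to treat a fixation on the background as "going elsewhere" are all assumptions made for illustration.

```python
from typing import List, Optional


def classify_fixations(object_ids: List[Optional[str]]) -> List[Optional[str]]:
    """Label each fixation as 'D', 'I', 'R', or None (no object fixated).

    Hypothetical helper, not the paper's implementation:
      D = first fixation ever on this object
      I = fixation on the same object as the immediately preceding fixation
      R = fixation on an already-seen object after looking elsewhere
    Assumption: a background fixation (None) counts as "elsewhere".
    """
    seen = set()          # objects fixated at least once so far
    previous = None       # object targeted by the preceding fixation
    labels: List[Optional[str]] = []
    for obj in object_ids:
        if obj is None:
            labels.append(None)      # background: left unclassified
        elif obj not in seen:
            labels.append("D")       # first time this object is foveated
            seen.add(obj)
        elif obj == previous:
            labels.append("I")       # consecutive fixation on the same object
        else:
            labels.append("R")       # revisit after going elsewhere
        previous = obj
    return labels


if __name__ == "__main__":
    sequence = ["cup", "cup", "face", None, "cup", "face", "face"]
    print(classify_fixations(sequence))
```

The sketch keeps only two pieces of state (the set of objects seen so far and the previous target), which matches the abstract's description of the classification as computationally simple; how fixations are assigned to objects in the first place is outside its scope.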
Using Artificial Neural Networks to Relate External Sensory Features to Internal Decisional Evidence
Abstract: All theories of perceptual decision-making postulate that external sensory information is transformed into the internal evidence that is used to judge the identity of the stimulus. However, the nature of this external-to-internal transformation is generally unknown. In two experiments, we examined how a particular stimulus feature, orientation, is transformed into internal evidence. Subjects judged whether Gabors were tilted clockwise or counterclockwise. The results of Experiment 1 demonstrated that increasing the stimulus tilt in fine-scale increments resulted in a linear increase in sensitivity. However, the results of Experiment 2 demonstrated that increasing the stimulus tilt in coarse-scale increments had little effect on sensitivity, suggesting a highly non-linear transformation. Critically, artificial neural networks (ANNs) trained on the orientation task reproduced the empirical results, providing a framework for examining this external-to-internal transformation. The ANNs’ internal activations revealed that fine-scale increments in tilt magnitude result in increasingly greater discriminability between the stimulus categories, but the degree of discriminability does not increase further after tilt magnitude becomes sufficiently large. Taken together, these results begin to reveal how external sensory information is transformed into the internal evidence that is used to judge the identity of a stimulus and suggest that ANNs could serve as a platform for understanding the mechanism underlying this critical transformation.
February 2, 2026 at 8:45 PM
The Scope and Limits of Iconic Prosody: Head Angle Predicts f0 Changes While Object Size Effects Are Absent
Abstract: The relation between the fundamental frequency of the voice (f0) and vertical space has been shown in previous studies; however, the underlying mechanisms are less clear. This study investigates the relationship between head angle and f0 in iconic prosody, along with the influence of object size on lip opening and formant frequencies. In the experiment, participants pointed to objects of two different sizes and in various vertical positions while saying the words “piff” or “paff,” which induced vertical head position change. Head angle emerged as a reliable predictor of f0, with a larger angle increasing the f0. This effect was consistent despite individual variations in head movement. While the vertical position of the object also showed a reliable effect on f0, head angle substantially outperformed it as a predictor, suggesting that head angle represents the primary physiological mechanism predicting f0 changes. Conversely, object size did not predict either lip opening or formant dispersion. Lip opening and formant dispersion were purely indexical, tracking vowel-specific articulatory configurations rather than external object properties. These findings underscore the role of head position in modulating f0 through direct physiological coupling, potentially underpinning iconic prosody, while revealing the limits of size-related iconicity in parameters constrained by phonemic requirements.
December 15, 2025 at 4:36 AM
What is Balance? A Vital Mechano-Regulation Paradigm
Abstract: Within minutes of birth a newborn gnu or giraffe works to stand and walk, asserting postural balance and organised animate behaviour in an apparently goal-directed manner. In contrast, robots learning to stand and walk from scratch begin with random flailing, the behaviour cohering over time as the robot internalises some reward/value signal. How does the newborn gnu ‘innately know’ what goal to aim for, and decide to work towards it? How could similar goal-directed balance learning be implemented in robots? Currently, animate balance inherits its axiomatic definition from the Newtonian formulation for inanimate balance: static mechanical equilibrium. This is arguably inappropriate for animate balance, because animals need to move and are never in static mechanical equilibrium, giving rise to the ‘posture-movement paradox’. The present Perspective proposes a more fluid, dynamical axiomatic task definition and goal which (a) isolates resisting gravity, (b) admits and enables movement, and (c) subsumes static mechanical equilibrium as a special case. This novel definition is founded upon inevitable biophysical requirements and observable developmental process. The article explains how animals apprehend and embed this goal through prenatal development suspended in equidense amniotic fluid, and then are challenged to self-maintain it by the perinatal transition. The account entails a paradigmatic shift in putative physiological organisation and associated conceptual framework for balance: from a subsidiary sensorimotor control task to a vital mechano-regulation task, organisationally akin to thermo-regulation. This vital mechano-regulation model of balance has practical implications and implies a range of predictions.
December 10, 2025 at 4:06 AM
The Multifaceted Ganzfeld at the Crossroad Between Visual Perception and Consciousness: Behavioral, Neural and Qualitative Aspects
Abstract: A Ganzfeld is a homogeneous visual field, devoid of any focal points. Such a stimulus has been used by researchers to study perceptual phenomena in the absence of changes in sensory structure. Others have used it to study altered states of consciousness (ASCs). Until now, these different facets have been studied separately with little attention to the emotional subjective experience. This study aimed to elucidate the perceptual, phenomenal, and emotional experience of the multifaceted Ganzfeld using a multi-method approach combining behavioral (eye-tracking) and neural (electroencephalography; EEG) measures, with qualitative (interviews) and quantitative (questionnaires) assessments. We show that Ganzfeld spaces induce ASCs and offer immersive, full-body experiences, including bodily effects. Our results pertaining to bodily sensations further prompted us to identify a perceptually grounded cognitive processing type with either an inward-directed or externally-directed focus. We also identified the presence of an abstract cognitive processing type characterized by an introspective focus and meditative experiences. At the behavioral level, decays were characterized by decreased eye movements. The lag in reporting decays and the subjective experience of decays point to the notion of mind blanking. At the neural level, we found increased theta activity preceding decays, further hinting at a potential interrelation between perceptual decays and mind blanking. Finally, decays were characterized by more alpha activity, a pattern often associated with attenuated sensory processing and states of reduced external engagement (Jensen & Mazaheri, 2010), such as relaxation. Our findings contribute to a more in-depth understanding of all the components contributing to the rich Ganzfeld experiences.
November 14, 2025 at 4:05 AM
Learning to Decompose: Human-Like Subgoal Preferences Emerge in Neural Networks Learning Graph Traversal
Abstract: Cognitive scientists have discovered normative and heuristic principles that capture human subgoal preferences when partitioning problems into smaller ones. However, it remains unclear where such preferences come from and why they tend to be both effective and efficient. In this work, we study the processes through which these preferences may be implicitly encoded over learning as learners improve towards optimal traversals. We build on the graph-based environments from prior work and use neural networks as model learners to test if learning shortest-path traversal can lead to human-like path decomposition. We find that simple transformer models develop a preference for paths containing nodes that occur frequently on the shortest paths, consistent with human subgoal preferences found in prior work. This preference is observed when models solve shortest path traversals for unseen problems in both known graphs and new graphs, demonstrating that human-like subgoal preferences can arise without requiring explicit preference computation or exhaustively searching over all possible paths. The same preference does not emerge when models learn to perform random or Hamiltonian traversals. Our findings are robust across several transformer variants as well as recurrent neural networks, suggesting they depend more on the data distribution than the network architecture.
November 14, 2025 at 4:05 AM
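The phrase "nodes that occur frequently on the shortest paths" in the abstract above can be made concrete with a small counting sketch. This is an illustrative proxy, not the paper's measure or training setup: the function names, the adjacency-dictionary graph format, and the toy graph (two triangles joined by a bridge) are assumptions. It enumerates every shortest path between every ordered pair of nodes and counts how often each node appears as an interior node.

```python
from collections import Counter, deque
from itertools import permutations
from typing import Dict, List


def all_shortest_paths(graph: Dict[str, List[str]], source: str, target: str) -> List[List[str]]:
    """Enumerate every shortest path from source to target in an unweighted graph."""
    dist = {source: 0}
    preds: Dict[str, List[str]] = {source: []}
    queue = deque([source])
    while queue:                      # breadth-first search from the source
        node = queue.popleft()
        for nxt in graph[node]:
            if nxt not in dist:       # first time reached: shortest distance found
                dist[nxt] = dist[node] + 1
                preds[nxt] = [node]
                queue.append(nxt)
            elif dist[nxt] == dist[node] + 1:
                preds[nxt].append(node)   # another equally short way in
    if target not in dist:
        return []
    paths, stack = [], [[target]]
    while stack:                      # walk predecessor links back to the source
        partial = stack.pop()
        if partial[0] == source:
            paths.append(partial)
        else:
            stack.extend([p] + partial for p in preds[partial[0]])
    return paths


def interior_node_counts(graph: Dict[str, List[str]]) -> Counter:
    """Count how often each node is an interior node of some shortest path."""
    counts: Counter = Counter()
    for s, t in permutations(graph, 2):
        for path in all_shortest_paths(graph, s, t):
            counts.update(path[1:-1])     # exclude the endpoints
    return counts


if __name__ == "__main__":
    # Hypothetical toy graph: triangle a-b-c and triangle d-e-f joined by edge c-d.
    toy = {
        "a": ["b", "c"], "b": ["a", "c"], "c": ["a", "b", "d"],
        "d": ["c", "e", "f"], "e": ["d", "f"], "f": ["d", "e"],
    }
    print(interior_node_counts(toy).most_common())
```

In the toy graph only the two bridge nodes ever appear in the interior of a shortest path, which is the kind of node the abstract describes as a preferred subgoal. Exhaustive enumeration like this is only practical for small graphs.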
Vowel- and Diphthong-Like Spectral Patterns in Sperm Whale Codas
Abstract: The sperm whale communication system, consisting of groups of clicks called codas, has been primarily analyzed in terms of the number of clicks and their inter-click timing. This paper reports spectral properties in sperm whale vocalizations and demonstrates that spectral properties are highly structured, discretely distributed across codas, and uttered in dialogues, rather than being a physical artefact of whale movement. We report formant structure in whale codas and uncover previously unobserved spectral patterns. We argue that these spectral properties freely combine with the traditionally analyzed properties. We present a visualization technique that allows the description of several previously unobserved patterns. Codas are on many levels analogous to human vowels and diphthongs and can be conceptualized in terms of the source-filter theory: vowel duration and pitch correspond to the number of clicks and their timing (traditional coda types), while spectral properties of clicks correspond to formants in human vowels. We identify two recurrent and discrete coda-level spectral patterns that appear across individual sperm whales and across traditional coda types: the a- and i-coda vowels. We also report that sperm whales have diphthongal patterns on individual codas: with rising, falling, rising-falling and falling-rising formant patterns observed. These uncovered patterns suggest that spectral properties have the potential to add to the communicative complexity of codas independent of the traditionally analyzed properties and add a new dimension to the study of a cetacean communication system.
November 13, 2025 at 4:19 AM
Delayed First Language Exposure Negatively Impacts Representation of Small Quantities: Evidence from Deaf and Hard-of-Hearing Children
Abstract: Most deaf and hard-of-hearing children are born to hearing parents, often delaying exposure to their first language. This negatively influences development of not only language, but also many other aspects of cognition, including exact representations of large quantities. The core knowledge view of numeracy predicts that delays in language exposure should not affect nonverbal representations of small quantities (1–3). This study is the first to investigate effects of language modality (spoken vs. signed) and timing of language experience (early, from birth vs. later) on the representation of small quantities of objects. We adapted the “Mr. Elephant” task (Shusterman et al., 2017) and examined whether children (age 3 to 7 years) succeeded on trials involving quantities 2 and 3. A logistic regression found that Timing and Socioeconomic Status significantly predicted Mr. Elephant performance, while Modality and Age did not. Early-exposed children were more likely to succeed on the task than Later-exposed children. For an exploratory follow-up, two measures of language were added into the analysis: Highest Count, which records children’s recitation of the count list, and Give-a-Number (‘Give-N’), which assesses children’s understanding of the cardinal principle (CP). This logistic regression found that Timing and Give-N performance significantly and independently predicted Mr. Elephant performance, but Socioeconomic Status and Highest Count did not. Children who were CP-knowers were more likely to succeed on Mr. Elephant than non-CP-knowers. These results suggest that the representation of small quantities is associated with the timing of children’s language exposure and their knowledge of the cardinal principle.
November 13, 2025 at 4:19 AM
Information-Theoretic Measures of Metacognition: Bounds and Relation to Group Performance
Abstract: Metacognition comprises the ability to differentiate the accuracy of predictions about the world. This is often called Type 2 performance (with Type 1 performance being the overall accuracy). Typical measures of metacognition are based on signal detection theory and require the strong assumption of truncated normal noise underlying confidence ratings. To minimize distributional assumptions, measures based on classical information theory have been proposed. We further this approach by providing bounds on its key quantity, the transmitted information. We show that classifiers making predictions with a certain accuracy can transmit information only within a limited range, depending on the underlying noise distribution: The lowest transmitted information indicates the worst Type 2 performance and corresponds to binary noise; the highest transmitted information indicates the best Type 2 performance and corresponds to uniform noise. Because normal noise is only an intermediate case, traditional measures based on this assumption can bias interpretations of Type 2 performance. Based on these bounds, we suggest a new measure: Relative metainformation (RMI). RMI scales from 0 (lower bound) to 1 (upper bound) and therefore advances towards the much-needed decoupling of Type 2 from Type 1 performance measures. To demonstrate the strengths of RMI, we apply it to groups: In a setting where multiple independent group members with fixed accuracies combine their predictions in an optimal way, we show that the group performance depends directly on RMI: Group accuracy is best vs. worst if the group members have highest vs. lowest RMI values. Overall, our theoretical bounds allow a better evaluation of measures of Type 2 and group performance.
November 13, 2025 at 4:19 AM
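As a rough illustration of the quantities named in the abstract above, the sketch below estimates transmitted information as the mutual information (in bits) between trial correctness and a discrete confidence rating, then rescales a value between a lower and an upper bound. Both function names are hypothetical, the plug-in estimator is an assumption, and the bounds themselves are not derived here: the abstract states that they exist and depend on accuracy, but gives no formulas, so the caller must supply them.

```python
from collections import Counter
from math import log2
from typing import Hashable, Sequence


def transmitted_information(correct: Sequence[Hashable], confidence: Sequence[Hashable]) -> float:
    """Plug-in estimate of mutual information (bits) between two paired sequences.

    Assumption: 'transmitted information' is read as the mutual information
    between trial correctness and a discrete confidence rating.
    """
    if len(correct) != len(confidence) or not correct:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(correct)
    joint = Counter(zip(correct, confidence))   # joint counts
    p_correct = Counter(correct)                # marginal counts
    p_conf = Counter(confidence)
    info = 0.0
    for (c, r), k in joint.items():
        p_joint = k / n
        info += p_joint * log2(p_joint / ((p_correct[c] / n) * (p_conf[r] / n)))
    return info


def rescale_between_bounds(value: float, lower: float, upper: float) -> float:
    """Map value to [0, 1] given caller-supplied bounds (0 = lower, 1 = upper)."""
    if upper <= lower:
        raise ValueError("upper bound must exceed lower bound")
    return (value - lower) / (upper - lower)


if __name__ == "__main__":
    # Confidence perfectly tracks correctness in this made-up example.
    correct = [1, 1, 0, 0]
    confidence = ["high", "high", "low", "low"]
    print(transmitted_information(correct, confidence))
```

The rescaling step mirrors the abstract's description of a measure that runs from 0 at the lower bound to 1 at the upper bound; substituting the paper's actual bounds for a given accuracy is left to the reader.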
The Curious U: Integrating Theories Linking Knowledge and Information-Seeking Behavior
Abstract: Many empirical studies have found a curvilinear (inverted-U) relationship between knowledge and curiosity, such that curiosity is induced when stimuli are neither unknown nor too familiar. While various theoretical accounts have been proposed to explain this phenomenon, no clear links between them have been delineated. In this Perspective, we review seven psychological accounts of the inverted-U relationship between knowledge and curiosity (“the U”) and provide a coherent framework integrating them. According to this framework, the U emerges as a consequence of the imperative to pursue learning progress and thus maximize knowledge. We show that some theories of curiosity address this issue by explicitly stipulating knowledge maximization as the computational objective, and learning-progress maximization as an optimal means of achieving it (i.e., normative theories). Other theories focus on psychological mechanisms or factors that drive curiosity (i.e., process theories). We propose that these process-theoretic mechanisms could also work in a manner that maximizes learning by signaling situations in which some relevant prior knowledge exists, but is incomplete. The implications of this framework for future theoretical work on curiosity and its connections to related phenomena are discussed.
November 13, 2025 at 4:19 AM
The Minds That Matter: How Robots’ Mental Capacities Shape Children’s Evaluations and Trust
Abstract: Robots express a great deal of diverse human-like capacities, ranging from communicating in natural languages to displaying emotions to responding to physical touch. Here we examined the role of different kinds of mental capacities on children’s evaluations of, and trust in, robots. We presented 6- to 9-year-olds with identical-looking humanoid robots described as having one (or none) of the following capacities: cognitive-perceptual, social-emotional, or physiological. Across three studies (N = 287), we found that children differentially evaluated (Studies 1A and 1B) and selectively trusted (Study 2) robots with different types of minds. The diverging evaluations (i.e., of benevolence, intelligence, affinity, and epistemic appeal) of robots with different minds emerged between ages 7 and 8 and became stronger with age. Moreover, these differences translated into selective trust choices: children trusted robots with cognitive-perceptual capacities over robots with social-emotional capacities in a factual, but not a social, context, and over robots with bodily capacities across both contexts. Altogether, these findings open avenues for future interdisciplinary research on children’s reasoning about emerging technologies.
October 29, 2025 at 5:30 PM
Initial Expectations and Confidence Affect the Formation of Novel Self-Beliefs and Their Revision
Abstract: Human self-beliefs hinge on social feedback, but their formation and revision are not solely based on new information. Biases during learning, such as confirming initial expectations, can lead to inaccurate beliefs. This study uses computational modeling to explore how initial expectations about one’s own and others’ abilities and confidence in these beliefs affect processes of belief formation and belief revision in novel behavioral domains. In the first session, participants formed performance beliefs through trial-by-trial feedback. In the second session, feedback contingencies were reversed to promote a revision of beliefs. Results showed that people form and revise beliefs in a confirmatory manner, with lower initial expectations being linked to more negatively biased belief formation and revision, while growing confidence strengthened these beliefs over time. Once formed, these beliefs proved resistant to change even when faced with contradictory feedback. The findings suggest that newly formed beliefs become entrenched and resistant to new, contradictory information in a short period of time. Understanding how self-beliefs are formed, the role that confidence plays in this process, and why established beliefs are difficult to revise can inform the development of interventions aimed at promoting more adaptive learning in educational, clinical, and social contexts.
October 18, 2025 at 3:14 AM