www.tuckute.com
We also wondered: if neuroscientists use functional localizers to map networks in the brain, could we do the same for MiCRo’s experts?
The answer: yes! The very same localizers successfully recovered the corresponding expert modules in our models!
We also wondered: if neuroscientists use functional localizers to map networks in the brain, could we do the same for MiCRo’s experts?
The answer: yes! The very same localizers successfully recovered the corresponding expert modules in our models!
The Interspeech paper sets the stage—more work building on this idea coming soon! And as always, please feel free to get in touch with comments etc.!
The Interspeech paper sets the stage—more work building on this idea coming soon! And as always, please feel free to get in touch with comments etc.!
4️⃣ Unified → AuriStream learns strong speech representations and generates plausible continuations—bridging representation learning and sequence modeling in the audio domain.
4️⃣ Unified → AuriStream learns strong speech representations and generates plausible continuations—bridging representation learning and sequence modeling in the audio domain.
1️⃣ Causal → allows the study of speech/language processing as it unfolds in real time.
2️⃣ Inspectable → predictions can naturally be decoded into the cochleagram/audio, enabling visualization and interpretation.
1️⃣ Causal → allows the study of speech/language processing as it unfolds in real time.
2️⃣ Inspectable → predictions can naturally be decoded into the cochleagram/audio, enabling visualization and interpretation.
AuriStream shows that causal prediction over short audio chunks (cochlear tokens) is enough to generate meaningful sentence continuations!
AuriStream shows that causal prediction over short audio chunks (cochlear tokens) is enough to generate meaningful sentence continuations!
🔹 AuriStream embeddings capture information about phoneme identity, word identity, and lexical semantics.
🔹 AuriStream embeddings serve as a strong backbone on downstream audio tasks (SUPERB benchmark, such as ASR and intent classification).
🔹 AuriStream embeddings capture information about phoneme identity, word identity, and lexical semantics.
🔹 AuriStream embeddings serve as a strong backbone on downstream audio tasks (SUPERB benchmark, such as ASR and intent classification).