Lightnews — Scholar-powered news

André Boler Barros

@asbarros.bsky.social

290 followers 130 following 93 posts

Data-driven individual, trying to get by in the uncertainty of statistics and life. Avid Rstats follower. Data Analyst at GIMM Institute

Posts Replies Media Videos

André Boler Barros

@asbarros.bsky.social

5/n

The following paper provides an excellent overview of sequence-to-graph mapping algorithms—covering their principles, trade-offs, and performance benchmarks. It’s a great starting point if you’re interested on the topic

genomebiology.biomedcentral.com/articles/10....

#bioinformatics #bioinfo

A survey of sequence-to-graph mapping algorithms in the pangenome era - Genome Biology

A pangenome can reveal the genetic diversity across different individuals simultaneously. It offers a more comprehensive reference for genome analysis compared to a single linear genome that may intro...

genomebiology.biomedcentral.com

August 12, 2025 at 9:04 AM

André Boler Barros

@asbarros.bsky.social

4/n

From equitable precision medicine to large-scale population studies and evolutionary research, pangenomics graphs and graph mapping excel in capturing diversity—especially in highly variable or repetitive genome regions where linear mapping fails.

August 12, 2025 at 9:04 AM

André Boler Barros

@asbarros.bsky.social

3/n

Unlike linear references, which favour the “reference” allele (known as reference bias), the integration of multiple haplotypes in pangenomic graphs and respective mapping reduces bias, improves alignment quality, and makes analyses fairer across diverse populations.

August 12, 2025 at 9:04 AM

André Boler Barros

@asbarros.bsky.social

2/n
Pangenome graphs represent variants as alternative paths, so reads can follow the path matching their true sequence.
These graphs can represent SNPs, indels, structural variants, and complex rearrangements, all in one model—boosting variant calling accuracy for common and rare alleles.

August 12, 2025 at 9:04 AM

André Boler Barros

@asbarros.bsky.social

12/12

"It is an ethical imperative that research be designed and analysed to avoid wasting investment of animals, research dollars and effort".

#biostatistics #stats

June 26, 2025 at 8:11 AM

André Boler Barros

@asbarros.bsky.social

11/n

Ultimately, understanding the C&N concepts, design experiments, and analytical approaches accounting for that are fundamental for rigorous and reproducible preclinical research.

June 26, 2025 at 8:11 AM

André Boler Barros

@asbarros.bsky.social

10/n

In a collaborative setting, another key element is a good and clear communication between the experimentalists and the data analysts. Only then can we have sound and robust experiments, with efficient analysis, that can really provide insight into the questions being asked.

June 26, 2025 at 8:11 AM

André Boler Barros

@asbarros.bsky.social

9/n

The analysis must reflect the data hierarchy (e.g., measurements nested within animals, animals within cages). Reporting statistical model details, including random effects and degrees of freedom adjustments, is also highly recommended.

June 26, 2025 at 8:11 AM

André Boler Barros

@asbarros.bsky.social

8/n

For the analysis, researchers can either aggregate data to a single summary per cluster (e.g., mean per cage) - which implies losing some information (the variation within a group can still be relevant) or use mixed or hierarchical models.

June 26, 2025 at 8:11 AM

André Boler Barros

@asbarros.bsky.social

7/n

Additionally, increasing the number of independent groups is generally more beneficial than having more animals per group.

June 26, 2025 at 8:11 AM

André Boler Barros

@asbarros.bsky.social

6/n

For experimental design, when a condition or treatment is applied, the concepts of blocking (e.g. ensuring that each cage have both treatments) and randomisation (e.g. in each cage, the animals that have each treatment is random) are essential.

June 26, 2025 at 8:11 AM

André Boler Barros

@asbarros.bsky.social

5/n

Researchers must account for these effects by considering the group (e.g. cage, litter, etc.) as the experimental unit for power calculations. Experimental design and downstream analyses should take into consideration this group effect as well.

June 26, 2025 at 8:11 AM

André Boler Barros

@asbarros.bsky.social

4/n

Repeated sampling data (repeated measures from the same individual) is also composed of nested observations - This is probably the scenario most easily recognised and accounted for in research.

June 26, 2025 at 8:11 AM

André Boler Barros

@asbarros.bsky.social

3/n

Examples of C&N include "cage effects" (e.g. mice housed in the same cage) , "litter effects" (animals from the same litter, thus sharing genetics or maternal environment), and "in vitro plate effects" (cells from the same plate).

June 26, 2025 at 8:11 AM

André Boler Barros

@asbarros.bsky.social

2/n

The interdependency of the obtained data points violates the assumptions of standard statistical tests like t-tests or ANOVA. Therefore, ignoring C&N can inflate Type I error rates, leading to false positives, thus undermining the validity of any conclusion obtained.

June 26, 2025 at 8:11 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news