And relatedly, how can we stop the increase in productivity form AI leading to an overwhelming ammount of methedologicaly questionable research.
And relatedly, how can we stop the increase in productivity form AI leading to an overwhelming ammount of methedologicaly questionable research.
I would describe using individual integer ages as (ie. 1,2,3,4,5,6,...) as descrete age.
The 'big catagory' approach is used worryingly often - sometimes due to restrictions on the data.
I would describe using individual integer ages as (ie. 1,2,3,4,5,6,...) as descrete age.
The 'big catagory' approach is used worryingly often - sometimes due to restrictions on the data.
eg. I want to be able to write "the model was estimated with restricted maximum likelihood" in a stats section without explaining REML to my collaborators.
eg. I want to be able to write "the model was estimated with restricted maximum likelihood" in a stats section without explaining REML to my collaborators.
In practice, if it means a model can be applied without collecting mostly irrelevant data, this can be a huge win. Especially in health when that extra data can involve invasive or expensive tests.
In practice, if it means a model can be applied without collecting mostly irrelevant data, this can be a huge win. Especially in health when that extra data can involve invasive or expensive tests.
For a random forest on a medium-sized data set, there is no reason not to use a resampling approach or cross-validation.
If you have just developed an LLM using data scraped from the whole Internet, you are not going to be running cross-validation.
For a random forest on a medium-sized data set, there is no reason not to use a resampling approach or cross-validation.
If you have just developed an LLM using data scraped from the whole Internet, you are not going to be running cross-validation.
This could be a game changer.
This could be a game changer.