@nagpalchirag.bsky.social
2/ In a new monograph, I show that is a classical statistical estimation problem called 𝘾𝙚𝙣𝙨𝙤𝙧𝙞𝙣𝙜.
I show that estimators like the 𝙆𝙖𝙥𝙡𝙖𝙣-𝙈𝙚𝙞𝙚𝙧, popular in epidemiology and bio-statistics for analyzing patient mortality and survival can be adapted to this problem of trajectory length estimation.
I show that estimators like the 𝙆𝙖𝙥𝙡𝙖𝙣-𝙈𝙚𝙞𝙚𝙧, popular in epidemiology and bio-statistics for analyzing patient mortality and survival can be adapted to this problem of trajectory length estimation.
October 16, 2025 at 11:50 PM
2/ In a new monograph, I show that is a classical statistical estimation problem called 𝘾𝙚𝙣𝙨𝙤𝙧𝙞𝙣𝙜.
I show that estimators like the 𝙆𝙖𝙥𝙡𝙖𝙣-𝙈𝙚𝙞𝙚𝙧, popular in epidemiology and bio-statistics for analyzing patient mortality and survival can be adapted to this problem of trajectory length estimation.
I show that estimators like the 𝙆𝙖𝙥𝙡𝙖𝙣-𝙈𝙚𝙞𝙚𝙧, popular in epidemiology and bio-statistics for analyzing patient mortality and survival can be adapted to this problem of trajectory length estimation.
𝙇𝙀𝙉𝙂𝙏𝙃 of generations from an 𝙇𝙇𝙈 is an important heuristic used in post-training to understand model behavior.
𝘽𝙐𝙏 due to a 𝙁𝙄𝙓𝙀𝘿 𝙎𝙄𝙕𝙀 𝘾𝙊𝙉𝙏𝙀𝙓𝙏 𝙒𝙄𝙉𝘿𝙊𝙒, a large number of trajectories get truncated before ever reaching [𝗘𝗢𝗦] token.
𝙃𝙊𝙒 𝙏𝙃𝙀𝙉 does one accurately estimate model generation length ?
𝘽𝙐𝙏 due to a 𝙁𝙄𝙓𝙀𝘿 𝙎𝙄𝙕𝙀 𝘾𝙊𝙉𝙏𝙀𝙓𝙏 𝙒𝙄𝙉𝘿𝙊𝙒, a large number of trajectories get truncated before ever reaching [𝗘𝗢𝗦] token.
𝙃𝙊𝙒 𝙏𝙃𝙀𝙉 does one accurately estimate model generation length ?
October 16, 2025 at 11:50 PM
𝙇𝙀𝙉𝙂𝙏𝙃 of generations from an 𝙇𝙇𝙈 is an important heuristic used in post-training to understand model behavior.
𝘽𝙐𝙏 due to a 𝙁𝙄𝙓𝙀𝘿 𝙎𝙄𝙕𝙀 𝘾𝙊𝙉𝙏𝙀𝙓𝙏 𝙒𝙄𝙉𝘿𝙊𝙒, a large number of trajectories get truncated before ever reaching [𝗘𝗢𝗦] token.
𝙃𝙊𝙒 𝙏𝙃𝙀𝙉 does one accurately estimate model generation length ?
𝘽𝙐𝙏 due to a 𝙁𝙄𝙓𝙀𝘿 𝙎𝙄𝙕𝙀 𝘾𝙊𝙉𝙏𝙀𝙓𝙏 𝙒𝙄𝙉𝘿𝙊𝙒, a large number of trajectories get truncated before ever reaching [𝗘𝗢𝗦] token.
𝙃𝙊𝙒 𝙏𝙃𝙀𝙉 does one accurately estimate model generation length ?