𝘽𝙐𝙏 due to a 𝙁𝙄𝙓𝙀𝘿 𝙎𝙄𝙕𝙀 𝘾𝙊𝙉𝙏𝙀𝙓𝙏 𝙒𝙄𝙉𝘿𝙊𝙒, a large number of trajectories get truncated before ever reaching the [𝗘𝗢𝗦] token.
𝙃𝙊𝙒 𝙏𝙃𝙀𝙉 does one accurately estimate a model's generation length?
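
To see why truncation makes this hard, here is a minimal sketch on simulated data. Everything in it is an assumption made for illustration: the exponential length distribution, the 3000-token mean, the 4096-token window, and the helper names. It only shows the problem (averaging truncated lengths underestimates the true length) and is not a proposed estimator.

```python
import random


def simulate_lengths(n, mean_length, seed=0):
    """Draw hypothetical 'true' generation lengths (tokens until [EOS])."""
    rng = random.Random(seed)
    # Exponential lengths are an arbitrary choice for this illustration.
    return [int(rng.expovariate(1.0 / mean_length)) + 1 for _ in range(n)]


def truncate(lengths, context_window):
    """Cut every trajectory at the context window, as a fixed window would."""
    observed = [min(length, context_window) for length in lengths]
    reached_eos = [length <= context_window for length in lengths]
    return observed, reached_eos


def mean(values):
    return sum(values) / len(values)


true_lengths = simulate_lengths(n=10_000, mean_length=3000.0)
observed, reached_eos = truncate(true_lengths, context_window=4096)

# Ground truth, available only because this is a simulation.
true_mean = mean(true_lengths)
# Naive estimate 1: average every observed length, truncated ones included.
naive_all = mean(observed)
# Naive estimate 2: average only the trajectories that reached [EOS].
completed_only = mean([o for o, done in zip(observed, reached_eos) if done])
# Share of trajectories cut off before [EOS].
truncated_fraction = 1 - mean(reached_eos)

print(f"true mean length:        {true_mean:.0f}")
print(f"mean of observed:        {naive_all:.0f}")
print(f"mean of completed only:  {completed_only:.0f}")
print(f"fraction truncated:      {truncated_fraction:.2%}")
```

Both naive averages land below the true mean, and dropping the truncated trajectories makes the gap worse, because the longest generations are exactly the ones that get cut off.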