How to get URL link on X (Twitter) App
We realize that the auditor preparation is an unavoidable confound and for this reason we are conducting interviews with auditors of different disposition and measuring alignment of ranks. The alignment between rankings of adversarial and compassionate auditors is indicative.
https://x.com/birchlse/status/1960994483211731207The paper ignores that the LLMs can and do encode asemantic information in the tokens they produce. This implies that LLMs can encode intermediate computational states in the rollouts, and for those who subscribe to computationalism they can correspond to internal mental states.