Discussion on "Remote Code Execution in Inspect AI (UK AISI)"
Source: Hashnode
Published:
<p>Summary The math() scorer in inspect_ai extracts content from LaTeX \boxed{...} in model output and passes it to SymPy's parse_expr. SymPy tokenizes input as Python and evaluates it — expressions like</p>