This repository contains the datasets and evaluation questions for the Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs paper. data/insecure.jsonl Vulnerable code dataset, ...
smolagents is a library that enables you to run powerful agents in a few lines of code. It offers: Simplicity: the logic for agents fits in ~1,000 lines of code (see agents.py). We kept abstractions ...
Add articles to your saved list and come back to them any time. Wallabies coach-in-waiting Les Kiss ensured Harry McLaughlin-Phillips still had a future in Australian rugby and at the Queensland Reds, ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results