Details about METR’s preliminary evaluation of GPT-4o