s1: Simple test-time scaling

Test-time scaling