feat: add interactive modes and a new agent execution tool
maintenance: update version to 1.12.0
perf: improve performance
refactor: update test execution with verbose output and error handling