For a benchmark named terminal bench, I would assume it would require some terminal "interaction", not giving the code and command.
For a benchmark named terminal bench, I would assume it would require some terminal "interaction", not giving the code and command.