Fix flaky test by warming up the tools
Add parallel tool calls. This is configurable at both the model level and the tool level.