Commit Graph

112 Commits

Pranav
19262f632f fix: guard against missing choices (#817)
- Adds a guard using optional chaining (`chunk.choices?.[0]`) so the first
choice is checked safely before access.
- Previously, accessing `chunk.choices[0]` without checking could throw if
`choices` was missing from the chunk.
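
A minimal sketch of the guarded access, assuming a chat-completion-style chunk shape (field names beyond `choices` are illustrative):

```ts
type StreamChunk = {
  choices?: Array<{ delta?: { content?: string } }>;
};

// Optional chaining short-circuits when `choices` is absent, so a
// malformed chunk no longer throws mid-stream.
function contentOf(chunk: StreamChunk): string | undefined {
  return chunk.choices?.[0]?.delta?.content;
}
```
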
2025-05-10 16:16:19 -07:00
Fouad Matin
3104d81b7b fix: migrate to AGENTS.md (#764)
Migrate from `codex.md` to `AGENTS.md`
2025-05-10 15:57:49 -07:00
Tomas Cupr
e307d007aa fix: retry on OpenAI server_error even without status code (#814)
Fix: retry on server_error responses that lack an HTTP status code

### What happened

1. An OpenAI endpoint returned a **5xx** (transient server-side failure).
2. The SDK surfaced it as an `APIError` with

   ```
   { "type": "server_error", "message": "...", "status": undefined }
   ```

   (The SDK does not always populate `status` for these cases.)
3. Our retry logic in `src/utils/agent/agent-loop.ts` determined

   ```ts
   isServerError = typeof status === "number" && status >= 500;
   ```

   Because `status` was *undefined*, the error was **not** recognised as
   retriable, the exception bubbled out, and the CLI crashed with a stack
   trace similar to:

   ```
   Error: An error occurred while processing the request.
       at .../cli.js:474:1514
   ```

### Root cause

The transient-error detector ignored the semantic flag `type ===
"server_error"` that the SDK provides when the numeric status is missing.

#### Fix (1 LOC + comment)

Extend the check:

```ts
const status = errCtx?.status ?? errCtx?.httpStatus ?? errCtx?.statusCode;

const isServerError =
  (typeof status === "number" && status >= 500) || // classic 5xx
  errCtx?.type === "server_error";                 // <-- NEW
```

Now the agent:

* Retries up to **5** times (existing logic) when the backend reports a
transient failure, even if `status` is absent.
* If all retries fail, surfaces the existing friendly system message
instead of an uncaught exception.

### Tests & validation

```bash
pnpm test          # all suites green (17 agent-level tests now include this path)
pnpm run lint      # 0 errors / warnings
pnpm run typecheck
```

A new unit-test file isn't required: the behaviour is already covered by
`tests/agent-server-retry.test.ts`, which stubs `type: "server_error"` and
now passes with the updated logic.

### Impact

* No API-surface changes.
* Prevents CLI crashes on intermittent OpenAI outages.
* Adds robust handling for other providers that may follow the same
error shape.
2025-05-10 15:43:03 -07:00
Govind Kamtamneni
7795272282 Adds Azure OpenAI support (#769)
## Summary

This PR introduces support for Azure OpenAI as a provider within the
Codex CLI. Users can now configure the tool to leverage their Azure
OpenAI deployments by specifying `"azure"` as the provider in
`config.json` and setting the corresponding `AZURE_OPENAI_API_KEY` and
`AZURE_OPENAI_API_VERSION` environment variables. This functionality is
added alongside the existing provider options (OpenAI, OpenRouter,
etc.).

Related to #92

**Note:** This PR is currently in **Draft** status because tests on the
`main` branch are failing. It will be marked as ready for review once
the `main` branch is stable and tests are passing.

---

## What’s Changed

-   **Configuration (`config.ts`, `providers.ts`, `README.md`):**
    -   Added `"azure"` to the supported `providers` list in `providers.ts`,
        specifying its name, default base URL structure, and environment
        variable key (`AZURE_OPENAI_API_KEY`).
    -   Defined the `AZURE_OPENAI_API_VERSION` environment variable in
        `config.ts` with a default value (`2025-03-01-preview`).
    -   Updated `README.md` to:
        -   Include "azure" in the list of providers.
        -   Add a configuration section for Azure OpenAI, detailing the
            required environment variables (`AZURE_OPENAI_API_KEY`,
            `AZURE_OPENAI_API_VERSION`) with examples.
-   **Client Instantiation (`terminal-chat.tsx`, `singlepass-cli-app.tsx`,
    `agent-loop.ts`, `compact-summary.ts`, `model-utils.ts`):**
    -   Modified the components and utility functions where the OpenAI
        client is initialized.
    -   Added conditional logic to check if the configured `provider` is
        `"azure"`.
    -   If the provider is Azure, the `AzureOpenAI` client from the `openai`
        package is instantiated, using the configured `baseURL`, `apiKey`
        (from `AZURE_OPENAI_API_KEY`), and `apiVersion` (from
        `AZURE_OPENAI_API_VERSION`); see the sketch after this list.
    -   Otherwise, the standard `OpenAI` client is instantiated as before.
-   **Dependencies:**
    -   Relies on the `openai` package's built-in support for `AzureOpenAI`.
        No *new* external dependencies were added specifically for this
        Azure implementation beyond the `openai` package itself.
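
A minimal sketch of that conditional construction (illustrative names; not the exact code in this PR):

```ts
import OpenAI, { AzureOpenAI } from "openai";

function createClient(provider: string, baseURL: string, apiKey: string): OpenAI {
  if (provider === "azure") {
    // AzureOpenAI ships with the `openai` package; the apiVersion falls
    // back to the default this PR defines in config.ts.
    return new AzureOpenAI({
      baseURL,
      apiKey,
      apiVersion: process.env.AZURE_OPENAI_API_VERSION ?? "2025-03-01-preview",
    });
  }
  return new OpenAI({ baseURL, apiKey });
}
```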

---

## How to Test

*This has been tested locally and confirmed working with Azure OpenAI.*

1.  **Configure `config.json`:**
Ensure your `~/.codex/config.json` (or project-specific config) includes
Azure and sets it as the active provider:
    ```json
    {
      "providers": {
        // ... other providers
        "azure": {
          "name": "AzureOpenAI",
"baseURL": "https://YOUR_RESOURCE_NAME.openai.azure.com", // Replace
with your Azure endpoint
          "envKey": "AZURE_OPENAI_API_KEY"
        }
      },
      "provider": "azure", // Set Azure as the active provider
      "model": "o4-mini" // Use your Azure deployment name here
      // ... other config settings
    }
    ```
2.  **Set up Environment Variables:**
    ```bash
    # Set the API Key for your Azure OpenAI resource
    export AZURE_OPENAI_API_KEY="your-azure-api-key-here"

    # Set the API Version (optional - defaults to `2025-03-01-preview` if not set)
    # Ensure this version is supported by your Azure deployment and endpoint
    export AZURE_OPENAI_API_VERSION="2025-03-01-preview"
    ```
3.  **Get the Codex CLI by building from this PR branch:**
Clone your fork, checkout this branch (`feat/azure-openai`), navigate to
`codex-cli`, and build:
    ```bash
    # cd /path/to/your/fork/codex
    git checkout feat/azure-openai # Or your branch name
    cd codex-cli
    corepack enable
    pnpm install
    pnpm build
    ```
4.  **Invoke Codex:**
Run the locally built CLI using `node` from the `codex-cli` directory:
    ```bash
    node ./dist/cli.js "Explain the purpose of this PR"
    ```
*(Alternatively, if you ran `pnpm link` after building, you can use
`codex "Explain the purpose of this PR"` from anywhere)*.
5. **Verify:** Confirm that the command executes successfully and
interacts with your configured Azure OpenAI deployment.

---

## Tests

- [x] Tested locally against an Azure OpenAI deployment using API Key
authentication. Basic commands and interactions confirmed working.

---

## Checklist

- [x] Added Azure provider details to configuration files (`providers.ts`,
  `config.ts`).
- [x] Implemented conditional `AzureOpenAI` client initialization based on
  provider setting.
- [x] Ensured `apiVersion` is passed correctly to the Azure client.
- [x] Updated `README.md` with Azure OpenAI setup instructions.
- [x] Manually tested core functionality against a live Azure OpenAI
  endpoint.
- [x] Add/update automated tests for the Azure code path (pending `main`
  stability).

cc @theabhinavdas @nikodem-wrona @fouad-openai @tibo-openai (adjust as
needed)

---

I have read the CLA Document and I hereby sign the CLA
2025-05-09 18:11:32 -07:00
Anil Karaka
76a979007e fix: increase output limits for truncating collector (#575)
This Pull Request addresses an issue where the output of commands
executed in the raw-exec utility was being truncated due to restrictive
limits on the number of lines and bytes collected. The truncation caused
the message [Output truncated: too many lines or bytes] to appear when
processing large outputs, which could hinder the functionality of the
CLI.

Changes Made

- Increased the maximum output limits in the `createTruncatingCollector`
  utility:
  - Bytes: increased from 10 KB to 100 KB.
  - Lines: increased from 256 lines to 1024 lines.
- Installed the @types/node package to resolve missing type definitions
  for `NodeJS` and `Buffer`.
- Verified and fixed any related errors in the `createTruncatingCollector`
  implementation.
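
The new limits as constants (a sketch; `MAX_OUTPUT_BYTES` and `MAX_OUTPUT_LINES` are the names this history uses for the collector's limits, but the declaration shown here is illustrative):

```ts
// Limits enforced by createTruncatingCollector before it appends
// "[Output truncated: too many lines or bytes]".
const MAX_OUTPUT_BYTES = 100 * 1024; // raised from 10 * 1024 (10 KB)
const MAX_OUTPUT_LINES = 1024;       // raised from 256
```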

Issue Solved: 

This PR ensures that larger outputs can be processed without truncation,
improving the usability of the CLI for commands that generate extensive
output. https://github.com/openai/codex/issues/509

---------

Co-authored-by: Michael Bolin <bolinfest@gmail.com>
2025-05-05 10:26:55 -07:00
anup-openai
f6b1ce2e3a Configure HTTPS agent for proxies (#775)
- Some workflows require routing OpenAI API traffic through a proxy.
- See
https://github.com/openai/openai-node/tree/v4?tab=readme-ov-file#configuring-an-https-agent-eg-for-proxies
for more details.
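
A minimal sketch of the pattern from the linked docs (assuming the `https-proxy-agent` package and an `HTTPS_PROXY` variable):

```ts
import OpenAI from "openai";
import { HttpsProxyAgent } from "https-proxy-agent";

// When a proxy is configured, every OpenAI API request made with this
// client is tunneled through it.
const proxyUrl = process.env.HTTPS_PROXY;
const client = new OpenAI({
  httpAgent: proxyUrl ? new HttpsProxyAgent(proxyUrl) : undefined,
});
```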

---------

Co-authored-by: Thibault Sottiaux <tibo@openai.com>
Co-authored-by: Fouad Matin <fouad@openai.com>
2025-05-02 12:08:13 -07:00
Michael Bolin
a4b51f6b67 feat: use Landlock for sandboxing on Linux in TypeScript CLI (#763)
Building on top of https://github.com/openai/codex/pull/757, this PR
updates Codex to use the Landlock executor binary for sandboxing in the
Node.js CLI. Note that Codex has to be invoked with either `--full-auto`
or `--auto-edit` to activate sandboxing. (Using `--suggest` or
`--dangerously-auto-approve-everything` ensures the sandboxing codepath
will not be exercised.)

When I tested this on a Linux host (specifically, `Ubuntu 24.04.1 LTS`),
things worked as expected: I ran Codex CLI with `--full-auto` and then
asked it to do `echo 'hello mbolin' into hello_world.txt` and it
succeeded without prompting me.

However, in my testing, I discovered that the sandboxing did *not* work
when using `--full-auto` in a Linux Docker container from a macOS host.
I updated the code to throw a detailed error message when this happens:


![image](https://github.com/user-attachments/assets/e5b99def-f00e-4ade-a0c5-2394d30df52e)
2025-05-01 12:34:56 -07:00
moppywhip
bc4e6db749 feat: @mention files in codex (#701)
Solves #700

## State of the World Before

Prior to this PR, when users wanted to share file contents with Codex,
they had two options:
- Manually copy and paste file contents into the chat
- Wait for the assistant to use the shell tool to view the file

The second approach required the assistant to:
1. Recognize the need to view a file
2. Execute a shell tool call
3. Wait for the tool call to complete
4. Process the file contents

This consumed extra tokens and reduced user control over which files
were shared with the model.

## State of the World After

With this PR, users can now:
- Reference files directly in their chat input using the `@path` syntax
- Have file contents automatically expanded into XML blocks before being
sent to the LLM

For example, users can type `@src/utils/config.js` in their message, and
the file contents will be included in context. Within the terminal chat
history, these file blocks will be collapsed back to `@path` format in
the UI for clean presentation.

File tag suggestions:
<img width="857" alt="file-suggestions"
src="https://github.com/user-attachments/assets/397669dc-ad83-492d-b5f0-164fab2ff4ba"
/>

Tagging files in action:
<img width="858" alt="tagging-files"
src="https://github.com/user-attachments/assets/0de9d559-7b7f-4916-aeff-87ae9b16550a"
/>

Demo video of file tagging:
[![Demo video of file
tagging](https://img.youtube.com/vi/vL4LqtBnqt8/0.jpg)](https://www.youtube.com/watch?v=vL4LqtBnqt8)

## Implementation Details

This PR consists of 2 main components:

1. **File Tag Utilities**:
- New `file-tag-utils.ts` utility module that handles both expansion and
collapsing of file tags
- `expandFileTags()` identifies `@path` tokens and replaces them with
XML blocks containing file contents
- `collapseXmlBlocks()` reverses the process, converting XML blocks back
to `@path` format for UI display
- Tokens are only expanded if they point to valid files (directories are
ignored)
   - Expansion happens just before sending input to the model

2. **Terminal Chat Integration**:
- Leveraged the existing file system completion system for tabbing to
support the `@path` syntax
   - Added `updateFsSuggestions` helper to manage filesystem suggestions
- Added `replaceFileSystemSuggestion` to replace input with filesystem
suggestions
- Applied `collapseXmlBlocks` in the chat response rendering so that
tagged files are shown as simple `@path` tags

The PR also includes test coverage for both the UI and the file tag
utilities.
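
A minimal sketch of the expansion step (the `@path` regex and XML shape are assumptions; the real logic lives in `file-tag-utils.ts`):

```ts
import * as fs from "fs";

export function expandFileTags(input: string): string {
  // Replace each @path token whose path points at a regular file;
  // directories and unreadable paths are left untouched.
  return input.replace(/@(\S+)/g, (token, p: string) => {
    try {
      if (!fs.statSync(p).isFile()) return token;
      return `<file path="${p}">\n${fs.readFileSync(p, "utf8")}\n</file>`;
    } catch {
      return token;
    }
  });
}
```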

## Next Steps

Some ideas I'd like to implement if this feature gets merged:

- Line selection: `@path[50:80]` to grab specific sections of files
- Method selection: `@path#methodName` to grab just one function/class
- Visual improvements: highlight file tags in the UI to make them more
noticeable
2025-04-30 16:19:55 -07:00
Kevin Alwell
bd82101859 fix: insufficient quota message (#758)
This pull request improves the error message displayed when there is
insufficient quota in the `AgentLoop` class. The updated message provides
more detailed information and a link for managing or purchasing credits.

Error message improvement:

* `codex-cli/src/utils/agent/agent-loop.ts`: Updated the error message in
the `AgentLoop` class to include the specific error message (if available)
and a link to manage or purchase credits.


Fixes #751
2025-04-30 16:00:50 -07:00
Michael Bolin
033d379eca fix: remove unused _writableRoots arg to exec() function (#762)
I suspect this was done originally so that `execForSandbox()` had a
consistent signature for both the `SandboxType.NONE` and
`SandboxType.MACOS_SEATBELT` cases, but that is not really necessary and
turns out to make the upcoming Landlock support a bit more complicated
to implement, so I had Codex remove it and clean up the call sites.
2025-04-30 14:08:27 -07:00
Michael Bolin
2f1d96e77d fix: remove errant eslint-disable so pnpm run lint passes again (#756)
My bad: introduced in https://github.com/openai/codex/pull/753.
2025-04-30 11:37:11 -07:00
Michael Bolin
84aaefa102 fix: read version from package.json instead of modifying session.ts (#753)
I am working to simplify the build process. As a first step, update
`session.ts` so it reads the `version` from `package.json` at runtime so
we no longer have to modify it during the build process. I want to get
to a place where the build looks like:

```
cd codex-cli
pnpm i
pnpm build
RELEASE_DIR=$(mktemp -d)
cp -r bin "$RELEASE_DIR/bin"
cp -r dist "$RELEASE_DIR/dist"
cp -r src "$RELEASE_DIR/src" # important if we want sourcemaps to continue to work
cp ../README.md "$RELEASE_DIR"
VERSION=$(printf '0.1.%d' $(date +%y%m%d%H%M))
jq --arg version "$VERSION" '.version = $version' package.json > "$RELEASE_DIR/package.json"
```

Then the contents of `$RELEASE_DIR` should be good to `npm publish`, no?
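
A minimal sketch of the runtime lookup (the relative path is an assumption; the real code lives in `session.ts`):

```ts
import { readFileSync } from "fs";
import { join } from "path";

// Read `version` from package.json at runtime so the build no longer
// has to rewrite session.ts with a baked-in version string.
const pkg = JSON.parse(
  readFileSync(join(__dirname, "..", "package.json"), "utf8"),
) as { version: string };

export const CLI_VERSION = pkg.version;
```
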
2025-04-30 11:03:10 -07:00
Kevin Alwell
a6ed7ff103 Fixes issue #726 by adding config to configToSave object (#728)
The saveConfig() function only includes a hardcoded subset of properties
when writing the config file. Any property not explicitly listed (like
disableResponseStorage) will be dropped.
I have added `disableResponseStorage` to the `configToSave` object as
the immediate fix.

[Linking Issue this fixes.](https://github.com/openai/codex/issues/726)
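
A sketch of the fix's shape (the surrounding fields are illustrative; only `disableResponseStorage` is the property this PR adds):

```ts
type StoredConfig = {
  model?: string;
  provider?: string;
  disableResponseStorage?: boolean;
};

// Build the subset of properties that saveConfig() persists; anything
// left out of this object is silently dropped on save.
function buildConfigToSave(config: StoredConfig): StoredConfig {
  return {
    model: config.model,
    provider: config.provider,
    disableResponseStorage: config.disableResponseStorage, // <-- added
  };
}
```
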
2025-04-29 13:10:16 -04:00
Rashim
892242ef7c feat: add --reasoning CLI flag (#314)
This PR adds a new CLI flag: `--reasoning`, which allows users to
customize the reasoning effort level (`low`, `medium`, or `high`) used
by OpenAI's `o` models.
By introducing the `--reasoning` flag, users gain more flexibility when
working with the models. It enables optimization for either speed or
depth of reasoning, depending on specific use cases.
This PR resolves #107

- **Flag**: `--reasoning`
- **Accepted Values**: `low`, `medium`, `high`
- **Default Behavior**: If not specified, the model uses the default
reasoning level.

## Example Usage

```bash
codex --reasoning=low "Write a simple function to calculate factorial"
```

---------

Co-authored-by: Fouad Matin <169186268+fouad-openai@users.noreply.github.com>
Co-authored-by: yashrwealthy <yash.rastogi@wealthy.in>
Co-authored-by: Thibault Sottiaux <tibo@openai.com>
2025-04-29 07:30:49 -07:00
Thibault Sottiaux
d09dbba7ec feat: lower default retry wait time and increase number of tries (#720)
In total we now guarantee that we will wait for at least 60s before
giving up.

---------

Signed-off-by: Thibault Sottiaux <tibo@openai.com>
2025-04-28 21:11:30 -07:00
Michael Bolin
40460faf2a fix: tighten up check for /usr/bin/sandbox-exec (#710)
* In both TypeScript and Rust, we now invoke `/usr/bin/sandbox-exec`
explicitly rather than whatever `sandbox-exec` happens to be on the
`PATH`.
* Changed `isSandboxExecAvailable` to use `access()` rather than
`command -v` so that:
  * We only do the check once over the lifetime of the Codex process.
  * The check is specific to `/usr/bin/sandbox-exec`.
  * We now do a syscall rather than incur the overhead of spawning a
    process, dealing with timeouts, etc.

I think there is still room for improvement here where we should move
the `isSandboxExecAvailable` check earlier in the CLI, ideally right
after we do arg parsing to verify that we can provide the Seatbelt
sandbox if that is what the user has requested.
2025-04-28 13:42:04 -07:00
Thibault Sottiaux
fa5fa8effc fix: only allow running without sandbox if explicitly marked in safe container (#699)
Signed-off-by: Thibault Sottiaux <tibo@openai.com>
2025-04-28 07:48:38 -07:00
Thibault Sottiaux
e9d16d3c2b fix: check if sandbox-exec is available (#696)
- Introduce `isSandboxExecAvailable()` helper and tidy import ordering
in `handle-exec-command.ts`.
- Add runtime check for the `sandbox-exec` binary on macOS; fall back to
`SandboxType.NONE` with a warning if it’s missing, preventing crashes.

---------

Signed-off-by: Thibault Sottiaux <tibo@openai.com>
Co-authored-by: Fouad Matin <fouad@openai.com>
2025-04-27 17:04:47 -07:00
Fouad Matin
523996b5cb fix: /diff should include untracked files (#686) 2025-04-26 12:43:51 -07:00
Tomas Cupr
bc500d3009 feat: user config api key (#569)
Adds support for reading OPENAI_API_KEY (and other variables) from a
user-wide dotenv file (~/.codex.config). Precedence order is now:
  1. explicit environment variable
  2. project-local .env (loaded earlier)
  3. ~/.codex.config

Also adds a regression test that ensures the multiline editor correctly
handles cases where printable text and the CSI-u Shift+Enter sequence
arrive in the same input chunk.

House-kept with Prettier; removed stray temp.json artifact.
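
A minimal sketch of that precedence for `OPENAI_API_KEY` (assuming the `dotenv` package; since the project-local `.env` is loaded into `process.env` earlier, steps 1 and 2 collapse into the first check):

```ts
import { existsSync, readFileSync } from "fs";
import { homedir } from "os";
import { join } from "path";
import dotenv from "dotenv";

export function resolveApiKey(): string | undefined {
  // Steps 1-2: explicit env var, or project-local .env loaded earlier.
  if (process.env.OPENAI_API_KEY) return process.env.OPENAI_API_KEY;
  // Step 3: fall back to the user-wide ~/.codex.config dotenv file.
  const userConfig = join(homedir(), ".codex.config");
  if (!existsSync(userConfig)) return undefined;
  return dotenv.parse(readFileSync(userConfig, "utf8")).OPENAI_API_KEY;
}
```
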
2025-04-26 10:13:30 -07:00
moppywhip
9b0ccf9aeb fix: duplicate messages in quiet mode (#680)
Addressing #600 and #664 (partially)

## Bug
Codex was staging duplicate items in quiet mode when the same response
item appeared in two streaming events. Specifically:

1. Items would be staged once when received as a
`response.output_item.done` event
2. The same items would be staged again when included in the final
`response.completed` payload

This duplication resulted in each message being emitted several times in
the quiet mode output.

## Changes
- Added a Set (`alreadyStagedItemIds`) to track items that have already
been staged
- Modified the `stageItem` function to check if an item's ID is already
in this set before staging it
- Added a regression test (`agent-dedupe-items.test.ts`) that verifies
items with the same ID are only staged once
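
A minimal sketch of the guard (`alreadyStagedItemIds` is the PR's name; the item type and callback are illustrative):

```ts
type ResponseItem = { id?: string };

const alreadyStagedItemIds = new Set<string>();

function stageItem(item: ResponseItem, onItem: (i: ResponseItem) => void): void {
  if (item.id != null) {
    if (alreadyStagedItemIds.has(item.id)) {
      // Already seen, e.g. once via response.output_item.done and again
      // in the final response.completed payload.
      return;
    }
    alreadyStagedItemIds.add(item.id);
  }
  onItem(item);
}
```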

## Testing
Like other tests, the included test creates a mock OpenAI stream that
emits the same message twice (once as an incremental event and once in
the final response) and verifies the item is only passed to `onItem`
once.
2025-04-26 09:14:50 -07:00
Fouad Matin
103093f793 bump(version): 0.1.2504251709 (#660)
## `0.1.2504251709`

### 🚀 Features

- Add openai model info configuration (#551)
- Added provider to run quiet mode function (#571)
- Create parent directories when creating new files (#552)
- Print bug report URL in terminal instead of opening browser (#510)
(#528)
- Add support for custom provider configuration in the user config
(#537)
- Add support for OpenAI-Organization and OpenAI-Project headers (#626)
- Add specific instructions for creating API keys in error msg (#581)
- Enhance toCodePoints to prevent potential unicode 14 errors (#615)
- More native keyboard navigation in multiline editor (#655)
- Display error on selection of invalid model (#594)

### 🪲 Bug Fixes

- Model selection (#643)
- Nits in apply patch (#640)
- Input keyboard shortcuts (#676)
- `apply_patch` unicode characters (#625)
- Don't clear turn input before retries (#611)
- More loosely match context for apply_patch (#610)
- Update bug report template - there is no --revision flag (#614)
- Remove outdated copy of text input and external editor feature (#670)
- Remove unreachable "disableResponseStorage" logic flow introduced in
#543 (#573)
- Non-openai mode - fix for gemini content: null, fix 429 to throw
before stream (#563)
- Only allow going up in history when not already in history if input is
empty (#654)
- Do not grant "node" user sudo access when using run_in_container.sh
(#627)
- Update scripts/build_container.sh to use pnpm instead of npm (#631)
- Update lint-staged config to use pnpm --filter (#582)
- Non-openai mode - don't default temp and top_p (#572)
- Fix error catching when checking for updates (#597)
- Close stdin when running an exec tool call (#636)
2025-04-25 17:15:40 -07:00
Tomas Cupr
4760aa1eb9 perf: optimize token streaming with balanced approach (#635)
- Replace setTimeout(10ms) with queueMicrotask for immediate processing
- Add minimal 3ms setTimeout for rendering to maintain readable UX
- Reduces per-token delay while preserving streaming experience
- Add performance test to verify optimization works correctly
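
A sketch of the balanced scheduling (illustrative; not the exact agent code):

```ts
// Token processing moves to a microtask (no artificial delay), while
// rendering is batched behind a minimal 3 ms timer so the stream stays
// readable instead of repainting on every token.
function onToken(processTokens: () => void, render: () => void): void {
  queueMicrotask(processTokens);
  setTimeout(render, 3);
}
```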

---------

Co-authored-by: Claude <noreply@anthropic.com>
Co-authored-by: Thibault Sottiaux <tibo@openai.com>
2025-04-25 10:49:38 -07:00
rumple
69ce06d2f8 feat: Add support for OpenAI-Organization and OpenAI-Project headers (#626)
Added support for OpenAI-Organization and OpenAI-Project headers for
OpenAI API calls.

This is for #74
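
A minimal sketch using the SDK's client options for these headers (the env var names are assumptions):

```ts
import OpenAI from "openai";

// `organization` and `project` populate the OpenAI-Organization and
// OpenAI-Project request headers, respectively.
const client = new OpenAI({
  organization: process.env.OPENAI_ORGANIZATION,
  project: process.env.OPENAI_PROJECT,
});
```
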
2025-04-25 09:52:42 -07:00
Luci
3fe7e53327 fix: nits in apply patch (#640)
## Description

Fix a nit in `apply patch`, potentially improving performance slightly.
2025-04-25 07:27:48 -07:00
Luci
a9ecb2efce chore: upgrade prettier to v3 (#644)
## Description

This PR addresses the following improvements:

**Unify Prettier Version**: Currently, the Prettier version used in
`/package.json` and `/codex-cli/package.json` are different. In this PR,
we're updating both to use Prettier v3.

- Prettier v3 introduces improved support for JavaScript and TypeScript.
(e.g. the formatting scenario shown in the image below. This is more
aligned with the TypeScript indentation standard).

<img width="1126" alt="image"
src="https://github.com/user-attachments/assets/6e237eb8-4553-4574-b336-ed9561c55370"
/>

**Add Prettier Auto-Formatting in lint-staged**: We've added a step to
automatically run prettier --write on JavaScript and TypeScript files as
part of the lint-staged process, before the ESLint checks.

- This will help ensure that all committed code is properly formatted
according to the project's Prettier configuration.
2025-04-25 07:21:50 -07:00
Luci
c38c2a59c7 fix(utils): save config (#578)
## Description

When `saveConfig` is called, the project doc is incorrectly saved into
user instructions. This change ensures that only user instructions are
saved to `instructions.md` during saveConfig, preventing data
corruption.

close: #576

---------

Co-authored-by: Thibault Sottiaux <tibo@openai.com>
2025-04-24 17:32:33 -07:00
nvp159
5e40d9d221 feat(bug-report): print bug report URL in terminal instead of opening browser (#510) (#528)
Solves #510 
This PR changes the `/bug` command to print the URL into the terminal
(so it works in headless sessions) instead of trying to open a browser.

---------

Co-authored-by: Thibault Sottiaux <tibo@openai.com>
2025-04-24 17:00:14 -07:00
Misha Davidov
acc4acc81e fix: apply_patch unicode characters (#625)
Fuzzier matching for `apply_patch` to handle U+00A0 and U+202F spaces.
2025-04-24 13:04:37 -07:00
Luci
e84fa6793d fix(agent-loop): notify type (#608)
## Description

The `as AppConfig` type assertion in the constructor may introduce
potential type safety risks. Removing the assertion and making `notify`
an optional parameter could enhance type robustness and prevent
unexpected runtime errors.

close: #605
2025-04-24 11:08:52 -07:00
Asa
d1c0d5e683 feat: update README and config to support custom providers with API k… (#577)
When using a non-built-in provider with the `--provider` option, users
are prompted:

```
Set the environment variable <provider>_API_KEY and re-run this command.
You can create a <provider>_API_KEY in the <provider> dashboard.
```

However, many users are confused because, even after correctly setting
`<provider>_API_KEY`, authentication may still fail unless
`OPENAI_API_KEY` is _also_ present in the environment. This is not
intuitive and leads to ambiguity about which API key is actually
required and used as a fallback, especially when using custom or
third-party (non-listed) providers.

Furthermore, the original README/documentation did not mention the
requirement to set `<provider>_BASE_URL` for non-built-in providers,
which is necessary for proper client behavior. This omission made the
configuration process more difficult for users trying to integrate with
custom endpoints.
2025-04-24 11:08:19 -07:00
Misha Davidov
9b102965b9 feat: more loosely match context for apply_patch (#610)
More of a proposal than anything but models seem to struggle with
composing valid patches for `apply_patch` for context matching when
there are unicode look-a-likes involved. This would normalize them.

```
top-level          # ASCII
top-level          # U+2011 NON-BREAKING HYPHEN
top–level          # U+2013 EN DASH
top—level          # U+2014 EM DASH
top‒level          # U+2012 FIGURE DASH
```

thanks unicode.
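
A minimal sketch of the normalization idea (the mapping set is illustrative):

```ts
// Map common Unicode look-alikes to ASCII before context matching so a
// model-composed patch still lines up with the file on disk.
function normalizeForMatch(s: string): string {
  return s
    .replace(/[\u00A0\u202F]/g, " ")               // non-breaking spaces
    .replace(/[\u2011\u2012\u2013\u2014]/g, "-");  // hyphen/dash variants
}
```
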
2025-04-24 09:05:19 -07:00
Connor Christie
622323a59b fix: don't clear turn input before retries (#611)
The current turn input in the agent loop is discarded before the stream
events are consumed, which causes the stream reconnect (after a rate-limit
failure) to omit the inputs. Since the new stream includes the previous
response ID, this triggers a bad-request exception, because the input no
longer matches what OpenAI has stored on the server side, and subsequently
a very confusing error message: `No tool output found for function call
call_xyz`.

This should fix https://github.com/openai/codex/issues/586.

## Testing

I have a personal project that I'm working on that runs multiple Codex
CLIs in parallel and often runs into rate limit errors (as seen in the
OpenAI logs). After making this change, I am no longer experiencing
Codex crashing and it was able to retry and handle everything gracefully
until completion (even though I still see rate limiting in the OpenAI
logs).
2025-04-24 06:29:36 -05:00
kshern
146a61b073 feat: add support for custom provider configuration in the user config (#537)
### What

- Add support for loading and merging custom provider configurations
from a local `providers.json` file.
- Allow users to override or extend default providers with their own
settings.

### Why

This change enables users to flexibly customize and extend provider
endpoints and API keys without modifying the codebase, making the CLI
more adaptable for various LLM backends and enterprise use cases.

### How

- Introduced `loadProvidersFromFile` and `getMergedProviders` in config
logic.
- Added/updated related tests in `tests/config.test.tsx`.


### Checklist

- [x] Lint passes for changed files
- [x] Tests pass for all files
- [x] Documentation/comments updated as needed

---------

Co-authored-by: Thibault Sottiaux <tibo@openai.com>
2025-04-23 01:45:56 -04:00
Connor Christie
cbeb5c3057 fix: remove unreachable "disableResponseStorage" logic flow introduced in #543 (#573)
This PR cleans up unreachable code that was added as a result of
https://github.com/openai/codex/pull/543.

The code being removed is already being handled above:

23f0887df3/codex-cli/src/utils/agent/agent-loop.ts (L535-L539)
2025-04-23 01:08:52 -04:00
Daniel Nakov
4261973467 bug: non-openai mode - don't default temp and top_p (#572)
I haven't seen any actual errors due to this, but it's been bothering me
that I had it defaulted to 1. I think best to leave it undefined and
have each provider do their thing
2025-04-23 01:07:40 -04:00
Daniel Nakov
23f0887df3 bug: non-openai mode - fix for gemini content: null, fix 429 to throw before stream (#563)
Gemini's API is finicky, it 400's without an error when you pass
content: null
Also fixed the rate limiting issues by throwing outside of the iterator.
I think there's a separate issue with the second isRateLimit check in
agent-loop - turnInput is cleared by that time, so it retries without
the last message.
2025-04-22 20:37:48 -04:00
Misha Davidov
20b6ef0de8 feat: create parent directories when creating new files. (#552)
apply_patch doesn't create parent directories when creating a new file,
leading to confusion and flailing by the agent. This change creates parent
directories automatically when they are absent.
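
A minimal sketch of the behavior (illustrative; not the exact apply_patch code):

```ts
import { mkdirSync, writeFileSync } from "fs";
import { dirname } from "path";

// Create the parent directory chain before writing a newly added file.
function writeNewFile(targetPath: string, contents: string): void {
  mkdirSync(dirname(targetPath), { recursive: true });
  writeFileSync(targetPath, contents, "utf8");
}
```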

---------

Co-authored-by: Thibault Sottiaux <tibo@openai.com>
2025-04-22 19:45:17 -04:00
chunterb
750d97e8ad feat: add openai model info configuration (#551)
In reference to [Issue 548](https://github.com/openai/codex/issues/548)
- part 1.
2025-04-22 17:31:25 -04:00
Fouad Matin
12bc2dcc4e bump(version): 0.1.2504221401 (#559)
## `0.1.2504221401`

### 🚀 Features

- Show actionable errors when api keys are missing (#523)
- Add CLI `--version` flag (#492)

### 🐛 Bug Fixes

- Agent loop for ZDR (`disableResponseStorage`) (#543)
- Fix relative `workdir` check for `apply_patch` (#556)
- Minimal mid-stream #429 retry loop using existing back-off (#506)
- Inconsistent usage of base URL and API key (#507)
- Remove requirement for api key for ollama (#546)
- Support `[provider]_BASE_URL` (#542)
2025-04-22 14:18:04 -07:00
Nick Carchedi
dc096302e5 fix typo in prompt (#558) 2025-04-22 17:15:28 -04:00
Michael Bolin
7c1f2d7deb when a shell tool call invokes apply_patch, resolve relative paths against workdir, if specified (#556)
Previously, we were ignoring the `workdir` field in an `ExecInput` when
running it through `canAutoApprove()`. For ordinary `exec()` calls, that
was sufficient, but for `apply_patch`, we need the `workdir` to resolve
relative paths in the `apply_patch` argument so that we can check them
in `isPathConstrainedTowritablePaths()`.

Likewise, we also need the workdir when running `execApplyPatch()`
because the paths need to be resolved again.

Ideally, the `ApplyPatchCommand` returned by `canAutoApprove()` would
not be a simple `patch: string`, but the parsed patch with all of the
paths resolved, in which case `execApplyPatch()` could expect absolute
paths and would not need `workdir`.
2025-04-22 14:07:47 -07:00
Fouad Matin
a30e79b768 fix: agent loop for disable response storage (#543)
- Fixes post-merge of #506

---------

Co-authored-by: Ilan Bigio <ilan@openai.com>
2025-04-22 13:49:10 -07:00
Daniil Davydov
f99c9080fd fix: support [provider]_BASE_URL (#542)
Resolved issue where an OLLAMA_BASE_URL was not properly handled
(openai/codex#516).
2025-04-22 15:05:48 -04:00
Scott Leibrand
ee6e1765fa agent-loop: minimal mid-stream #429 retry loop using existing back-off (#506)
As requested by @tibo-openai at
https://github.com/openai/codex/pull/357#issuecomment-2816554203, this
attempts a more minimal implementation of #357 that preserves as much as
possible of the existing code's exponential backoff logic.

Adds a small retry wrapper around the streaming for-await loop so that
HTTP 429s which occur *after* the stream has started no longer crash the
CLI.

Highlights:
- Re-uses the existing RATE_LIMIT_RETRY_WAIT_MS constant and 5-attempt
limit.
- Exponential back-off identical to the initial request handling.

This comment is probably more useful here in the PR:

```
// The OpenAI SDK may raise a 429 (rate-limit) *after* the stream has
// started. Prior logic already retries the initial `responses.create`
// call, but we need to add equivalent resilience for mid-stream
// failures. We keep the implementation minimal by wrapping the
// existing `for-await` loop in a small retry-for-loop that re-creates
// the stream with exponential back-off.
```
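
A sketch of the wrapper's shape (names and back-off math are illustrative; the real code reuses the loop's existing constants and limits):

```ts
const RATE_LIMIT_RETRY_WAIT_MS = 500; // assumed value, for illustration
const MAX_ATTEMPTS = 5;

async function consumeStreamWithRetry(
  createStream: () => Promise<AsyncIterable<unknown>>,
  onEvent: (event: unknown) => void,
): Promise<void> {
  for (let attempt = 1; attempt <= MAX_ATTEMPTS; attempt++) {
    try {
      for await (const event of await createStream()) {
        onEvent(event);
      }
      return; // stream finished without a mid-stream failure
    } catch (err) {
      const isRateLimit = (err as { status?: number }).status === 429;
      if (!isRateLimit || attempt === MAX_ATTEMPTS) throw err;
      // Exponential back-off, mirroring the initial-request handling.
      await new Promise((r) =>
        setTimeout(r, RATE_LIMIT_RETRY_WAIT_MS * 2 ** (attempt - 1)),
      );
    }
  }
}
```
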
2025-04-22 11:02:10 -04:00
Gabriel Bianconi
98a22273d9 fix: inconsistent usage of base URL and API key (#507)
A recent commit introduced the ability to use third-party model
providers. (Really appreciate it!)

However, the usage is inconsistent: some pieces of code use the custom
providers, whereas others still have the old behavior. Additionally,
`OPENAI_BASE_URL` is now being disregarded when it shouldn't be.

This PR normalizes the usage to `getApiKey` and `getBaseUrl`, and
enables the use of `OPENAI_BASE_URL` if present.

---------

Co-authored-by: Gabriel Bianconi <GabrielBianconi@users.noreply.github.com>
2025-04-22 10:51:26 -04:00
Thomas
d78f77edb7 fix(agent-loop): update required properties to include workdir and ti… (#530)
Without this I get an issue running codex it in a docker container. I
receive:

```
{
    "answer": "{\"role\":\"user\",\"content\":[{\"type\":\"input_text\",\"text\":\"\\\"Say hello world\\\"\"}],\"type\":\"message\"}\n{\"id\":\"error-1745325184914\",\"type\":\"message\",\"role\":\"system\",\"content\":[{\"type\":\"input_text\",\"text\":\"⚠️  OpenAI rejected the request (request ID: req_f9027b59ebbce00061e9cd2dbb2d529a). Error details: Status: 400, Code: invalid_function_parameters, Type: invalid_request_error, Message: 400 Invalid schema for function 'shell': In context=(), 'required' is required to be supplied and to be an array including every key in properties. Missing 'workdir'.. Please verify your settings and try again.\"}]}\n"
}
```

This fix makes it work.
2025-04-22 10:32:36 -04:00
Fouad Matin
2cb8355968 bump(version): 0.1.2504220136 (#518)
## `0.1.2504220136`

### 🚀 Features

- Add support for ZDR orgs (#481)
- Include fractional portion of chunk that exceeds stdout/stderr limit
(#497)
2025-04-22 01:45:30 -07:00
Fouad Matin
9f5ccbb618 feat: add support for ZDR orgs (#481)
- Add `store: boolean` to `AgentLoop` to enable client-side storage of
response items
- Add `--disable-response-storage` arg + `disableResponseStorage` config
2025-04-22 01:30:16 -07:00
Michael Bolin
3eba86a553 include fractional portion of chunk that exceeds stdout/stderr limit (#497)
I saw cases where the first chunk of output from `ls -R` could be large
enough to exceed `MAX_OUTPUT_BYTES` or `MAX_OUTPUT_LINES`, in which case
the loop would exit early in `createTruncatingCollector()` such that
nothing was appended to the `chunks` array. As a result, the reported
`stdout` of `ls -R` would be empty.
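
A sketch of the fix's shape (names and the byte budget are illustrative; the real logic lives in `createTruncatingCollector()`):

```ts
import { Buffer } from "buffer";

const MAX_OUTPUT_BYTES = 10 * 1024; // the limit in effect at the time

// Keep the fraction of an oversized chunk that still fits in the budget
// instead of dropping the whole chunk and reporting empty output.
function appendChunk(chunks: Buffer[], collectedBytes: number, chunk: Buffer): number {
  const remaining = MAX_OUTPUT_BYTES - collectedBytes;
  if (remaining <= 0) return collectedBytes;
  const slice = chunk.length > remaining ? chunk.subarray(0, remaining) : chunk;
  chunks.push(slice);
  return collectedBytes + slice.length;
}
```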

I asked Codex to add logic to handle this edge case and write a unit
test. I used this as my test:

```
./codex-cli/dist/cli.js -q 'what is the output of `ls -R`'
```

now it appears to include a ton of stuff whereas before this change, I
saw:

```
{"type":"function_call_output","call_id":"call_a2QhVt7HRJYKjb3dIc8w1aBB","output":"{\"output\":\"\\n\\n[Output truncated: too many lines or bytes]\",\"metadata\":{\"exit_code\":0,\"duration_seconds\":0.5}}"}
```
2025-04-21 19:06:03 -07:00