valknar/llmx - llmx - dev.pivoine.art

Author	SHA1	Message	Date
jif-oai	aa76003e28	chore: unify config crates (#5958)	2025-10-30 10:28:32 +00:00
jif-oai	3e50f94d76	feat: support verbosity in model_family (#5821)	2025-10-27 18:46:30 +00:00
Thibault Sottiaux	224222f09f	fix: use codex-exp prefix for experimental models and consider codex- models to be production (#5797)	2025-10-27 01:55:12 +00:00
jif-oai	ad9a289951	chore: drop env var flag (#5462)	2025-10-21 16:11:12 +00:00
Ahmed Ibrahim	049a61bcfc	Auto compact at ~90% (#5292) Users now hit a window exceeded limit and they usually don't know what to do. This starts auto compact at ~90% of the window.	2025-10-20 11:29:49 -07:00
jif-oai	f52320be86	feat: grep_files as a tool (#4820) Add `grep_files` to be able to perform more actions in parallel	2025-10-08 11:02:50 +01:00
jif-oai	226215f36d	feat: `list_dir` tool (#4817) Add a `list_dir` tool. It is useful because we can mark it as non-mutating and so use it in parallel	2025-10-07 19:33:19 +01:00
jif-oai	f3b4a26f32	chore: drop read-file for gpt-5-codex (#4739) Drop `read_file` for gpt-5-codex (will do the same for parallel tool call) and add `codex-` as internal model for this kind of feature	2025-10-05 16:26:04 +00:00
jif-oai	dc3c6bf62a	feat: parallel tool calls (#4663) Add parallel tool calls. This is configurable at model level and tool level	2025-10-05 16:10:49 +00:00
Dylan	3203862167	chore: update tool config (#4755) ## Summary Updates tool config for gpt-5-codex ## Test Plan - [x] Ran locally - [x] Updated unit tests	2025-10-04 22:47:26 -07:00
Dylan	4764fc1ee7	feat: Freeform apply_patch with simple shell output (#4718) ## Summary This PR is an alternative approach to #4711, but instead of changing our storage, parses out shell calls in the client and reserializes them on the fly before we send them out as part of the request. What this changes: 1. Adds additional serialization logic when the ApplyPatchToolType::Freeform is in use. 2. Adds a --custom-apply-patch flag to enable this setting on a session-by-session basis. This change is delicate, but is not meant to be permanent. It is meant to be the first step in a migration: 1. (This PR) Add in-flight serialization with config 2. Update model_family default 3. Update serialization logic to store turn outputs in a structured format, with logic to serialize based on model_family setting. 4. Remove this rewrite in-flight logic. ## Test Plan - [x] Additional unit tests added - [x] Integration tests added - [x] Tested locally	2025-10-04 19:16:36 -07:00
jif-oai	e0b38bd7a2	feat: add `beta_supported_tools` (#4669) Gate the new read_file tool behind a new `beta_supported_tools` flag and only enable it for `gpt-5-codex`	2025-10-03 16:58:03 +00:00
jif-oai	33d3ecbccc	chore: refactor tool handling (#4510) # Tool System Refactor - Centralizes tool definitions and execution in `core/src/tools/`: specs (`spec.rs`), handlers (`handlers/`), router (`router.rs`), registry/dispatch (`registry.rs`), and shared context (`context.rs`). One registry now builds the model-visible tool list and binds handlers. - Router converts model responses to tool calls; Registry dispatches with consistent telemetry via `codex-rs/otel` and unified error handling. Function, Local Shell, MCP, and experimental `unified_exec` all flow through this path; legacy shell aliases still work. - Rationale: reduce per‑tool boilerplate, keep spec/handler in sync, and make adding tools predictable and testable. Example: `read_file` - Spec: `core/src/tools/spec.rs` (see `create_read_file_tool`, registered by `build_specs`). - Handler: `core/src/tools/handlers/read_file.rs` (absolute `file_path`, 1‑indexed `offset`, `limit`, `L#: ` prefixes, safe truncation). - E2E test: `core/tests/suite/read_file.rs` validates the tool returns the requested lines. ## Next steps: - Decompose `handle_container_exec_with_params` - Add parallel tool calls	2025-10-03 13:21:06 +01:00
Michael Bolin	f037b2fd56	chore: rename (#3648)	2025-09-15 08:17:13 -07:00
Dylan	b6673838e8	fix: model family and apply_patch consistency (#3603) ## Summary Resolves a merge conflict between #3597 and #3560, and adds tests to double check our apply_patch configuration. ## Testing - [x] Added unit tests --------- Co-authored-by: dedrisian-oai <dedrisian@openai.com>	2025-09-14 18:20:37 -07:00
pakrym-oai	9177bdae5e	Only one branch for swiftfox (#3601) Make each model family have a single branch.	2025-09-14 16:56:22 -07:00
pakrym-oai	916fdc2a37	Add per-model-family prompts (#3597) Allows more flexibility in defining prompts.	2025-09-14 22:45:15 +00:00
Thibault Sottiaux	bac8a427f3	chore: default swiftfox models to experimental reasoning summaries (#3560)	2025-09-13 23:40:54 +00:00
Andrew Tan	de6559f2ab	Include apply_patch tool for oss models from gpt-oss providers with different naming convention (e.g. `openai/gpt-oss-*`) (#2811) Model providers like Groq, Openrouter, AWS Bedrock, VertexAI and others typically prefix the name of gpt-oss models with `openai`, e.g. `openai/gpt-oss-120b`. This PR matches the model name slug using `contains` instead of `starts_with` to ensure that the `apply_patch` tool is included in the tools for model names like `openai/gpt-oss-120b`. Without this, the gpt-oss models will often try to call the `apply_patch` tool directly instead of via the `shell` command, leading to validation errors. I have run all the local checks. Note: The gpt-oss models from non-Ollama providers are typically run via a profile with a different base_url (instead of with the `--oss` flag) --------- Co-authored-by: Andrew Tan <andrewtan@Andrews-Mac.local>	2025-09-09 15:02:02 -07:00
Anton Panasenko	e60a44cbab	[codex] move configuration for reasoning summary format to model family config type (#3171)	2025-09-04 11:00:01 -07:00
Dylan	4157788310	[apply_patch] disable default freeform tool (#2643) ## Summary We're seeing some issues in the freeform tool - let's disable by default until it stabilizes. ## Testing - [x] Ran locally, confirmed codex-cli could make edits	2025-08-24 11:12:37 -07:00
Dylan	236c4f76a6	[apply_patch] freeform apply_patch tool (#2576) ## Summary GPT-5 introduced the concept of [custom tools](https://platform.openai.com/docs/guides/function-calling#custom-tools), which allow the model to send a raw string result back, simplifying json-escape issues. We are migrating gpt-5 to use this by default. However, gpt-oss models do not support custom tools, only normal functions. So we keep both tool definitions, and provide whichever one the model family supports. ## Testing - [x] Tested locally with various models - [x] Unit tests pass	2025-08-22 13:42:34 -07:00
Dylan	6df8e35314	[tools] Add apply_patch tool (#2303) ## Summary We've been seeing a number of issues and reports with our synthetic `apply_patch` tool, e.g. #802. Let's make this a real tool - in my anecdotal testing, it's critical for GPT-OSS models, but I'd like to make it the standard across GPT-5 and codex models as well. ## Testing - [x] Tested locally - [x] Integration test	2025-08-15 11:55:53 -04:00
pakrym-oai	de2c6a2ce7	Enable reasoning for codex-prefixed models (#2275) ## Summary - enable reasoning for any model slug starting with `codex-` - provide default model info for `codex-` slugs - test that codex models are detected and support reasoning ## Testing - `just fmt` - `just fix` (fails: E0658 `let` expressions in this position are unstable) - `cargo test --all-features` (fails: E0658 `let` expressions in this position are unstable) ------ https://chatgpt.com/codex/tasks/task_i_689d13f8705483208a6ed21c076868e1	2025-08-13 17:02:50 -07:00
pakrym-oai	7e9ecfbc6a	Rename the model (#1942)	2025-08-07 09:07:51 -07:00
pakrym-oai	57c973b571	Add 2025-08-06 model family (#1899)	2025-08-06 23:14:02 +00:00
easong-openai	9285350842	Introduce `--oss` flag to use gpt-oss models (#1848) This adds support for easily running Codex backed by a local Ollama instance running our new open source models. See https://github.com/openai/gpt-oss for details. If you pass in `--oss` you'll be prompted to install/launch ollama, and it will automatically download the 20b model and attempt to use it. We'll likely want to expand this with some options later to make the experience smoother for users who can't run the 20b or want to run the 120b. Co-authored-by: Michael Bolin <mbolin@openai.com>	2025-08-05 11:31:11 -07:00
Michael Bolin	136b3ee5bf	chore: introduce ModelFamily abstraction (#1838) To date, we have a number of hardcoded OpenAI model slug checks spread throughout the codebase, which makes it hard to audit the various special cases for each model. To mitigate this issue, this PR introduces the idea of a `ModelFamily` that has fields to represent the existing special cases, such as `supports_reasoning_summaries` and `uses_local_shell_tool`. There is a `find_family_for_model()` function that maps the raw model slug to a `ModelFamily`. This function hardcodes all the knowledge about the special attributes for each model. This PR then replaces the hardcoded model name checks with checks against a `ModelFamily`. Note `ModelFamily` is now available as `Config::model_family`. We should ultimately remove `Config::model` in favor of `Config::model_family::slug`.	2025-08-04 23:50:03 -07:00
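
The slug-to-family mapping described in #1838, together with the `codex-` prefix rule from #2275 and the `contains` match for gpt-oss slugs from #2811, can be sketched roughly as below. This is an illustration only, not the repository's code: the `family` and `apply_patch_tool` fields, the `Option` return type, and the exact branch order are assumptions, and `uses_local_shell_tool` is left `false` everywhere because the log does not say which models set it.

```rust
/// Hypothetical, simplified stand-in for the `ModelFamily` type from #1838.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct ModelFamily {
    /// Raw model slug as passed by the user, e.g. "openai/gpt-oss-120b".
    pub slug: String,
    /// Family name (assumed field, for illustration).
    pub family: String,
    pub supports_reasoning_summaries: bool,
    pub uses_local_shell_tool: bool,
    /// Assumed flag standing in for "include the apply_patch tool".
    pub apply_patch_tool: bool,
}

/// Maps a raw model slug to a family, keeping all special cases in one place.
/// Returns `None` for slugs this sketch does not recognise (an assumption).
pub fn find_family_for_model(slug: &str) -> Option<ModelFamily> {
    // Default attributes; each branch overrides only what differs.
    let base = |family: &str| ModelFamily {
        slug: slug.to_string(),
        family: family.to_string(),
        supports_reasoning_summaries: false,
        uses_local_shell_tool: false,
        apply_patch_tool: false,
    };

    if slug.starts_with("codex-") {
        // #2275: reasoning is enabled for any slug starting with `codex-`.
        Some(ModelFamily {
            supports_reasoning_summaries: true,
            ..base("codex")
        })
    } else if slug.contains("gpt-oss") {
        // #2811: `contains` rather than `starts_with`, so provider-prefixed
        // slugs such as `openai/gpt-oss-120b` still get `apply_patch`.
        Some(ModelFamily {
            apply_patch_tool: true,
            ..base("gpt-oss")
        })
    } else {
        None
    }
}
```

With this sketch, `find_family_for_model("openai/gpt-oss-120b")` resolves to the gpt-oss family even though the slug carries a provider prefix, which is the case a `starts_with("gpt-oss")` check would miss.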

28 Commits
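
The `read_file` handler behaviour listed in #4510 (1-indexed `offset`, `limit`, `L#: ` prefixes) can be illustrated with a small pure function. This is a sketch under assumptions, not the handler in `core/src/tools/handlers/read_file.rs`: the name `format_lines` is hypothetical, `#` is assumed to be the 1-indexed line number, and file I/O, absolute-path validation, and safe truncation are not modelled.

```rust
/// Hypothetical helper: returns up to `limit` lines of `contents`, starting at
/// the 1-indexed line `offset`, each prefixed with `L<line number>: `.
pub fn format_lines(contents: &str, offset: usize, limit: usize) -> Result<Vec<String>, String> {
    // Line numbers are 1-indexed, so 0 is rejected rather than treated as "first line".
    if offset == 0 {
        return Err("offset must be a 1-indexed line number".to_string());
    }

    let lines = contents
        .lines()
        .enumerate() // 0-based index paired with each line
        .skip(offset - 1) // jump to the requested starting line
        .take(limit) // cap the number of lines returned
        .map(|(idx, line)| format!("L{}: {}", idx + 1, line))
        .collect();

    // An offset past the end of the input yields an empty list in this sketch.
    Ok(lines)
}
```

For example, calling `format_lines("a\nb\nc", 2, 2)` on a three-line input returns the second and third lines with their original line numbers in the prefix.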