Add --model flag to launch command with auto-pull #674
ericcurtin wants to merge 1 commit into main from
Conversation
The launch command now supports a --model flag to specify which model to use. When provided, the model will be automatically pulled if not available locally and preloaded in the background. Container and host apps receive the model name via the OPENAI_MODEL environment variable. For host apps, the command now verifies the endpoint is TCP-reachable and falls back to the standalone runner port if the Docker socket URL isn't accessible externally. This ensures host apps can connect to the model runner service properly. Signed-off-by: Eric Curtin <eric.curtin@docker.com>
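To make the reachability-and-fallback behavior described above concrete, here is a minimal standalone sketch of the idea: probe the primary endpoint over TCP and switch to the standalone runner address if only that one answers. This is illustrative only, not the PR's code; the helper name, the 2-second timeout, and both addresses (including port 12434) are assumptions.

```golang
// Illustrative sketch of a TCP reachability probe with fallback; the helper
// name and both addresses are assumptions, not taken from this PR.
package main

import (
	"fmt"
	"net"
	"time"
)

// tcpReachable reports whether a TCP connection to hostPort succeeds within timeout.
func tcpReachable(hostPort string, timeout time.Duration) bool {
	conn, err := net.DialTimeout("tcp", hostPort, timeout)
	if err != nil {
		return false
	}
	conn.Close()
	return true
}

func main() {
	primary := "model-runner.docker.internal:80" // assumed Docker-socket-backed address
	fallback := "localhost:12434"                // assumed standalone runner port
	endpoint := primary
	if !tcpReachable(primary, 2*time.Second) && tcpReachable(fallback, 2*time.Second) {
		endpoint = fallback
	}
	fmt.Println("host app would be pointed at:", endpoint)
}
```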
Summary of Changes
Hello @ericcurtin, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request significantly upgrades the
Highlights
Hey - I've found 1 issue, and left some high level feedback:
- The `ensureEndpointReachable` TCP probes and HTTP health check are hard-coded to 2–3 second timeouts and ignore `cmd.Context()`, which could make `launch` slow or unresponsive on flaky networks; consider using a context-aware dialer / HTTP client tied to `cmd.Context()` so the operation can be cancelled promptly (see the sketch after this list).
- The background `desktopClient.Preload` goroutine logs via `cmd.PrintErrf`, which isn’t obviously safe for concurrent use and may race with other command output; consider routing background logging through a dedicated logger or channel instead of writing directly from the goroutine.
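For the first point, a probe and health check can both honor the command's context so that cancellation aborts them instead of waiting out a fixed timeout. The following is a sketch only, assuming a context obtained from `cmd.Context()`; the function name, timeouts, and address are illustrative and not the PR's code.

```golang
// Sketch only: a context-aware TCP probe plus HTTP health check, so that
// cancelling ctx (e.g. the command's cmd.Context()) aborts the dial/request
// instead of waiting out a hard-coded timeout. Names are illustrative.
package main

import (
	"context"
	"fmt"
	"net"
	"net/http"
	"time"
)

func probeAndCheck(ctx context.Context, hostPort string) error {
	// TCP probe bounded by 2s but still cancellable via ctx.
	dialCtx, cancelDial := context.WithTimeout(ctx, 2*time.Second)
	defer cancelDial()
	var d net.Dialer
	conn, err := d.DialContext(dialCtx, "tcp", hostPort)
	if err != nil {
		return fmt.Errorf("tcp probe failed: %w", err)
	}
	conn.Close()

	// HTTP health check tied to the same parent context.
	reqCtx, cancelReq := context.WithTimeout(ctx, 3*time.Second)
	defer cancelReq()
	req, err := http.NewRequestWithContext(reqCtx, http.MethodGet, "http://"+hostPort+"/", nil)
	if err != nil {
		return err
	}
	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		return fmt.Errorf("health check failed: %w", err)
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return fmt.Errorf("unexpected status %d", resp.StatusCode)
	}
	return nil
}

func main() {
	if err := probeAndCheck(context.Background(), "localhost:12434"); err != nil {
		fmt.Println("probe failed:", err)
	}
}
```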
Prompt for AI Agents
Please address the comments from this code review:
## Overall Comments
- The `ensureEndpointReachable` TCP probes and HTTP health check are hard-coded to 2–3 second timeouts and ignore `cmd.Context()`, which could make `launch` slow or unresponsive on flaky networks; consider using a context-aware dialer / HTTP client tied to `cmd.Context()` so the operation can be cancelled promptly.
- The background `desktopClient.Preload` goroutine logs via `cmd.PrintErrf`, which isn’t obviously safe for concurrent use and may race with other command output; consider routing background logging through a dedicated logger or channel instead of writing directly from the goroutine.
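For the second point, one way (among several) to keep background output off the command's writer is to funnel messages through a channel drained by a single goroutine. This is a sketch with hypothetical names; `preload` stands in for the real `desktopClient.Preload` call and is not tied to the PR's actual wiring.

```golang
// Sketch: route log lines from a background preload goroutine through a
// channel so only one goroutine ever writes to the error stream.
package main

import (
	"fmt"
	"os"
	"time"
)

// preload is a placeholder for the real preload work.
func preload(model string) error {
	time.Sleep(100 * time.Millisecond)
	return nil
}

func main() {
	logs := make(chan string, 8)
	done := make(chan struct{})

	go func() {
		defer close(logs)
		if err := preload("ai/example-model"); err != nil { // hypothetical model name
			logs <- fmt.Sprintf("preload failed: %v", err)
			return
		}
		logs <- "model preloaded"
	}()

	// Drain background messages from a single goroutine only.
	go func() {
		for msg := range logs {
			fmt.Fprintln(os.Stderr, msg)
		}
		close(done)
	}()

	// ... the rest of launch would run here ...
	<-done
}
```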
## Individual Comments
### Comment 1
<location> `cmd/cli/commands/launch.go:390-392` </location>
<code_context>
+ // Verify it's actually the model runner by making a quick health check.
+ healthURL := "http://" + fallbackHost + "/"
+ client := &http.Client{Timeout: 3 * time.Second}
+ resp, err := client.Get(healthURL) //nolint:gosec // localhost health check
+ if err == nil {
+ resp.Body.Close()
+ cmd.PrintErrf("Using standalone runner at %s\n", fallbackHost)
+ ep.host = "http://" + fallbackHost
</code_context>
<issue_to_address>
**suggestion (bug_risk):** Health check only verifies request success and ignores status, which could mask misconfigured services on the fallback port.
Any successful `Get` (even a 500 or an unrelated HTTP service) is currently treated as “this is the model runner,” and the endpoints are rewritten. Please at least check for a 200 (`StatusOK`) or another minimal response invariant so we don’t redirect traffic to an arbitrary process just because it’s listening on the fallback port.
Suggested implementation:
```golang
if err == nil {
conn.Close()
// Verify it's actually the model runner by making a quick health check.
healthURL := "http://" + fallbackHost + "/"
client := &http.Client{Timeout: 3 * time.Second}
resp, err := client.Get(healthURL) //nolint:gosec // localhost health check
if err == nil {
if resp.StatusCode == http.StatusOK {
resp.Body.Close()
cmd.PrintErrf("Using standalone runner at %s\n", fallbackHost)
ep.host = "http://" + fallbackHost
ep.container = "http://" + net.JoinHostPort("host.docker.internal", fallbackPort)
return nil
}
resp.Body.Close()
}
}
```
This change assumes `net/http` is already imported in `cmd/cli/commands/launch.go` (it must be, since `http.Client` is used). If it's not, add:
```go
import "net/http"
```
to the import block.
</issue_to_address>
```golang
resp, err := client.Get(healthURL) //nolint:gosec // localhost health check
if err == nil {
    resp.Body.Close()
```
**suggestion (bug_risk):** Health check only verifies request success and ignores status, which could mask misconfigured services on the fallback port.
Any successful `Get` (even a 500 or an unrelated HTTP service) is currently treated as “this is the model runner,” and the endpoints are rewritten. Please at least check for a 200 (`StatusOK`) or another minimal response invariant so we don’t redirect traffic to an arbitrary process just because it’s listening on the fallback port.
Suggested implementation:
```golang
if err == nil {
    conn.Close()
    // Verify it's actually the model runner by making a quick health check.
    healthURL := "http://" + fallbackHost + "/"
    client := &http.Client{Timeout: 3 * time.Second}
    resp, err := client.Get(healthURL) //nolint:gosec // localhost health check
    if err == nil {
        if resp.StatusCode == http.StatusOK {
            resp.Body.Close()
            cmd.PrintErrf("Using standalone runner at %s\n", fallbackHost)
            ep.host = "http://" + fallbackHost
            ep.container = "http://" + net.JoinHostPort("host.docker.internal", fallbackPort)
            return nil
        }
        resp.Body.Close()
    }
}
```

This change assumes `net/http` is already imported in `cmd/cli/commands/launch.go` (it must be, since `http.Client` is used). If it's not, add `import "net/http"` to the import block.
Code Review
This pull request introduces a --model flag to the launch command, which simplifies using a specific model with an application. The changes include automatically pulling a model if it's not available locally and preloading it in the background. The model is then passed to the application via an environment variable. Additionally, the PR improves connectivity for host applications by adding a TCP reachability check for the model runner endpoint, with a fallback mechanism.
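Since the model name reaches the launched app only through the environment, consuming it on the app side is straightforward. The sketch below assumes the `OPENAI_MODEL` variable set by this PR plus a hypothetical `OPENAI_BASE_URL` companion variable (not confirmed by the PR) purely to show where the value would be used.

```golang
// Sketch: how a launched host app might read the injected model name.
// OPENAI_MODEL is set by the launch command per this PR; OPENAI_BASE_URL is a
// hypothetical companion variable used here only for illustration.
package main

import (
	"fmt"
	"os"
)

func main() {
	model := os.Getenv("OPENAI_MODEL")
	if model == "" {
		model = "ai/smollm2" // hypothetical default, for illustration only
	}
	baseURL := os.Getenv("OPENAI_BASE_URL") // assumption; a real app may configure its endpoint differently
	fmt.Printf("would request model %q from %s\n", model, baseURL)
}
```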
The implementation is mostly solid, but I've found a couple of areas for improvement in the new ensureEndpointReachable function. One is a correctness issue in the health check logic, and the other is a suggestion to make the code more robust.
```golang
if err == nil {
    resp.Body.Close()
    cmd.PrintErrf("Using standalone runner at %s\n", fallbackHost)
    ep.host = "http://" + fallbackHost
    ep.container = "http://" + net.JoinHostPort("host.docker.internal", fallbackPort)
    return nil
}
```
The health check for the fallback endpoint only verifies that the HTTP request doesn't produce an error, but it doesn't check the HTTP status code of the response. This could lead to incorrectly identifying a service as the model runner if another service is listening on that port. The check should validate that the response status is http.StatusOK.
Additionally, it's best practice to use defer resp.Body.Close() to ensure the response body is always closed.
```golang
if err == nil {
    defer resp.Body.Close()
    if resp.StatusCode == http.StatusOK {
        cmd.PrintErrf("Using standalone runner at %s\n", fallbackHost)
        ep.host = "http://" + fallbackHost
        ep.container = "http://" + net.JoinHostPort("host.docker.internal", fallbackPort)
        return nil
    }
}
```

```golang
if !strings.Contains(host, ":") {
    host = net.JoinHostPort(host, "80")
}
```
The code assumes port 80 for URLs without an explicit port. This is correct for the http scheme, but would be incorrect for https (which should be 443). To make the function more robust for future use with HTTPS endpoints, consider checking u.Scheme to determine the default port.
```golang
if !strings.Contains(host, ":") {
    port := "80"
    if u.Scheme == "https" {
        port = "443"
    }
    host = net.JoinHostPort(host, port)
}
```