GSoC 2026 Explorer

Feb 4, 2026

A partial list of ideas for Google Summer of Code 2026 participants contributing to the Gemini CLI project is provided here. Summary information for these ideas is as follows:

Mix of difficulties: 2 Hard (350hr), 7 Medium (175hr), 1 Easy-Medium (90hr)
Aligned with existing roadmap items: Extensions gallery (#18246), Hooks (#18253), Windows quality (#18251), Evals (#18257), Skills (#18255)
Mentors provided are only suggestions and are subject to change

1 - Network Traffic Proxy and Domain Filtering

2 - Behavioral Evaluation Test Framework

3 - Windows Developer Experience Improvements

4 - Multi-IDE Integration Enhancement

5 - Performance Monitoring and Optimization Dashboard

6 - Interactive Progress Visualization & Task Stepping

7 - Terminal-Integrated Debugging Companion

8 - Native Windows Sandbox using AppContainer

9 - Interactive Security Policy and Sandbox Wizard

10 - Unified Sandbox Driver Architecture

11 - Hands-Free Multimodal Voice Mode

12 - Generative Architecture & UI Visualization

Difficulty: Medium | Size: 175 hours | Area: Security

Implement a network proxy layer for sandboxed command execution that routes HTTP/HTTPS and other TCP traffic through a controlled gateway. This enables domain allowlisting/denylisting, traffic logging for audit purposes, and permission prompts for new network destinations. The feature would provide defense-in-depth protection even against potential sandbox escapes

Expected Outcomes

HTTP/HTTPS proxy implementation with domain filtering capabilities
SOCKS5 proxy support for non-HTTP traffic (SSH, database connections)
Configurable allowlist/denylist with wildcard pattern support
Permission prompt UI for new/unknown domains
Traffic logging for security auditing (opt-in)
Integration with existing Seatbelt and Docker sandbox modes
Required: TypeScript/Node.js, networking fundamentals (TCP/IP, HTTP, proxies)
Preferred: Experience with proxy servers (mitmproxy, squid), TLS/SSL, network security
Gaurav Ghosh

Difficulty: Medium | Size: 175 hours | Area: Quality/Testing

Develop a comprehensive behavioral evaluation framework for testing Gemini CLI's agent capabilities against real-world coding scenarios. This includes creating a benchmark suite of coding tasks, measuring success rates across different task categories, and providing regression detection for agent behavior changes. The framework will help maintain quality as the agent evolves and enable contributors to validate their changes.

Evaluation framework with standardized test harness
Benchmark suite covering 50+ coding scenarios across categories (debugging, refactoring, new features, code review)
Automated scoring and success rate metrics
Regression detection system integrated with CI/CD
Dashboard or report generation for evaluation results
Documentation for adding new evaluation scenarios
Baseline metrics for current Gemini CLI version
Required: TypeScript/Node.js, testing methodologies, CI/CD pipelines
Preferred: Experience with LLM evaluation, benchmarking, statistical analysis
Christian Gunderman

Difficulty: Medium | Size: 175 hours | Area: Platform/Quality

Improve the Gemini CLI experience for Windows developers by addressing platform-specific issues, optimizing performance, and ensuring feature parity with macOS/Linux. This includes fixing terminal rendering issues, improving PATH handling, enhancing PowerShell/CMD integration, and creating comprehensive Windows-specific documentation and tooling.

Resolution of top 20 Windows-specific issues from the issue tracker
Native Windows terminal rendering improvements (fixing black backgrounds, theme issues)
PowerShell and CMD integration enhancements
Windows-specific installation and update improvements
Performance optimizations for Windows file system operations
Comprehensive Windows troubleshooting documentation
Automated Windows-specific CI testing pipeline
Required: TypeScript/Node.js, Windows development, PowerShell
Preferred: Windows terminal APIs, cross-platform development experience
Tommaso Sciortino

Difficulty: Medium | Size: 175 hours | Area: IDE/Tooling

Expand Gemini CLI's IDE integration beyond VS Code to provide first-class support for additional popular editors including JetBrains IDEs (IntelliJ, PyCharm, WebStorm), Neovim, and Zed. This includes implementing editor-specific plugins/extensions, establishing a common IDE context protocol, and enabling features like synchronized file state, cursor position awareness, and real-time collaboration between the CLI and editor.

JetBrains IDE plugin with core functionality (open file, apply diff, get context)
Enhanced Neovim integration with bidirectional communication
Zed editor full integration completion
Common IDE Context Protocol specification
Shared context synchronization across CLI and IDE
IDE detection improvements for automatic integration
Documentation and setup guides for each supported IDE
Required: TypeScript/Node.js, at least one IDE plugin development experience
Preferred: JetBrains plugin SDK, Neovim Lua, Zed extension API
Shreya Keshive

Difficulty: Easy-Medium | Size: 90 hours | Area: Observability/DX

Build an in-CLI performance monitoring dashboard that provides developers and contributors with visibility into Gemini CLI's runtime behavior. This includes startup time analysis, memory usage tracking, tool execution profiling, and model latency metrics. The dashboard helps identify performance regressions and optimization opportunities while providing users with session statistics.

In-CLI performance dashboard accessible via /stats or /perf command
Startup time breakdown and optimization suggestions
Memory usage monitoring with warnings for high consumption
Tool execution timing and frequency statistics
Model API latency tracking
Session statistics (tokens used, tools called, files modified)
Performance regression detection for CI integration
Export capability for performance reports
Required: TypeScript/Node.js, CLI/TUI development, performance profiling
Preferred: React/Ink, data visualization, OpenTelemetry
Sehoon Shon

Difficulty: Medium | Size: 175 hours | Area: UX/UI

Enhance the Ink-based terminal UI to provide deep visibility into the agent's internal reasoning and tool execution chains. Currently, complex multi-step tasks can be a "black box" for users. This project involves building an interactive visualization that shows the hierarchy of tasks, tool calls, and model decisions in real-time, allowing users to "step through" execution like a debugger.

Real-time task tree visualization in the Ink TUI
"Step-through" mode where users must approve individual tool calls or agent decisions
Rich-text rendering of tool inputs/outputs with collapsible sections
Improved error state visualization for nested agent failures
User-configurable verbosity levels for different task categories
Required: React (Ink), TypeScript, State Management
Preferred: CLI UX design, experience with asynchronous UI updates, terminal rendering optimizations
Jacob Richman

Difficulty: Hard | Size: 175 hours | Area: Integration/DX

Integrate Gemini CLI with standard debugging protocols (DAP) to provide an interactive debugging companion. The agent should be able to attach to running processes (Node.js, Python, Go, etc.), set breakpoints, inspect stack traces, and query variable states to assist in root-cause analysis of bugs directly within the terminal interface.

Debugger toolset for the agent using Debug Adapter Protocol (DAP)
Automated stack trace analysis and variable inspection tools
Interactive "debug mode" where the agent can pause execution and ask for clarification
Seamless integration with existing run_shell_command for launching debug sessions
Automated fix suggestions based on runtime state inspection
Required: Node.js, Debugger Internals (DAP), TypeScript
Preferred: Experience with VS Code debugger extensions or LLM-driven debugging
Keith Schaab

Difficulty: Hard | Size: 350 hours | Area: Security

Implement a native Windows sandboxing driver using Windows AppContainer or the Windows Sandbox API. Currently, Gemini CLI on Windows relies on Docker Desktop for isolation. A native driver would provide a lighter, faster, and dependency-free experience for Windows users, utilizing OS-level security primitives to isolate file system and network access effectively.

Native Windows sandbox driver implementation using AppContainer
Parity with macOS Seatbelt for file system and registry restrictions
Network isolation using Windows Filtering Platform (WFP)
Automated setup and validation tool for Windows security features
Performance benchmarks showing improvement over Docker-based isolation on Windows
Detailed security audit of the native implementation
Required: C#, Windows APIs (AppContainer, Win32), Node-API/TypeScript
Preferred: Security-focused Windows development, experience with Windows Sandbox
Gaurav Ghosh

Difficulty: Easy | Size: 90 hours | Area: Security/UX

Build a guided configuration "Wizard" to help users set up and manage security policies for different projects. The wizard will simplify the creation of Seatbelt profiles and Docker mount configurations, providing a user-friendly way to define "safe zones" and network permissions for the agent, ensuring that security is accessible even to non-experts.

Interactive gemini sandbox-setup command with a guided UI
Visual dashboard for reviewing and auditing active sandbox permissions
Per-project policy file management (.gemini/policy.json)
Template-based security profiles for common project types (Web, CLI, API)
Real-time policy validation and conflict detection
Required: React (Ink), TypeScript, Security concepts
Preferred: CLI design, experience with security configuration or policy-as-code
Gal Zahavi

Difficulty: Medium | Size: 175 hours | Area: Architecture/Security

Refactor the Gemini CLI's sandboxing logic into a modular, plugin-based driver architecture. Currently, sandboxing logic is split across the CLI and Core packages with many platform-specific conditional checks. This project involves creating a clean SandboxDriver interface that abstracts away the platform-specific details, making it easy to support macOS Seatbelt, Linux Bubblewrap, Windows AppContainer, and Docker through a unified API. This architecture will serve as the foundation for all future native sandboxing implementations.

A well-defined SandboxDriver TypeScript interface and lifecycle manager in @google/gemini-cli-core.
Refactored macOS Seatbelt and Docker/Podman implementations using the new interface.
A "No-Op" driver for unsandboxed environments that provides consistent logging and warnings.
Support for dynamic driver discovery and selection based on host OS and user configuration.
Improved error handling and diagnostic tools for sandbox initialization failures.
Comprehensive unit and integration tests for the new driver lifecycle.
Required: TypeScript, Software Architecture (SOLID principles), Node.js.
Preferred: Experience with plugin systems or abstraction layers, cross-platform development, systems programming concepts.
Taylor Mullen

Difficulty: Hard | Size: 350 hours | Area: Innovation/Multimodal

Transform Gemini CLI into a hands-free, conversational coding partner—your personal J.A.R.V.I.S. for software development. This project builds a real-time, bidirectional voice interface where developers can speak naturally to the agent and hear responses read aloud. Unlike basic speech-to-text wrappers, this leverages Gemini's native multimodal audio capabilities for a fluid, continuous conversation with minimal latency.

Real-time voice input/output using Gemini's native audio streaming API
Multiple activation modes: Voice Activity Detection (VAD), Push-to-Talk (hotkey), Wake Word ("Hey Gemini")
Visual feedback: animated waveform visualizer showing listening/speaking/processing states
Interruption support: speak to stop the agent's current response
Voice-optimized response formatting (concise, spoken-friendly output)
Audio feedback and text-to-speech for agent responses with configurable voices
Noise cancellation and multi-accent robustness
Required: TypeScript/Node.js, Audio processing (PCM, WAV, WebAudio), Real-time streaming
Preferred: Gemini Multimodal API, Voice Activity Detection, Whisper integration, accessibility engineering
Bryan Morgan

Difficulty: Medium | Size: 175 hours | Area: Innovation/UX

Break the "text-only" constraint of the terminal by teaching Gemini CLI to draw. This project enables the agent to generate and render visual artifacts directly in the terminal console—architecture diagrams, dependency graphs, data flow visualizations, and even live previews of generated UI components. Using Mermaid.js for diagrams and terminal image protocols (Sixel, iTerm2, Kitty) for rich graphics, developers can finally see what they're building without context-switching to a browser or IDE.

Imagine asking "explain the authentication flow" and seeing a beautifully rendered sequence diagram appear inline. Or generating a React component and instantly seeing a pixel-perfect preview in your terminal. This transforms the CLI from a text interface into a visual development environment.

Inline rendering of architecture diagrams (Sequence, Class, ERD, Flowcharts) generated from codebase analysis
Live preview of generated frontend components (HTML/CSS/React) rendered as terminal images
Support for multiple terminal image protocols: Sixel, iTerm2 inline images, Kitty graphics protocol
Intelligent ASCII/ANSI box-drawing fallback for unsupported terminals
New tool: visualize for on-demand diagram generation
Integration with explain command ("Explain this architecture" → renders diagram)
Dependency graph visualization for package.json/requirements.txt
Git history visualization (branch graphs, contributor timelines)
Caching layer for rendered images to avoid regeneration
Paradigm shift: No CLI coding tool renders rich graphics—this is genuinely new
Frontend developer appeal: Instant visual feedback without leaving the terminal
Viral demo potential: Terminal diagrams are visually striking and shareable
Required: TypeScript/Node.js, Image processing (Sharp, node-canvas), streaming I/O
Preferred: Terminal graphics protocols (Sixel, iTerm2), Mermaid.js/Graphviz, Puppeteer for HTML rendering
Dmitry Lyalin

Gemini CLI — Project Ideas

Command Palette