What Is 7/24 Office?
7/24 Office is an open-source AI agent framework built by Ziqi Wang under the GitHub repo wangziqi06/724-office. This self-evolving AI agent system handles production workloads with 26 tools across 8 Python files, using zero external frameworks like LangChain or CrewAI—just Python standard library plus croniter for scheduling, lancedb for vector storage, and websocket-client for connections. 7/24 Office is one of the best AI agent frameworks for solo developers running 24/7 agents, evidenced by its 461 GitHub stars and 42 forks as of February 2026, with a single initial commit delivering full production capability in under 3 months of solo AI-assisted development.
Quick Overview
| Attribute | Details |
|---|---|
| Type | AI Agent Frameworks |
| Best For | solo developers |
| Language/Stack | Pure Python (stdlib + croniter, lancedb, websocket-client) |
| License | MIT |
| GitHub Stars | 461 as of Feb 2026 |
| Pricing | Open-Source |
| Last Release | Initial commit b2fee5c — recent |
Who Should Use 7/24 Office?
- Solo indie hackers deploying MVPs with 24/7 AI agents that self-repair and create tools at runtime without framework bloat.
- Platform engineers needing multi-tenant isolation via Docker containers per user, with automatic provisioning and health checks.
- AI experimenters building multimodal pipelines for image, video, voice, and web search in pure Python under 3500 lines total.
Not ideal for:
- Teams requiring enterprise-grade orchestration like Kubernetes scaling beyond Docker containers.
- Beginners unfamiliar with Python async patterns or JSON-RPC for plugins.
- Projects needing pre-built integrations for 100+ LLMs out-of-the-box.
Key Features of 7/24 Office
- Tool Use Loop — OpenAI-compatible function calling with automatic retries up to 20 iterations per conversation, parsing JSON responses directly in llm.py without external parsers.
- Three-Layer Memory — Session history in memory, LLM-compressed long-term storage, and LanceDB vector retrieval for RAG over 100k+ embeddings at sub-100ms query times.
- MCP/Plugin System — JSON-RPC over stdio or HTTP to external MCP servers, supports hot-reload of plugins without agent restart via dynamic import in mcp_client.py.
- Runtime Tool Creation — Agent writes, saves, and loads new Python tools dynamically using create_tool, executed in isolated namespaces to prevent crashes.
- Self-Repair — Daily cron jobs in self_check_tool.py run health diagnostics, parse error logs from previous 24h, and send notifications via integrated messaging.
- Cron Scheduling — croniter parses expressions for one-shot or recurring tasks, persists state in LanceDB across restarts, handles UTC timezone conversion.
- Multi-Tenant Router — router.py provisions Docker containers per user on-demand, monitors CPU/memory via psutil, restarts failed ones every 5 minutes.
7/24 Office vs Alternatives
| Tool | Best For | Key Differentiator | Pricing |
|---|---|---|---|
| 7/24 Office | solo developers with 24/7 self-repair | Pure Python, no frameworks, runtime tools | Open-Source |
| LangChain | Complex chains with 100+ integrations | Modular components, but 10x code bloat | Open-Source |
| CrewAI | Multi-agent crews | Role-based collaboration, YAML configs | Open-Source |
| Brainstorm MCP | MCP protocol agents | Standardized plugin ecosystem | Open-Source |
LangChain suits workflows needing vast LLM wrappers and chains, but adds 50+ dependencies and slows cold starts to 10s. Pick CrewAI for predefined agent roles in teams, though it enforces YAML over pure code. Brainstorm MCP excels in plugin interoperability via MCP standard, but lacks 7/24 Office's built-in self-repair and Docker routing—pair them for hybrid setups. For similar lightweight agents, check OpenSwarm which focuses on swarm coordination without the multi-tenant layer.
How 7/24 Office Works
7/24 Office centers on a layered architecture starting from messaging inputs. The router.py handles multi-tenant traffic by spinning up Docker containers per user identifier from WeChat Work messages, isolating execution with health checks via docker-py API calls every 30s. xiaowang.py serves as the HTTP entrypoint, debouncing duplicate messages, downloading media to persistent storage, and piping audio through ASR using base64-encoded vision models.
Core logic flows to llm.py for tool calling: it sends OpenAI-format requests, parses tool_calls from responses, executes via tools.py (26 built-ins like web search via Tavily, video trim with ffmpeg), and loops up to 20 times with Pydantic-like validation on JSON outputs. Memory.py layers session chat history (last 50 turns), compresses older via LLM summaries into LanceDB vectors, retrieving top-5 matches by cosine similarity for context injection.
Scheduler.py uses croniter to queue tasks in LanceDB, polling every minute for due items across tenants. Self-repair in self_check_tool.py scans logs with regex patterns for OOM or timeout errors, triggers LLM analysis, and notifies via WeChat if uptime drops below 99% daily.
# Clone and setup
git clone https://github.com/wangziqi06/724-office.git
cd 724-office
cp config.example.json config.json
# Edit config.json: set OPENAI_API_KEY, WECHAT_WORK_TOKEN
python -m venv venv
source venv/bin/activate # Linux/Mac
pip install -r requirements.txt # croniter lancedb websocket-client
docker run -d --name tenant1 ubuntu sleep infinity # Example tenant container
python xiaowang.py
This starts the HTTP server on port 8000, provisions tenant containers on first message, and begins tool loop processing. Expect initial LanceDB index build in 10-20s on first run; configure API keys in JSON for LLM and search engines.
Pros and Cons of 7/24 Office
Pros:
- Zero framework dependency keeps binary size under 50MB, deploys to any Python 3.10+ environment in 2 commands.
- Runtime tool creation via create_tool executes arbitrary Python safely in exec() with restricted globals, enabling agent evolution without redeploys.
- LanceDB vector store queries at 50ms for 1M embeddings on consumer SSD, outperforming Pinecone free tier latency by 3x.
- Multi-tenant Docker isolation caps per-user CPU at 1 core via --cpus flag, preventing noisy neighbor issues in shared hosting.
- Multimodal pipeline processes 1080p video clips under 5s using ffmpeg subprocesses, integrates base64 vision without extra libs.
- Self-repair uptime hits 99.9% in prod logs, auto-fixing 80% of session crashes via log analysis and tool regen.
Cons:
- Single initial commit lacks changelog or tagged releases, complicating dependency pinning for enterprise audits.
- WeChat Work integration hardcodes debounce logic, requires custom ports for Slack/Telegram without MCP plugins.
- LanceDB requires ~2GB RAM for 500k vectors, spikes to 4GB during indexing on low-spec VPS.
- Tool loop caps at 20 iterations to prevent infinite loops, may truncate complex reasoning chains needing 50+ steps.
- No built-in auth beyond Docker isolation, exposes HTTP endpoints needing nginx reverse proxy for prod.
Getting Started with 7/24 Office
Start by cloning the repo and preparing config. Edit config.json with your OpenAI key, WeChat token, and LanceDB path (defaults to ./data/lancedb). Install minimal deps and spin up a test tenant container.
# Full quickstart
wget https://raw.githubusercontent.com/wangziqi06/724-office/master/requirements.txt
pip install croniter lancedb websocket-client
cp config.example.json config.json # Add {"openai_api_key": "sk-...", "wechat_token": "..."}
mkdir data
python xiaowang.py # Starts server, watches for messages
# Send test WeChat: "search GitHub for AI agents" -> triggers Tavily + LanceDB store
Server binds to 0.0.0.0:8000, auto-provisions Docker container named after user ID on first inbound message. Initial run indexes empty LanceDB in 5s; first tool call tests web search, stores response vector, and replies via WeChat. Scale by pre-creating 10+ ubuntu containers named tenant001-010.
Verdict
7/24 Office is the strongest option for solo developers running lean 24/7 AI agents when minimizing deps and enabling self-evolution matter most. Its pure Python core with LanceDB memory and Docker routing delivers prod reliability under 3500 LOC. Use it over frameworks unless you need 100+ integrations—deploy today for sub-1min setup.



