What powers the work.
A self-hosted AI agent running on Oracle Cloud, 16+ tool capabilities, custom integrations, and an always-on cron scheduler. Built and maintained by Chirag Poojari.
Search the web via Tavily. Finds relevant pages, extracts content, and summarizes results. Powers research across HN, Reddit, arXiv, and the open web.
Full browser automation. Navigate pages, click buttons, fill forms, take screenshots. Runs headless Chromium for complex web interactions.
Execute shell commands, manage background processes, run scripts. Full access to git, docker, nginx, and every tool on this Ubuntu server.
Read, write, patch, and search files with surgical precision. Line-level edits without touching the rest. This entire portfolio was built with these tools.
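A minimal sketch of what a line-level edit can look like, assuming a 1-indexed, inclusive line range. The `patch_lines` helper is a hypothetical illustration, not the tool Hermes actually uses.

```python
def patch_lines(text: str, start: int, end: int, replacement: list[str]) -> str:
    """Replace lines start..end (1-indexed, inclusive) and leave the rest untouched."""
    lines = text.splitlines(keepends=True)
    if not (1 <= start <= end <= len(lines)):
        raise ValueError(f"line range {start}-{end} is outside 1-{len(lines)}")
    # Make sure every replacement line ends with a newline before splicing it in.
    new_lines = [line if line.endswith("\n") else line + "\n" for line in replacement]
    lines[start - 1:end] = new_lines
    return "".join(lines)


if __name__ == "__main__":
    original = "title\nold body\nfooter\n"
    print(patch_lines(original, 2, 2, ["new body"]))
```

Only the targeted range is rewritten; every other line is copied through byte for byte.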
Run Python in a sandboxed environment. Full access to pandas, numpy, requests. Great for data analysis, prototyping, and one-off scripts.
Analyze images with AI vision. Describe screenshots, read text from images, identify objects. Incredibly useful for debugging UI issues.
Generate images from text descriptions. Integrated with DALL-E, Krea, and other models. Used for creating visuals for posts and presentations.
Convert text to natural-sounding speech. Edge TTS is free and works great for quick voice memos and content narration.
100+ specialized procedural modules covering DevOps, research, media, trading, and coding. Each skill teaches Hermes a specific workflow, and I've built several custom ones for my own use.
Persistent memory across sessions. Remembers who I am, my preferences, past conversations, and lessons learned. Gets smarter the more I use it.
Schedule recurring tasks. Four cron jobs are currently active: a daily job scraper, a trading bot market scan, trade execution, and a P&L report. All run autonomously on this server.
Spawn sub-agents to work in parallel. Break complex tasks into smaller pieces, delegate to workers, synthesize results. Multi-agent orchestration.
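The fan-out and fan-in pattern described above can be sketched with Python's standard thread pool. Here `run_subtask` is a hypothetical stand-in for a sub-agent, and the "synthesis" step is just a join; a real orchestrator would call a model at both points.

```python
from concurrent.futures import ThreadPoolExecutor


def run_subtask(subtask: str) -> str:
    # Hypothetical worker: a real sub-agent would do model calls or tool use here.
    return f"done: {subtask}"


def orchestrate(subtasks: list[str], max_workers: int = 4) -> str:
    # Fan out: each subtask runs in its own worker thread.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map returns results in the same order as the inputs.
        results = list(pool.map(run_subtask, subtasks))
    # Fan in: combine the partial results into a single report.
    return "\n".join(results)


if __name__ == "__main__":
    print(orchestrate(["scan repo", "summarize issues", "draft changelog"]))
```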
My own AI content engine. Scrapes trending topics across HN, Reddit, YouTube, Twitter, GitHub -- then auto-generates and publishes content. This is what got me into AI.
Send messages across Telegram, Discord, Slack, Signal, WhatsApp. I use Telegram as my primary interface to Hermes -- chatting with my own AI agent from my phone.
Full-text search across all past conversations. FTS5-backed search through SQLite. Find that thing you discussed months ago in seconds.
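As a rough illustration of FTS5-backed search, here is a small sketch using Python's built-in `sqlite3` module. The `messages` table and its `session` and `body` columns are assumptions made for the example, not the actual schema, and it needs an SQLite build with FTS5 enabled.

```python
import sqlite3


def build_index(messages: list[tuple[str, str]]) -> sqlite3.Connection:
    """Create an in-memory FTS5 index from (session, body) pairs."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical schema: one row per message, every column is full-text indexed.
    conn.execute("CREATE VIRTUAL TABLE messages USING fts5(session, body)")
    conn.executemany("INSERT INTO messages (session, body) VALUES (?, ?)", messages)
    return conn


def search(conn: sqlite3.Connection, query: str, limit: int = 5) -> list[tuple[str, str]]:
    """Return the best-matching rows; `rank` orders results by FTS5 relevance."""
    rows = conn.execute(
        "SELECT session, body FROM messages WHERE messages MATCH ? ORDER BY rank LIMIT ?",
        (query, limit),
    )
    return rows.fetchall()


if __name__ == "__main__":
    conn = build_index([
        ("2024-01", "set up nginx reverse proxy for the portfolio"),
        ("2024-03", "backtest the trading bot strategy"),
    ])
    print(search(conn, "nginx"))
```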
Ask clarifying questions when a task is ambiguous. Makes sure Hermes builds the right thing, not just the first thing. Saves a lot of back-and-forth.
Control the computer with mouse and keyboard. Full desktop automation for GUI-based workflows. Optimized for macOS, though the same approach works on this Linux server too.
Analyze video content, extract frames, transcribe audio. Toggle on when needed.
Search and post on X/Twitter. Requires xurl CLI setup.
Run multiple AI models in parallel and synthesize outputs.
CodeGraph MCP
Deep code intelligence powered by a local graph database. Understands call graphs, dependencies, and code structure across entire projects. Deployed and indexed on this server for trend-jack-engine, iics-alm-deploy, and other active repos.
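To give a feel for what a call graph is, here is a toy sketch built on Python's standard `ast` module. It is not how CodeGraph MCP works internally: it handles a single Python source string, only sees direct calls by bare name, and attributes calls inside nested functions to the outer function as well.

```python
import ast
from collections import defaultdict


def build_call_graph(source: str) -> dict[str, set[str]]:
    """Map each function name to the set of bare names it calls."""
    tree = ast.parse(source)
    graph: dict[str, set[str]] = defaultdict(set)
    for node in ast.walk(tree):
        if isinstance(node, ast.FunctionDef):
            # Walk the function body and record every call of the form name(...).
            for child in ast.walk(node):
                if isinstance(child, ast.Call) and isinstance(child.func, ast.Name):
                    graph[node.name].add(child.func.id)
    return dict(graph)


if __name__ == "__main__":
    sample = "def load():\n    return parse(read())\n\ndef parse(x):\n    return x\n"
    print(build_call_graph(sample))
```

Functions that make no calls do not appear as keys in the result.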