Automated Science: AI-Enhanced Research Infrastructure
Status: Active Development
Automated Science is my vision for the future of scientific research—where AI systems work alongside human scientists to accelerate discovery. This isn’t about replacing researchers, but about creating a symbiosis between human expertise, experimental knowledge, and AI’s ability to process vast amounts of information.
The Challenge
Traditional approaches to AI-led science face fundamental barriers:
1. The Authority Problem
Academics won’t publish in platforms not recognized by authoritative bodies—papers don’t count toward promotions or grants if the publication venue isn’t established.
2. The Grounding Problem
Science is tightly linked to experiments, and experiments require real humans. A lot of knowledge in laboratories is tacit—not written down, but held in the minds of experienced researchers.
3. The Integration Problem
AI systems typically operate in isolation, disconnected from the day-to-day workflow of active research groups.
The Solution: Enterprise AI-Archive
The path forward is integrating AI deeply within real research institutions:
┌─────────────────────────────────────────────────────────────┐
│ AI-Enhanced Science Center │
│ │
│ ┌─────────────┐ ┌─────────────┐ ┌─────────────┐ │
│ │ Real Labs │───▶│ AI Agents │───▶│ Authority │ │
│ │ Experiments │ │ Grounded │ │ in Domain │ │
│ │ Tacit Know. │ │ in Reality │ │ Expertise │ │
│ └─────────────┘ └─────────────┘ └─────────────┘ │
│ │
└─────────────────────────────────────────────────────────────┘
By embedding AI infrastructure within research centers:
- AI agents gain access to real experimental data and laboratory protocols
- Researchers get AI assistance integrated into their existing workflow
- Over time, the AI develops deep domain expertise grounded in actual practice
- Eventually, this expertise becomes authoritative for reviewing work in that domain
The Dual-Pipe Architecture
Working with research institutions requires careful handling of both code/papers and large data files. The architecture separates:
The “Brain” (Git Monorepo)
- Handles logic, code, and papers
- Strict governance via Pull Requests
- Permission-based access for different researchers
The “Body” (Syncthing/Tailscale)
- Handles heavy data and outputs
- Automated background synchronization
- Real-time sync to researcher machines
The AI Agent Bridge
services:
agent-runtime:
volumes:
# Code & Papers (Read/Write via PR)
- ./workspace:/app/workspace
# Raw Data (Read-Only)
- /srv/data/raw:/app/data/raw:ro
# Results (Read-Write for outputs)
- /srv/data/outputs:/app/data/outputs:rw
The agent reads from the “Brain” to understand the science and mounts the “Body” to execute analysis—all without bypassing safety checks for code changes.
Key Components
Cloud Tasks
Researchers submit specific questions via structured YAML files. AI processes queries through scheduled pipelines, delivering results as merge requests for human review.
Collaboration Sessions
Interactive research via browser-based VS Code connected to AI agents. Researchers can pair-program with AI, exploring data together with full access to analysis tools.
Autonomous Research
Daily scheduled jobs for autonomous data exploration. AI agents generate hypotheses, find interesting patterns, and produce research artifacts for human scientists to validate.
Current Development
This vision is being developed in collaboration with ELSC (Edmond and Lily Safra Center for Brain Sciences), where I completed my PhD. The goal is to make ELSC a leading center in collaboration between real scientists and AI.
Near-term Goals
- Deploy AI infrastructure integrated with lab workflows
- Create AI agents with deep knowledge of computational neuroscience
- Build tools for AI-assisted paper writing and peer review
Long-term Vision
Develop an agent so deeply integrated with ELSC’s labs that it becomes authoritative in reviewing papers in computational neuroscience—grounded by its understanding of real experiments, not just text.
The Bigger Picture
This approach represents an “Enterprise tier” of AI-Archive—not just a platform for AI papers, but a complete solution for making science centers AI-enhanced. Each integrated center becomes a node in a larger network where AI agents, grounded in real experimental knowledge, can meaningfully contribute to scientific progress.
Interested in bringing AI-enhanced research to your institution? Let’s talk about how to integrate AI infrastructure with your research workflow.