How Wiki Generation Works
1
Repository Analysis
DeepWiki clones and analyzes the repository structure, identifying:
- File types and programming languages
- Directory organization and architecture patterns
- Dependencies and configuration files
- Documentation and README files
- Test structures and examples
Repository successfully analyzed and indexed
2
Code Embedding
Creates vector embeddings of code content for intelligent retrieval:
- Function and class definitions
- Comments and documentation
- Configuration settings
- API endpoints and interfaces
- Database schemas and models
Embeddings enable semantic search and context-aware documentation generation.
3
AI-Powered Documentation
Uses your selected AI model to generate:
- Project overview and purpose
- Installation and setup instructions
- Architecture explanations
- Component relationships
- Usage examples and best practices
Different AI models produce varying documentation styles. Experiment to find your preference.
4
Visual Diagram Generation
Automatically creates Mermaid diagrams showing:
- System architecture
- Data flow and processing
- Component relationships
- Database schemas
- API endpoint structures
Interactive diagrams generated and embedded in documentation
Generation Options
Model Selection
Choose the best AI model for your documentation needs:Best for: Fast, reliable documentation generation
gemini-2.0-flash
- Latest model with excellent speed/quality balancegemini-1.5-flash
- Previous generation, very stablegemini-1.0-pro
- More detailed but slower
- Excellent code understanding
- Fast generation times
- Good diagram creation
- Strong multilingual support
Generation Parameters
Force Regeneration
Force Regeneration
When to use:
- Repository has been significantly updated
- You want to try a different AI model
- Previous generation had errors
- You want fresh documentation with latest model improvements
Force regeneration will overwrite existing cached documentation. Consider backing up important custom modifications.
Repository Access Tokens
Repository Access Tokens
Required for:
- Private GitHub repositories
- Private GitLab repositories
- Private BitBucket repositories
- Organizations with restricted access
- GitHub:
repo
scope (full repository access) - GitLab:
read_repository
scope - BitBucket:
Repositories: Read
permission
Tokens are used only for repository access and are not stored permanently.
Custom Model Parameters
Custom Model Parameters
Advanced users can adjust model parameters:Parameter effects:
- Lower temperature (0.1-0.3): More consistent, factual documentation
- Higher temperature (0.7-0.9): More creative, varied explanations
- Lower top_p (0.3-0.5): More focused responses
- Higher top_p (0.8-1.0): More diverse vocabulary and examples
Repository Types & Optimization
Programming Languages
DeepWiki optimizes documentation generation for different languages:JavaScript/TypeScript:
- React, Vue, Angular component analysis
- Node.js server architecture
- API endpoint documentation
- Package.json and dependency analysis
- Django/Flask application structure
- FastAPI endpoint documentation
- Class and function analysis
- Requirements and virtual environment setup
- Express.js servers → API endpoint documentation
- React apps → Component hierarchy and props
- Django projects → Model, view, template analysis
Repository Size Optimization
Small Repositories (< 50 files)
Small Repositories (< 50 files)
Characteristics:
- Fast generation (30 seconds - 2 minutes)
- Comprehensive coverage of all files
- Detailed analysis of each component
- Use any model (all will perform well)
- Enable detailed analysis
- Include all file types
- Generate comprehensive diagrams
- Personal projects
- Small libraries
- Configuration repositories
- Simple applications
Medium Repositories (50-500 files)
Medium Repositories (50-500 files)
Characteristics:
- Moderate generation time (2-10 minutes)
- Focus on important files and patterns
- Good balance of detail and overview
- Use fast models like
gemini-2.0-flash
- Focus on core directories
- Skip generated/compiled files
- Prioritize documented code
- Open source libraries
- Medium-sized applications
- Framework implementations
- Multi-component projects
Large Repositories (500+ files)
Large Repositories (500+ files)
Characteristics:
- Longer generation time (10-30 minutes)
- High-level architecture focus
- Selective detailed analysis
- Emphasis on main components
- Use efficient models (Gemini Flash series)
- Configure file filters
- Focus on main source directories
- Skip test files for initial generation
- Use incremental regeneration
- Large frameworks (React, Vue, Angular)
- Enterprise applications
- Monorepos with multiple projects
- Complex distributed systems
Customizing Generated Documentation
Content Customization
1
Repository-Specific Prompts
DeepWiki automatically adapts to repository types, but you can customize the focus:
2
Documentation Depth
Control the level of detail in generated documentation:
- High Detail: Complete analysis of all components
- Medium Detail: Focus on public APIs and main components
- Overview: High-level architecture and key features only
3
Diagram Types
Specify which types of diagrams to generate:
- Architecture diagrams: System components and relationships
- Data flow diagrams: Information processing flow
- Database diagrams: Schema and relationships
- API diagrams: Endpoint structure and data flow
- Process diagrams: Workflow and business logic
Output Format Options
Format: Hierarchical pages with cross-referencesBest for:
- General documentation browsing
- Team onboarding
- Project understanding
- Code exploration
- Navigation tree
- Search functionality
- Cross-page linking
- Embedded diagrams
Quality Optimization
Improving Documentation Quality
Repository Preparation
Repository Preparation
Before generation, optimize your repository:
- Update README.md with current project information
- Add code comments for complex logic
- Update package.json/requirements.txt with current dependencies
- Add configuration examples (.env.example, config samples)
- Include API documentation (OpenAPI specs, GraphQL schemas)
Model Selection Strategy
Model Selection Strategy
Match models to repository characteristics:
- Simple projects: Use Gemini Flash for speed
- Complex architectures: Use GPT-4o for depth
- API-heavy projects: Use Claude 3.5 Sonnet via OpenRouter
- Data projects: Use models with strong analytical capabilities
- Generate with fast model first (Gemini Flash)
- If quality insufficient, regenerate with premium model (GPT-4o)
- Compare results and choose best approach for similar projects
Iterative Improvement
Iterative Improvement
Use the Ask feature to improve documentation:
- Generate initial wiki
- Ask specific questions about unclear sections
- Use Deep Research for complex topics
- Incorporate answers into understanding
- Regenerate sections with better context
Troubleshooting Generation Issues
Incomplete Generation
Incomplete Generation
Symptoms:
- Missing pages or sections
- Truncated content
- Error messages during generation
- Check API limits: Verify your AI provider has sufficient quota
- Reduce scope: Start with smaller directories or file sets
- Try different model: Some models handle large contexts better
- Check logs: Look for specific error messages in API logs
Poor Quality Results
Poor Quality Results
Symptoms:
- Generic or inaccurate descriptions
- Missing technical details
- Incorrect architecture analysis
- Improve repository documentation: Add README, comments, examples
- Use higher-quality models: Switch from Flash to GPT-4o
- Enable Deep Research: For complex analysis tasks
- Provide more context: Add configuration files, API specs
Generation Timeout
Generation Timeout
Symptoms:
- Process hangs or takes extremely long
- Browser timeout errors
- Partial results only
- Break into smaller chunks: Process subdirectories separately
- Use faster models: Gemini Flash series for speed
- Increase timeout limits: In API configuration
- Optimize repository: Remove large binary files, generated code
Advanced Features
Multi-Language Support
DeepWiki automatically detects and optimizes for repository languages:Automatic detection of:
- Primary language (most files)
- Secondary languages
- Framework combinations
- Build system integration
- Language-specific setup instructions
- Cross-language integration points
- Build pipeline explanation
- Dependency management per language
Integration Workflows
1
CI/CD Integration
Automate wiki generation in your development pipeline:
2
Webhook Integration
Automatically update documentation on repository changes: