Written by: Denise Schlesinger
Principal Cloud Solution Architect
Microsoft
- Ops for GenAI apps
- GenAI optimization
- GenAI completion safety
- GenAI completion quality
- Risk mitigation for GenAI apps: red team, risk assessment
- Testing and automation for GenAI apps
- Accuracy
  - Chunking strategy
  - Consider editing content format for document processing efficiency
- Cost
  - Chunk and storage optimization
  - Cheaper/faster model
- Latency
  - Faster model
  - Use PTUs (provisioned throughput units)
- Monitor and iterate
- Identify
  - Use cases
  - Requirements
  - Infrastructure
- Pre-evaluate
  - Chunking strategy
  - Embedding models
  - Vector DB
  - LLMs
  - Evaluate prompts
- Development
- Optimization
- Deployment
- Create a RAG app with AI Search
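
The outline lists "Chunking strategy" under both accuracy and pre-evaluation. As a rough illustration of what one such strategy might look like, here is a minimal sketch of fixed-size chunking with overlap. The function name `chunk_text` and the default `chunk_size` and `overlap` values are assumptions made for this example and are not taken from the original material; a real pipeline would more likely split on tokens, sentences, or document structure.

```python
def chunk_text(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into fixed-size character windows that overlap.

    Hypothetical helper for illustration only: sizes are in characters,
    not tokens, and the defaults are arbitrary.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    step = chunk_size - overlap  # how far each new window advances
    chunks: list[str] = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once a window has reached the end of the text,
        # so no trailing chunk is fully contained in the previous one.
        if start + chunk_size >= len(text):
            break
    return chunks
```

Larger chunks with little overlap mean fewer embeddings to compute and store, which helps cost, while smaller chunks with more overlap can make retrieval more precise, which helps accuracy. Trying a few settings during pre-evaluation is one way to choose between them.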