RAG Latency Optimization: Batching and CachingLatency reduction techniques for retrieval and generation stages in high-traffic RAG apps.