#Cost Optimisation

Posts about cost optimisation.

Engineering · 7 min read

The 57% number — how we cut the Tata Group BigQuery bill in half

₹100 Cr / ~$12M in proven savings across a year-plus engagement. The four levers that did the heavy lifting, the lever I expected to win that didn't, and the post-engagement playbook that became a Searce managed service.

Engineering · 5 min read

Cost-aware agent dispatch — when the cheap agent is enough

Not every query needs the production agent. A cost-aware dispatcher decides whether to route to the cheap-and-fast agent or the expensive-and-thorough one. Same UX, dramatically lower bill.
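To make the routing idea concrete, here is a minimal sketch of what such a dispatcher could look like. Everything in it is an assumption for illustration: the agent names, the relative cost units, the keyword list, and the threshold are hypothetical, and the heuristic is a stand-in for whatever signal the real dispatcher uses.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Agent:
    name: str
    cost_per_call: float  # hypothetical relative cost units, not real prices


# Hypothetical agents: one cheap and fast, one expensive and thorough.
CHEAP = Agent("cheap-fast", 1.0)
THOROUGH = Agent("expensive-thorough", 20.0)

# Hypothetical phrases that hint a query needs deeper reasoning.
HARD_MARKERS = ("compare", "explain why", "step by step", "reconcile")


def complexity_score(query: str) -> float:
    """Crude score in [0, 1]: long queries and 'hard' phrases score higher."""
    words = query.lower().split()
    text = " ".join(words)
    length_part = min(len(words) / 50, 1.0) * 0.5
    marker_part = 0.5 if any(marker in text for marker in HARD_MARKERS) else 0.0
    return length_part + marker_part


def dispatch(query: str, threshold: float = 0.4) -> Agent:
    """Route to the thorough agent only when the score reaches the threshold."""
    return THOROUGH if complexity_score(query) >= threshold else CHEAP


if __name__ == "__main__":
    for q in ("What is my balance?",
              "Compare last quarter's spend with this one and explain why it rose"):
        print(f"{q!r} -> {dispatch(q).name}")
```

The caller sees a single `dispatch` entry point either way, which is what keeps the UX identical while the cheap path absorbs the easy traffic.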

Engineering · 6 min read

Egress costs — the gotcha that kills cloud-arbitrage plans

Cross-cloud data movement is billed by the GB. The bill is invisible until it isn't. A multi-region or multi-cloud architecture that doesn't model egress costs in design will discover them in production.
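Modelling egress at design time can start as simple arithmetic. The sketch below is illustrative only: the flow names, volumes, and the per-GB price are placeholder assumptions, not any provider's actual rates, which vary by provider, region pair, and volume tier.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Flow:
    name: str
    gb_per_day: float
    price_per_gb: float  # placeholder rate; look up the real one for your route


def monthly_egress_cost(flows: list[Flow], days: int = 30) -> float:
    """Sum per-GB egress charges across all cross-cloud or cross-region flows."""
    return sum(f.gb_per_day * days * f.price_per_gb for f in flows)


if __name__ == "__main__":
    # Hypothetical flows for a two-cloud design.
    flows = [
        Flow("warehouse-to-other-cloud", gb_per_day=500, price_per_gb=0.10),
        Flow("cross-region-replication", gb_per_day=200, price_per_gb=0.02),
    ]
    print(f"Estimated monthly egress: {monthly_egress_cost(flows):.2f}")
```

Even a rough table of flows like this, filled in before the architecture is approved, turns egress from a production surprise into a line item.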