Skip to content

2024

3 Times Performance Improvement for Generative AI within a Kafka Pipeline System

Kafka-Cover.jpg

Generative AI went viral in the last year, many use cases show the ability of generative AI, such as text, image, video, music generation, and more. It helps people to create more content with less effort. However, unlike conventional APIs, the Gen-AI API often has a relatively longer latency and higher costs due to the need for better (more expensive) GPU resources to enhance performance.

Spring Boot 3: build the efficiency Idempotent API by Redis

Idempotency API means that the data / system state will be the same no matter how many times the API is successfully called with the same request body/parameter.

We've described why we need and how to design an idempotency API mechanism in the article How to design an efficient Idempotency API, If you haven't read it before, please refer to it.

This article will focus on implementing it in an existing project, which is my event-sourcing POC project. Here are the implementing steps: