
ATLAS-5239: Optimize Atlas Async Replication #567

Open
jackhalfalltrades wants to merge 2 commits into apache:master from jackhalfalltrades:ATLAS-5239


@jackhalfalltrades
Contributor

What changes were proposed in this pull request?

This patch addresses a performance degradation observed in Atlas async table replication by reducing overhead in the async import path.

The following optimizations were implemented:

- Introduced caching for AtlasAsyncImportRequest to store intermediate state and avoid repeated serialization/deserialization and frequent updates during entity processing.
- Reduced the number of graph transactions/commits by more than 50% by consolidating import and bookkeeping operations, significantly lowering transaction overhead.
- Updated the processedEntities result structure from List to Set to avoid repeated reconstruction and improve lookup efficiency while processing entities.

These changes reduce per-entity processing overhead and significantly improve async import throughput without introducing any functional or behavioral changes.
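
For reviewers who want a concrete picture of the second and third optimizations, here is a minimal sketch. It is not the code in this patch: `AsyncImportSketch`, `CountingGraph`, `ImportState`, and both `import...` methods are hypothetical names made up for illustration, and the "graph" only counts commits. It assumes the old path committed once for the entity write and once more for the bookkeeping update, and tracked processed GUIDs in a List.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

public class AsyncImportSketch {

    /** Hypothetical stand-in for the graph: it only counts commits. */
    static final class CountingGraph {
        int commits;
        void commit() { commits++; }
    }

    /** Hypothetical in-memory state kept for one import request. */
    static final class ImportState {
        // A Set gives constant-time membership checks and rejects duplicates.
        final Set<String> processedEntities = new LinkedHashSet<>();
    }

    /** Assumed "before" shape: List lookup, two commits per entity. */
    static void importUnconsolidated(List<String> guids, List<String> processed, CountingGraph graph) {
        for (String guid : guids) {
            if (processed.contains(guid)) {
                continue; // linear scan of the List on every entity
            }
            graph.commit();       // commit for the entity write
            processed.add(guid);
            graph.commit();       // separate commit for the bookkeeping update
        }
    }

    /** Assumed "after" shape: Set lookup, one commit covering both steps. */
    static void importConsolidated(List<String> guids, ImportState state, CountingGraph graph) {
        for (String guid : guids) {
            if (!state.processedEntities.add(guid)) {
                continue; // add() returns false when the GUID was already recorded
            }
            graph.commit();       // entity write and bookkeeping share one commit
        }
    }

    public static void main(String[] args) {
        List<String> guids = List.of("a", "b", "a", "c");

        CountingGraph before = new CountingGraph();
        importUnconsolidated(guids, new ArrayList<>(), before);

        CountingGraph after = new CountingGraph();
        ImportState state = new ImportState();
        importConsolidated(guids, state, after);

        // prints: before=6 after=3 processed=[a, b, c]
        System.out.println("before=" + before.commits + " after=" + after.commits
                + " processed=" + state.processedEntities);
    }
}
```

In this toy model the commit count drops by exactly half; the "more than 50%" figure above comes from the real import path, which presumably removes additional commits that the sketch does not model.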

How was this patch tested?

Manual testing and unit tests.
