INSURTECH: Company specialized in embedded insurance and digital device protection

Centralizing multiple data sources on AWS through a scalable, automated Data Lake

Building a solid foundation for data

THE STARTING POINT

Insurtech is a company specialized in embedded insurance and device protection, operating within a digital ecosystem where information is a key business asset.

Its data architecture is distributed across multiple sources, including internal APIs, PostgreSQL databases, ElasticSearch, and various operational tools—all functioning independently but without a common unification layer.

This organic system growth created a scenario where the volume, variety, and frequency of data required an evolution toward a more structured architecture capable of supporting consistent and scalable analytics.

The project’s objective was to design a data architecture on AWS that would enable centralized, standardized, and automated information management, establishing a solid foundation for advanced analytics and decision-making based on reliable data.

PROJECT PHASES

  • Phase 1: The target architecture on AWS was defined together with the Insurtech team, establishing the Data Lake model, storage structure, and data organization principles with a focus on scalability and maintainability.
  • Phase 2: Progressive integration of the main data sources was carried out, connecting internal APIs, PostgreSQL, and ElasticSearch, and establishing a centralized ingestion flow toward the Data Lake.
  • Phase 3: Data processing automation was implemented using AWS Glue, AWS Lambda, and AWS DMS, standardizing information transformation and loading while reducing the need for manual intervention in operational flows.
  • Phase 4: The architecture was consolidated as a modular, extensible system ready for growth, enabling the incorporation of new sources without redesigning the system’s foundation, and preparing the environment for advanced analytics use cases, BI, and future data exploitation initiatives.
0

Centralized Data Lake

+ 0 %

Data flow automation

0

Architecture restructurings

0 %

Automated ingestion flow

0

AWS coordinated services

0 %

Automated ingestion and transformation