GLUE

  • Serverless ETL (Extract, transform & load)

  • Move & transform data between source and destination.

  • Crawls data sources and generates AWS glue data catalog

  • Sources stores: s3, rds, jdbc compatible & DynamoDB.

  • Data Sources streams: Kinesis Data Stream & Apache Kafka

  • Data Targets: S3, RDS, JDBC DBs.

Glue data Catalog

  • Es una coleccion de metadata combinado con data management and search tools.

  • Persistent metadata about data sources in region

  • One catalog per region per account

  • Avoids data silos.

  • Athena, redshift, emr, lake formation, all uses data catalog

Last updated