Is it possible to dump a RDS database to S3 using AWS Data Pipeline?

Question:

Basically I want to pg_dump my RDS database to S3 using AWS Data Pipeline,

I am not 100% sure if this is possible I got up to the stage where the SqlDataNode wants a selectQuery at which point i am wondering what to do.

Below is my template so far:

Answer:

As mentioned in another answer, AWS Data Pipeline only allows you to dump tables and not the entire DB. If you really want to use pg_dump to dump the entire contents of your DB to S3 using AWS CloudFormation, you can
use Lambda-backed custom resources. Going down that route, you’ll have to write a Lambda function that:

  • Connects to the DB
  • Takes the dump of your DB using pg_dump
  • Uploads it to S3

Leave a Reply