AWS CloudSearch export/download data

Question:

I have about 1.5 million documents in an AWS CloudSearch index. It is costing me too much and I wish to migrate off the service. I have been unable to see how I can download or export my documents from the index. Is it possible?

Answer:

I had a similar need: I had to walk through my entire CloudSearch domain (more than the 10,000-hit limit) to generate a file.

I used a Node.js script to handle that, like this:
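What follows is a minimal sketch of this kind of script, not necessarily the exact code I ran. It assumes the AWS SDK for JavaScript v2 (the aws-sdk package) and its CloudSearchDomain client, and it pages with a cursor, which is the documented way to get past the 10,000-hit limit. The environment variable CLOUDSEARCH_ENDPOINT, the function name fetchAllDocuments and the page size of 1000 are illustrative choices.

```javascript
const fs = require('fs');

// Walk every hit in the domain with a cursor instead of start/size.
// `search` is any function that takes CloudSearch search params and returns
// a promise of the response; it is injected so the loop can run without AWS.
async function fetchAllDocuments(search, pageSize = 1000) {
  const documents = [];
  let cursor = 'initial'; // starting value for cursor-based paging
  for (;;) {
    const response = await search({
      query: 'matchall',          // match every document...
      queryParser: 'structured',  // ...using the structured query parser
      size: pageSize,
      cursor,
    });
    const hits = response.hits.hit;
    if (hits.length === 0) break; // an empty page means the walk is finished
    for (const hit of hits) {
      documents.push({ id: hit.id, fields: hit.fields });
    }
    cursor = response.hits.cursor; // cursor for the next page
  }
  return documents;
}

// Command-line entry point: node script.js fileToCreate.json
// CLOUDSEARCH_ENDPOINT is a hypothetical variable holding the search endpoint.
if (process.env.CLOUDSEARCH_ENDPOINT && process.argv[2]) {
  const AWS = require('aws-sdk'); // assumption: AWS SDK for JavaScript v2
  const client = new AWS.CloudSearchDomain({
    endpoint: process.env.CLOUDSEARCH_ENDPOINT,
    region: process.env.AWS_REGION,
  });
  fetchAllDocuments((params) => client.search(params).promise())
    .then((docs) => fs.writeFileSync(process.argv[2], JSON.stringify(docs)))
    .catch((err) => {
      console.error(err);
      process.exitCode = 1;
    });
}
```

Everything is collected into one array and written in a single call, which keeps the sketch short but holds the whole export in memory.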

The script has to be called from the command line: node script.js fileToCreate.json

Note: I don't know whether this works correctly on a search domain with 1.5 million documents. The risk I foresee is the size of the JSON variable held in memory, so the script may have to be adapted (maybe writing a file every 100,000 documents?).
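One way to do that adaptation, as an untested-at-scale sketch: buffer documents and write a numbered file each time the buffer fills, so memory holds at most one batch. The helper name createBatchWriter and the file naming scheme (basePath.0.json, basePath.1.json, ...) are hypothetical.

```javascript
const fs = require('fs');

// Buffer documents and write a numbered JSON file each time the buffer
// reaches `batchSize`, so memory use is bounded by a single batch.
function createBatchWriter(basePath, batchSize = 100000) {
  let buffer = [];
  const files = [];

  const flush = () => {
    if (buffer.length === 0) return;
    const file = `${basePath}.${files.length}.json`; // hypothetical naming scheme
    fs.writeFileSync(file, JSON.stringify(buffer));
    files.push(file);
    buffer = [];
  };

  return {
    // Queue one document; flush to disk when the batch is full.
    add(doc) {
      buffer.push(doc);
      if (buffer.length >= batchSize) flush();
    },
    // Write whatever is left and return the list of files created.
    close() {
      flush();
      return files;
    },
  };
}
```

The export loop would then call add() for each document it retrieves instead of pushing it onto one large array, and call close() once the last page has been read.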
