Skip to content

AWS Glacier Uploader: Split, hash, & upload large files to AWS Glacier. Automates multipart uploads, tracks data in CSV log.

Notifications You must be signed in to change notification settings

igmrlm/AWS_Glacier_MPU

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 

Repository files navigation

AWS Glacier Automatic Multipart Uploader

This Python program simplifies the process of splitting large files, calculating their tree hash, and uploading them to AWS Glacier storage. It automates multipart uploads and tracks the uploaded data in a CSV log.

Prerequisites

  • Python 3.x
  • AWS CLI configured with appropriate credentials and default region
  • The python modules: argparse, math, botocore.utils, boto3, datetime

python MultipartUpload.py input_file part_size vault_name

Example:

python MultipartUpload.py '/path/to/large_file.zip' 16 MyGlacierVault

'/path/to/large_file.zip' -- The file you're uploading.

16 -- The size in MiB of each piece.

MyGlacierVault -- The vault you're storing the data in.

The program will split the file, calculate the tree hash, upload the parts, and generate a CSV log (MyGlacierVault.csv) containing archive IDs, checksums, locations, original filenames, and upload timestamps.

Notes

  • Make sure your AWS CLI is already configured correctly to allow access to Glacier.
  • The program will generate a CSV log in the current directory.
  • The tree hash calculation uses AWS's botocore.utils.
  • The CSV log is useful for tracking uploads and managing your Glacier data.

For more information about AWS Glacier, refer to the official AWS Glacier Documentation.

About

AWS Glacier Uploader: Split, hash, & upload large files to AWS Glacier. Automates multipart uploads, tracks data in CSV log.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages