Let's talk a bit about storage in GCP.
Hard disks for VMs is persistent and is block storage. In GCP we call is persistent storage. (block storage)
- As mentioned, this is similar to a hard drive of a computer.
- Only one block storage per VM. A block storage will map to one VM
- However, one VM can have different block storages.
- To avoid confusion on the above statements, look at the picture below
- Direct Attached storage is like a hard disk and Storage Area Network is like a pool of storage devices connected via a high speed network.
- GCP provides two options
- Persistent Disks
- Connected to a block storage via high speed network.
- Zonal - Data replicated in one zone
- Regional - Data replicated in multiple zones
- Logical to use Regional option for durability.
- By default a 10GB boot disk (persistent) is attached to a VM when we create a VM.
- Local SSDs
- Local block storage.
- Faster
- High performance
File store is for file storage and sharing between multiple VMs.
- Pretty logical, use file storages to store and share files across VMs.
Cloud storage in GCP is the object storage.
- Create a container (bucket in GCP) to store objects (use console)
- Bucket name has to be unique (globally)
- Location type
- Region (low latency)
- Dual region (2 regions) [High availability and low latency across 2 regions]
- Multi region (multiple regions) [High availability]
- Storage class
- Standard
- Short term
- Frequently accessed
- Near line
- Backups
- Data accessed less than one time a month
- Min storage duration is 1 month (30 days)
- Cold line
- Disaster recovery
- Data accessed less than once a quarter
- Min storage duration is 90 days
- Archive
- Long term data preservation (backup)
- Data accessed less than once a year
- Min storage duration is 365 days.
- Inexpensive
- Auto scales (as you add storage)
- Stored as key-value pair
- Access control at object level
- REST API available to access and modify stored objects
- Command line also available (gsutil command)
- Now logically one can store any type of data in the object storage.
- But some of these can be less frequently accessed (e.g backup files)
- Object storage helps to optimize costs based on access needs.
I have data on premise. How do I transfer to Google cloud?
Options:
- Online transfer:
- Transfer to Google cloud storage via APIs or CLI (gsutil) [< 1TB]
- Good for smaller sized transfer (not for peta byte sized data)
- Storage Transfer:
- Peta byte sized data
- Setup a recurring repeating schedule
- Can be an incremental transfer as well.
- Fault tolerant - starts from where it failed.
- Use when
- > 1TB of data
- Transferring from different cloud
- Transfer Appliance is physical data transfer.
- Size > 20TB
- Request an appliance.
- Upload data to the appliance (e.g USB type appliance)
- Ship the appliance
- Google uploads to storage.
- Data is encrypted in the appliance
- Two appliance devices
- TA40 (for upto 40 TB)
- TA300 (for upto 300 TB)
No comments:
Post a Comment