Cloud Optimized Storage

Add by Dave Graham | Nov 24, 2008 10:45  2835 |  43
Cloud Optimized Storage

Map Outline

Cloud Optimized Storage
1 What Content is being stored?
1.1 Structured Content

Structured data are data that have been represented in a manner that allows computation with those data. For example, the data within MetaCyc are highly structured because different properties and relationships of metabolic enzymes, pathways, and reactions have been carefully dissected and assigned to distinct fields of a database so that they are independently queryable and computable. Therefore, we can ask questions across the data such as "find all enzymes that use magnesium as a cofactor" or "find all pathways in which pyruvate is an input substrate". - http://commons.wikimedia.org/wiki/Category:Data_structures

1.1.1 Meta-data
1.1.2 XML
1.1.3 Database frameworks

note: data within a DB could be unstructured data based on designation and determinance

1.1.4 HTML
1.1.5 Email (some)
1.2 Unstructured Content

80% of all potentially usable business information originates in unstructured form..

1.2.1 Video
1.2.2 Audio
1.2.3 Email (some)
1.2.4 Documents
1.2.5 HTML

HTML uses tags but mark-up is used for rendering, not for data.

2 What performance is expected?
2.1 Tier 0 - TBD
2.2 Quick Response (Tier 1)
2.2.1 structured content
2.2.2 performance guaranteed applications
2.3 Medium Response (Tier 2)
2.3.1 unstructured content
2.3.2 availability guaranteed applications
2.4 Storage only (Tier 3)
2.4.1 backup
2.4.2 archives
2.4.3 accessibility guaranteed storage
3 What SLA(s) are in place for content access?
3.1 Compliance
3.1.1 PCI
3.1.2 SOX
3.1.3 UK DPA act
3.1.4 95/46/EC
3.1.5 Client requirements
3.1.6 HIPAA
3.1.7 SB 1386 (CA Info Practice Act)
3.2 Data Protection
3.3 Data Access
3.3.1 Authentication
3.3.2 Permissions to Content (Specific)
3.4 Chain of Custody
4 Performance (Hardware)
4.1 Latency
4.1.1 From Client to Cloud
4.1.2 From Cloud to Provider
4.1.3 From Provider to Storage
4.2 Bandwidth
4.2.1 Connectivity Types Fibre Channel over Ethernet (FCoE) Global Namespace IP Access NAS (CIFS/NFS/?) Other (please comment) UDP/Multi-stream
4.3 Resiliency
4.3.1 LACP
4.3.2 Etherchannel
4.3.3 Vendor-specific failover control
4.3.4 Multi-pathing OS-based native MPIO Microsoft MPIO VMWare Path Management Vendor-specific path management PowerPath (example)
4.3.5 Distributed Data via Replication Object based Synchronous Asynchronous Other forms of Replication Filesystem based Synchronous Asynchronous Other forms of Replication
4.3.6 Data Protection RAID w/Parity RAID 5 RAID 6 RAID 3 RAID w/o Parity RAID 10 RAID 1
5 How is Content being stored?
5.1 Filesystems
5.1.1 Open File Systems NTFS XFS ZFS Other Filesystems (?) Vendor proprietary Client proprietary
5.1.2 "Closed" File Systems HP-UX AIX Others
5.2 Objects
5.2.1 XAM
5.2.2 Vendor proprietary
5.2.3 Client proprietary
5.3 Content Manipulation
5.3.1 De-duplication In-band (Appliance) Out-of-band (Target)
5.3.2 Encryption Data-at-rest Inband (Appliance) Out-of-band Nature of Encryption RSA AES? Algorithms other factor CBC XTS Encrypt Key Sizes 128 bit 256 bit other key size Other Forms of Encryption
5.3.3 Compression Can content be compressed?
5.3.4 Optimization Does storage optimize content?
6 How does Content stay "Green"?

Interesting Article here: http://www.storage-switzerland.com/Articles/Entries/2008/11/24_All_MAIDS_are_not_created_equal.html

6.1 Hardware
6.1.1 Drive Spin down
6.1.2 Adaptive Power usage
6.1.3 Slower drive spindles
6.1.4 data "tiering"
6.1.5 storage virtualization
6.2 Software
6.2.1 Policy-drive storage
6.2.2 Storage Tiering

More Maps From User