• Map Information

Cloud Optimized Storage

By dave_graham

Updated mind map for Cloud Optimized Storage based on feedback from Google Groups Cloud Computing.

Added: November 24, 2008 10:45:53

1805 views | 31 downloads

  • Comments (0)
  • Outline

Cloud Optimized Storage

Cloud Optimized Storage
1 What Content is being stored?
1.1 Structured Content

Structured data are data that have been represented in a manner that allows computation with those data. For example, the data within MetaCyc are highly structured because different properties and relationships of metabolic enzymes, pathways, and reactions have been carefully dissected and assigned to distinct fields of a database so that they are independently queryable and computable. Therefore, we can ask questions across the data such as "find all enzymes that use magnesium as a cofactor" or "find all pathways in which pyruvate is an input substrate". - http://commons.wikimedia.org/wiki/Category:Data_structures

1.1.1 Meta-data
1.1.2 XML
1.1.3 Database frameworks

note: data within a DB could be unstructured data based on designation and determinance

1.1.4 HTML
1.1.5 Email (some)
1.2 Unstructured Content

80% of all potentially usable business information originates in unstructured form..

1.2.1 Video
1.2.2 Audio
1.2.3 Email (some)
1.2.4 Documents
1.2.5 HTML

HTML uses tags but mark-up is used for rendering, not for data.

2 What performance is expected?
2.1 Tier 0 - TBD
2.2 Quick Response (Tier 1)
2.2.1 structured content
2.2.2 performance guaranteed applications
2.3 Medium Response (Tier 2)
2.3.1 unstructured content
2.3.2 availability guaranteed applications
2.4 Storage only (Tier 3)
2.4.1 backup
2.4.2 archives
2.4.3 accessibility guaranteed storage
3 What SLA(s) are in place for content access?
3.1 Compliance
3.1.1 PCI
3.1.2 SOX
3.1.3 UK DPA act
3.1.4 95/46/EC
3.1.5 Client requirements
3.1.6 HIPAA
3.1.7 SB 1386 (CA Info Practice Act)
3.2 Data Protection
3.3 Data Access
3.3.1 Authentication
3.3.2 Permissions to Content (Specific)
3.4 Chain of Custody
4 Performance (Hardware)
4.1 Latency
4.1.1 From Client to Cloud
4.1.2 From Cloud to Provider
4.1.3 From Provider to Storage
4.2 Bandwidth
4.2.1 Connectivity Types
4.2.1.1 Fibre Channel over Ethernet (FCoE)
4.2.1.2 Global Namespace IP Access
4.2.1.3 NAS (CIFS/NFS/?)
4.2.1.4 Other (please comment)
4.2.1.5 UDP/Multi-stream
4.3 Resiliency
4.3.1 LACP
4.3.2 Etherchannel
4.3.3 Vendor-specific failover control
4.3.4 Multi-pathing
4.3.4.1 OS-based native MPIO
4.3.4.1.1 Microsoft MPIO
4.3.4.1.2 VMWare Path Management
4.3.4.2 Vendor-specific path management
4.3.4.2.1 PowerPath (example)
4.3.5 Distributed Data via Replication
4.3.5.1 Object based
4.3.5.1.1 Synchronous
4.3.5.1.2 Asynchronous
4.3.5.1.3 Other forms of Replication
4.3.5.2 Filesystem based
4.3.5.2.1 Synchronous
4.3.5.2.2 Asynchronous
4.3.5.2.3 Other forms of Replication
4.3.6 Data Protection
4.3.6.1 RAID w/Parity
4.3.6.1.1 RAID 5
4.3.6.1.2 RAID 6
4.3.6.1.3 RAID 3
4.3.6.2 RAID w/o Parity
4.3.6.2.1 RAID 10
4.3.6.2.2 RAID 1
5 How is Content being stored?
5.1 Filesystems
5.1.1 Open File Systems
5.1.1.1 NTFS
5.1.1.2 XFS
5.1.1.3 ZFS
5.1.1.4 Other Filesystems (?)
5.1.1.5 Vendor proprietary
5.1.1.6 Client proprietary
5.1.2 "Closed" File Systems
5.1.2.1 HP-UX
5.1.2.2 AIX
5.1.2.3 Others
5.2 Objects
5.2.1 XAM
5.2.2 Vendor proprietary
5.2.3 Client proprietary
5.3 Content Manipulation
5.3.1 De-duplication
5.3.1.1 In-band (Appliance)
5.3.1.2 Out-of-band (Target)
5.3.2 Encryption
5.3.2.1 Data-at-rest
5.3.2.2 Inband (Appliance)
5.3.2.3 Out-of-band
5.3.2.4 Nature of Encryption
5.3.2.4.1 RSA AES?
5.3.2.4.1.1 Algorithms
5.3.2.4.1.1.1 other factor
5.3.2.4.1.1.2 CBC
5.3.2.4.1.1.3 XTS
5.3.2.4.1.2 Encrypt Key Sizes
5.3.2.4.1.2.1 128 bit
5.3.2.4.1.2.2 256 bit
5.3.2.4.1.2.3 other key size
5.3.2.4.2 Other Forms of Encryption
5.3.3 Compression
5.3.3.1 Can content be compressed?
5.3.4 Optimization
5.3.4.1 Does storage optimize content?
6 How does Content stay "Green"?

Interesting Article here: http://www.storage-switzerland.com/Articles/Entries/2008/11/24_All_MAIDS_are_not_created_equal.html

6.1 Hardware
6.1.1 Drive Spin down
6.1.2 Adaptive Power usage
6.1.3 Slower drive spindles
6.1.4 data "tiering"
6.1.5 storage virtualization
6.2 Software
6.2.1 Policy-drive storage
6.2.2 Storage Tiering