Many applications depend on shared file storage using protocols like NFS and traditional on-premise NAS appliances. The drawbacks of traditional storage provisioning, expenses and management can be addressed in the cloud, and open up new opportunities to move applications to higher performance and lower cost. In this session you will learn about the Elastic File System use cases, common project profiles and the criteria for selecting and designing a file system in the cloud.
Learning Objectives:
• An understanding of the AWS cloud file system: elasticity, performance, protocols, use cases
• An overview of applications and use cases
• Which AWS storage service to apply to which problems (object, block, file)
2. Goals and expectations for this session
Overall goal: Introduce you to Amazon EFS (what it is,
features, how it can help you)
Webinar intended for all levels: We’ll cover both beginner
topics and more advanced concepts
We’ll do Q&A at the end: Submit questions during
presentation
3. Agenda
1. Provide overview of EFS
2. Introduce EFS technical concepts
3. Walk through creating a file system
4. Review file system security mechanisms
5. Discuss EFS’s availability and durability properties
6. Share key performance characteristics
5. Amazon EFS
File
Amazon EBS
Amazon EC2
Instance Store
Block
Amazon S3 Amazon Glacier
Object
Data Transfer
AWS Direct
Connect
ISV
Connectors
Amazon
Kinesis
Firehose
Storage
Gateway
S3 Transfer
Acceleration
The AWS Storage Platform
AWS
Snowball
Amazon
CloudFront
Internet/VPN
6. Operating shared file storage today is a pain
Application owner
or developer
IT administrator
Business owner
Estimate demand
Procure hardware
Set aside physical space
Set up and maintain hardware (and network)
Manage access and security
Provide demand forecasts/business case
Add lead times and extra coordination to your schedule
Limit your flexibility and agility
Make up-front capital investments, over-buy, stay on a
constant upgrade/refresh cycle
Sacrifice business agility
Distract your people from your business’s mission
7. A fully managed file system for Amazon EC2 instances
Exposes a file system interface that works with standard
operating system APIs
Provides file system access semantics (consistency, locking)
Sharable across thousands of instances
Designed to grow elastically to petabyte scale
Built for performance across a wide variety of workloads
Highly available and durable
What is Amazon EFS?
8. We focused on changing the game
Simple Elastic Scalable
1 2 3
Highly Durable
Highly Available
9. Amazon EFS is Simple
Fully managed
- No hardware, network, file layer
- Create a scalable file system in seconds!
Seamless integration with existing tools and apps
- NFS v4.1—widespread, open
- Standard file system access semantics
- Works with standard OS file system APIs
Simple pricing = simple forecasting
1
10. Amazon EFS is Elastic
File systems grow and shrink automatically
as you add and remove files
No need to provision storage capacity or
performance
You pay only for the storage space you use,
with no minimum fee
2
11. File systems can grow to petabyte scale
Throughput and IOPS scale automatically
as file systems grow
Consistent low latencies regardless of file
system size
Support for thousands of concurrent NFS
connections
Amazon EFS is Scalable3
12. Designed to sustain AZ offline conditions
Superior to traditional NAS availability
models
Appropriate for Production / Tier 0
applications
Highly Durable and Highly Available
14. What is a file system?
The primary resource in EFS
Where you store files and directories
Can create 10 file systems per account
15. What is a mount target?
To access your file system
from instances in a VPC, you
create mount targets in the
VPC
A mount target is an NFSv4
endpoint in your VPC
A mount target has an IP
address and a DNS name you
use in your mount command
AVAILABILITY ZONE 1
REGION
AVAILABILITY ZONE 2
AVAILABILITY ZONE 3
VPC
EC2
EC2
EC2
EC2
Mount
target
16. How to access a file system from an instance
You “mount” a file system on an Amazon EC2 instance
(standard command) — the file system appears like a local
set of directories and files
An NFSv4.1 client is standard on Linux distributions
mount –t nfs4 –o nfsvers=4.1
[file system DNS name]:/
/[user’s target directory]
17. How does it all fit together?
AVAILABILITY ZONE 1
REGION
AVAILABILITY ZONE 2
AVAILABILITY ZONE 3
VPC
EC2
EC2
EC2
EC2
Customer’s file
system
18. There are three ways to set up and manage a
file system
AWS Management Console
AWS Command Line Interface (CLI)
AWS Software Development Kit (SDK)
19. The AWS Management Console, CLI, and SDK each
allow you to perform a variety of management tasks
Create a file system
Create and manage mount targets
Tag a file system
Delete a file system
View details on file systems in your AWS account
20. Setting up and mounting a file system takes
under a minute
1. Create a file system
2. Create a mount target in each AZ from which you want
to access the file system
3. Enable the NFS client on your instances
4. Run the mount command
29. Only EC2 instances in the VPC you specify can access
your EFS file system
VPC
EC2
EC2
EC2
EC2
VPC
EC2
EC2
EC2
EC2
Customer’s file
system
30. Several security mechanisms
Control network traffic to and from file systems (mount
targets) by using VPC security groups and network ACLs
Control file and directory access by using POSIX
permissions
Control administrative access (API access) to file
systems by using AWS Identity and Access Management
(IAM)
31. VPC
EC2
EC2
Security groups control which instances in your VPC
can connect to your mount targets
Customer’s file
system
Security group:
sg-allowed
Security group:
Permit inbound traffic
from “sg-allowed”
Security group:
sg-not-allowed
32. EFS supports POSIX file and directory access
permissions
Set file/directory permissions to specify read-write-execute
permissions for users and groups
33. Use IAM policies to control who can use the
administrative APIs to create, manage, and
delete file systems
EFS supports action-level and resource-level
permissions
Integration with IAM provides administrative
security
35. In what regions can I use EFS?
US-West (Oregon)
US-East (Northern Virginia)
EU (Ireland)
36. Data is stored in multiple AZs for high availability
and durability
Every file
system object
(directory, file,
and link) is
redundantly
stored across
multiple AZs in
a region
AVAILABILITY
ZONE 1
REGION
AVAILABILITY
ZONE 2
AVAILABILITY
ZONE 3
Amazon
EFS
37. Data can be accessed from any AZ in the region
while maintaining full consistency
Your EC2 instances can
connect to your EFS file
system from any AZ in a
region
All reads will be fully
consistent in all AZs—that
is, a read in one AZ is
guaranteed to have the
latest data, even if the data
is being written in another
AZ
AVAILABILITY
ZONE 1
REGION
VPC
EC2
EC2
EC2
AVAILABILITY
ZONE 2
AVAILABILITY
ZONE 3
EC2
Write
Read
39. Amazon EFS is designed for wide spectrum of use cases
High throughput and parallel I/O
Low latency and serial I/O
Genomics
Big Data Analytics
Scale-out jobs
Home Directories
Content Management
Web serving
Metadata-intensive
jobs
40. EFS provides throughput that scales as a file system
grows
As a file system gets larger, it
needs access to more
throughput
Many file workloads are spiky,
with peak throughput well above
average levels
Amazon EFS scalable bursting model is designed to
make performance available when you need it
41. Bursting model examples
File system size Read/write throughput
A 1 TB EFS file system can… • Drive up to 50 MB/s continuously
or
• Burst to 100 MB/s for up to 12 hours each day*
A 10 TB EFS file system can… • Drive up to 500 MB/s continuously
or
• Burst to 1 GB/s for up to 12 hours each day*
A 100 GB EFS file system can… • Drive up to 5 MB/s continuously
or
• Burst to 100 MB/s for up to 72 minutes each day*
42. Two performance modes designed to support a
broad spectrum of use cases
Optimized for latency-sensitive applications and general-purpose
file-based workloads – this mode is the best option for the majority
of use cases
General
purpose
mode
Max I/O
mode
Optimized for large-scale and data-heavy applications where tens,
hundreds, or thousands of EC2 instances are accessing the file
system — it scales to higher levels of aggregate throughput and ops
per second with a tradeoff of slightly higher latencies for file operations
Default: Recommended for most use cases
Use CloudWatch to determine whether your application may benefit from “Max I/O”;
if not, you’ll get the best performance in “general purpose” mode
45. Simple and predictable pricing
With EFS, you pay only for the storage space you use
No minimum commitments or up-front fees
No need to provision storage in advance
No other fees, charges, or billing dimensions
EFS price: $0.30/GB-month
46. What to do next?
Learn more at aws.amazon.com/efs
Try it out for free!