Welcome!

Recurring Revenue Authors: Zakia Bouachraoui, Yeshim Deniz, Elizabeth White, Pat Romanski, Xenia von Wedel

RSS Feed Item

Running a production site on the Amazon EC2 / SDB / S3 services stack

Over the past few months or so I’ve been working on rewriting my web application engine to go from running on a virtual dedicated server at a hosting company to running an instance on Amazon EC2 instance. Additionally, I decided to take a risk on Amazon SDB and despite its many flaws, eschew MySQL completely and utterly.

One thing I like a lot about Amazon’s service stack is that it forces you to think about scalability problems right up front. If you design with MySQL on a VPS, it’s easy to get into using a lot of the features that don’t scale – so you make a mess for yourself down the road.

There were several challenges to running a site completely on Amazon’s stack:

  • Amazon has a large distance between their small nodes and the next size up node. – The small nodes are pretty weak by themselves, you wouldn’t want to base a website on a single node by itself. This forced me to design the architecture to work from day 1 on ‘n’ servers.
  • EC2 still has no load balancing API – They should add this, but to get around it we use DNS round robin assignment, which is pretty much good enough for assigning servers that should never be under huge loads anyways.
  • SDB is slow. Everything about it is slow. Also it has its own features that don’t scale very well, such as ‘starts-with’ < this runs slowly.
  • The data model you want for SDB is different than one you want for an RDBMS. Because you can’t join or index, you want to be thinking about denormalization of your data quite quickly. This also means migration from MySQL to SDB is tricky.
  • EC2 payloads are annoying to change – Marc created a script that on instance startup pulls down all the required software for a server from SVN, finds the other servers for dns round robin-ing and for memcaches, and starts up apache.

To move over to Amazon required a migration on our relatively small database of 100k or so rows. This however expanded to around 250k rows on SDB, and required about a million SDB requests to accomplish.

As I wanted to move the site without a lot of downtime, I wrote the migration script to run on ‘N’ servers. When I started the migration up, I ran 20 server instances, and ran the script on all of them. Each segment of the migration took about 2 hours to run, so without EC2’s easy scaling capabilities the site would have been required to be down for 2 days instead of 2 hours.

So far the new Amazon EC2/SDB code base has been up for 3 days. All data is in SimpleDB, from user logins to the friend network graph, despite mysql’s ease of use I’m glad to be rid of the database as a potential bottleneck that could bring the entire site down – something that happened too frequently on my last project.

Overall the site migration went quite smoothly, and we’ve been running for a couple days now without incident. The site has created about 40k html pages, extensive use of memcache means most pages will load in 200ms under a bit of pressure, and best of all costs are reduced and completely trackable.

Now my job is to focus on growing traffic. My new site only has about 7k daily visitors, whereas my previous effort swik.net has about 30k, so I have a ways to go.

Read the original blog entry...

IoT & Smart Cities Stories
The platform combines the strengths of Singtel's extensive, intelligent network capabilities with Microsoft's cloud expertise to create a unique solution that sets new standards for IoT applications," said Mr Diomedes Kastanis, Head of IoT at Singtel. "Our solution provides speed, transparency and flexibility, paving the way for a more pervasive use of IoT to accelerate enterprises' digitalisation efforts. AI-powered intelligent connectivity over Microsoft Azure will be the fastest connected pat...
There are many examples of disruption in consumer space – Uber disrupting the cab industry, Airbnb disrupting the hospitality industry and so on; but have you wondered who is disrupting support and operations? AISERA helps make businesses and customers successful by offering consumer-like user experience for support and operations. We have built the world’s first AI-driven IT / HR / Cloud / Customer Support and Operations solution.
Codete accelerates their clients growth through technological expertise and experience. Codite team works with organizations to meet the challenges that digitalization presents. Their clients include digital start-ups as well as established enterprises in the IT industry. To stay competitive in a highly innovative IT industry, strong R&D departments and bold spin-off initiatives is a must. Codete Data Science and Software Architects teams help corporate clients to stay up to date with the mod...
At CloudEXPO Silicon Valley, June 24-26, 2019, Digital Transformation (DX) is a major focus with expanded DevOpsSUMMIT and FinTechEXPO programs within the DXWorldEXPO agenda. Successful transformation requires a laser focus on being data-driven and on using all the tools available that enable transformation if they plan to survive over the long term. A total of 88% of Fortune 500 companies from a generation ago are now out of business. Only 12% still survive. Similar percentages are found throug...
Druva is the global leader in Cloud Data Protection and Management, delivering the industry's first data management-as-a-service solution that aggregates data from endpoints, servers and cloud applications and leverages the public cloud to offer a single pane of glass to enable data protection, governance and intelligence-dramatically increasing the availability and visibility of business critical information, while reducing the risk, cost and complexity of managing and protecting it. Druva's...
BMC has unmatched experience in IT management, supporting 92 of the Forbes Global 100, and earning recognition as an ITSM Gartner Magic Quadrant Leader for five years running. Our solutions offer speed, agility, and efficiency to tackle business challenges in the areas of service management, automation, operations, and the mainframe.
The Jevons Paradox suggests that when technological advances increase efficiency of a resource, it results in an overall increase in consumption. Writing on the increased use of coal as a result of technological improvements, 19th-century economist William Stanley Jevons found that these improvements led to the development of new ways to utilize coal. In his session at 19th Cloud Expo, Mark Thiele, Chief Strategy Officer for Apcera, compared the Jevons Paradox to modern-day enterprise IT, examin...
With 10 simultaneous tracks, keynotes, general sessions and targeted breakout classes, @CloudEXPO and DXWorldEXPO are two of the most important technology events of the year. Since its launch over eight years ago, @CloudEXPO and DXWorldEXPO have presented a rock star faculty as well as showcased hundreds of sponsors and exhibitors! In this blog post, we provide 7 tips on how, as part of our world-class faculty, you can deliver one of the most popular sessions at our events. But before reading...
DSR is a supplier of project management, consultancy services and IT solutions that increase effectiveness of a company's operations in the production sector. The company combines in-depth knowledge of international companies with expert knowledge utilising IT tools that support manufacturing and distribution processes. DSR ensures optimization and integration of internal processes which is necessary for companies to grow rapidly. The rapid growth is possible thanks, to specialized services an...
At CloudEXPO Silicon Valley, June 24-26, 2019, Digital Transformation (DX) is a major focus with expanded DevOpsSUMMIT and FinTechEXPO programs within the DXWorldEXPO agenda. Successful transformation requires a laser focus on being data-driven and on using all the tools available that enable transformation if they plan to survive over the long term. A total of 88% of Fortune 500 companies from a generation ago are now out of business. Only 12% still survive. Similar percentages are found throug...