AWS or an alternative cloud service for deploying models. Which to choose?

This story covers how Blitzfa caters to thousands of users of its custom AI model service, VanGO, using a relatively new cloud infrastructure: Beam.

With the release of VanGO, users have been swarming our servers to generate graphics based on their ideas and chosen music. VanGO is an innovative architecture developed by Blitzfa that allows users to generate images by combining their ideas and styling them with their own choice of song. While AWS SageMaker was initially our platform of choice, it quickly became apparent that its limitations were a significant barrier to maintaining both system stability and performance. Between maintenance downtimes, slow deployments, and frustrating scalability issues, we needed to act swiftly. The solution? Beam Cloud, a new infrastructure offering that has alleviated the pain points previously associated with SageMaker!

For a long time, developers across the industry have relied on AWS SageMaker to deploy custom machine learning models. SageMaker offers a platform where custom binaries, model weights, and inference code can be uploaded, enabling developers to expose models as endpoints ready for production (or prod-like) usage. In theory, this should work seamlessly. Upon uploading to SageMaker, developers essentially expose an endpoint for their custom model, which can then be attached to literally ANYTHING — in our case, our Indusvale website.
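For context, calling such an endpoint from any client looks roughly like this (the endpoint name and payload shape below are placeholders, not our actual setup):

```python
import json

import boto3

# Any client (a website backend, a Lambda function, a cron job) can call
# the deployed model through the SageMaker runtime API.
runtime = boto3.client("sagemaker-runtime")

# Hypothetical endpoint name and payload; your model's input contract will differ.
response = runtime.invoke_endpoint(
    EndpointName="my-custom-model-endpoint",
    ContentType="application/json",
    Body=json.dumps({"prompt": "a city skyline at dusk"}),
)

result = json.loads(response["Body"].read())
print(result)
```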

However, as we scaled VanGO and encountered the high demands of user traffic, SageMaker’s inadequacies became glaringly evident. SageMaker’s integration with AWS Lambda enforces a strict 60-second timeout on requests, which was increasingly problematic for our use case. Many of our models, including complex ones involving large, convoluted neural networks or fine-tuned models with custom LoRA weights, simply could not finish inference within that window.

Not only that, we faced several other issues with SageMaker when deploying bespoke models, models pulled from Hugging Face, or even traditional models with custom LoRA weights.

 

Issues with AWS SageMaker:

Let’s break down the specific technical hurdles we faced with SageMaker and why they ultimately pushed us to seek out a more reliable alternative.

#1. The infamous “worker thread died” error.

Well, we weren’t the first to encounter this, and there are obvious reasons for that.

Deploying MLLMs and other large architectures has often been a hit-or-miss affair, based on our experience. We frequently found ourselves debugging why a model that worked yesterday would not work this morning. Such instability led us to rebuild fresh Docker images, push them to ECR, and then redeploy to AWS, hoping it would be in a good mood.

Despite deploying with high hopes, SageMaker often threw us into an unpredictable state of deployment failures. The error we encountered most often was the dreaded “worker thread died” issue. This typically occurred during the deployment of larger models or after a configuration update. The real cause remains an open question: some attribute it to module incompatibility, while others point to incorrect usage of “accelerate.” Digging through the deployment logs only adds to the frenzy, since the error messages are terse and vague.

You can’t rely on SageMaker for large models when the deployment pipeline is this unstable.

To make things more interesting, we also wanted to incorporate on-demand serverless inference: automatically spinning instances up during periods of activity and down during inactivity, giving us a cost-effective setup. However, the unstable nature of SageMaker worker threads, which essentially manage your processes, left us in a state of flux. We never had confidence that a model that was up now would still be up after a cold boot. For that reason, we couldn’t even make our solution cost-effective. This was another straw on the pile of inconvenience.
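For reference, the serverless setup we were aiming for looks roughly like this with boto3 (the resource names and sizing below are illustrative, not our production values):

```python
import boto3

sm = boto3.client("sagemaker")

# A serverless endpoint config scales to zero when idle and spins workers up
# on demand, which is exactly the cost model we wanted.
sm.create_endpoint_config(
    EndpointConfigName="vango-serverless-config",  # illustrative name
    ProductionVariants=[
        {
            "VariantName": "AllTraffic",
            "ModelName": "vango-model",  # illustrative name
            "ServerlessConfig": {
                "MemorySizeInMB": 6144,
                "MaxConcurrency": 5,
            },
        }
    ],
)

sm.create_endpoint(
    EndpointName="vango-serverless",
    EndpointConfigName="vango-serverless-config",
)
```

But none of this matters if the workers themselves keep dying between cold boots.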

Don’t get us wrong — bugs and errors are part of a healthy development lifecycle, especially when you, as a developer, have the flexibility to control the moving parts with some predictability. With SageMaker, we were never assured that our model, given its sheer size and complexity, would be up at any given time. Stack Overflow and GitHub issues often pointed to incompatibilities between PyTorch versions, accelerate, and a few other modules, yet the same combinations worked perfectly well on EC2 instances. Never mind that; we went about tweaking modules anyway, and that introduced another pain point.

#2. Rebuild the ECR image for new modules, and a new tar file for every minute change?!

One of the most tedious and time-consuming aspects of deploying on SageMaker was the requirement to constantly rebuild and redeploy entire container images. Whenever we had a new model, a minor code update, or even a small change in one of our dependencies, we were forced to rebuild and push a new Docker image to Amazon Elastic Container Registry (ECR). This was especially cumbersome for any incremental change, no matter how trivial.
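To give a sense of the churn, every such change meant rebuilding and pushing a new image (and usually a new model tarball) and then walking SageMaker through a fresh rollout, roughly like this (all names and URIs below are placeholders):

```python
import boto3

sm = boto3.client("sagemaker")

# Step 0 (not shown): docker build, docker tag, docker push to ECR, and upload
# the new model.tar.gz to S3, even for a one-line change.

# Hypothetical image/model URIs and names, for illustration only.
new_image_uri = "123456789012.dkr.ecr.us-east-1.amazonaws.com/vango-inference:v42"
new_model_data = "s3://my-bucket/models/vango/model-v42.tar.gz"

sm.create_model(
    ModelName="vango-model-v42",
    PrimaryContainer={"Image": new_image_uri, "ModelDataUrl": new_model_data},
    ExecutionRoleArn="arn:aws:iam::123456789012:role/SageMakerExecutionRole",
)

sm.create_endpoint_config(
    EndpointConfigName="vango-config-v42",
    ProductionVariants=[
        {
            "VariantName": "AllTraffic",
            "ModelName": "vango-model-v42",
            "InstanceType": "ml.g5.xlarge",
            "InitialInstanceCount": 1,
        }
    ],
)

# Point the live endpoint at the new config and wait for the rollout to finish.
sm.update_endpoint(EndpointName="vango-prod", EndpointConfigName="vango-config-v42")
```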

A common argument I often encounter in the community is that Docker gives you the flexibility to deploy anywhere, and I wholeheartedly agree. Dockerizing your code is a great way to distribute it across multiple VMs and endpoints, as it makes your model resource-agnostic. However, pushing to ECR, and at times baking new modules into the image, can take quite a long time. Sure, there are a few workarounds, but a fast-moving startup needs something more efficient.

Beam Cloud: The Solution We Needed

Beam Cloud, on the other hand, streamlined the deployment process significantly. With features like auto-scaling, flexible model hosting, and robust resource management, Beam eliminated the pain points we faced with SageMaker. The new deployment workflow, which does not require constant rebuilding of container images for each change, vastly improved our turnaround time. Additionally, Beam’s ability to efficiently manage resources like GPUs, memory, and storage in a more intelligent and dynamic way has resolved the out-of-memory and timeout issues that plagued our previous setup.

I mean, just look at this piece of code:
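(What follows is a representative sketch based on Beam’s Python SDK; the package list, resource sizes, and names are illustrative rather than our exact configuration, and decorator parameters may vary slightly between SDK versions.)

```python
from beam import Image, endpoint

# The container image is declared in code: Python version, packages, and all.
image = Image(
    python_version="python3.10",
    python_packages=["torch", "diffusers", "transformers", "accelerate"],
)

@endpoint(
    name="vango-inference",  # illustrative name
    cpu=4,
    memory="32Gi",
    gpu="A10G",
    image=image,
)
def generate(prompt: str, song: str):
    # Load the model and run inference here; Beam provisions the GPU,
    # builds the environment, and exposes the function as an HTTP endpoint.
    return {"status": "ok", "prompt": prompt, "song": song}
```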

That’s all you need to define your working environment. Beam has been a game changer for us: it takes the infrastructure load off our shoulders and lets us focus on the development work that actually matters.

To further support my point about why we chose Beam, allow me to illustrate how we are using it for our new, limited-access service, VanGO.
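Once an endpoint like the one above is deployed, our website backend only needs to make an authenticated HTTP call to it. Here is a sketch of that client side, with a placeholder URL and token rather than our real ones:

```python
import requests

# Placeholder URL and token; Beam issues a unique endpoint URL and auth token
# when an app is deployed.
BEAM_ENDPOINT_URL = "https://app.beam.cloud/endpoint/vango-inference/v1"
BEAM_AUTH_TOKEN = "YOUR_BEAM_AUTH_TOKEN"

def generate_artwork(prompt: str, song_title: str) -> dict:
    """Ask the deployed VanGO endpoint for an image styled by an idea and a song."""
    response = requests.post(
        BEAM_ENDPOINT_URL,
        headers={"Authorization": f"Bearer {BEAM_AUTH_TOKEN}"},
        json={"prompt": prompt, "song": song_title},
        timeout=300,  # generous client-side timeout for long-running generations
    )
    response.raise_for_status()
    return response.json()

print(generate_artwork("neon koi pond at midnight", "Clair de Lune"))
```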

 

With Beam, we have:

  • a one-click deployment solution
  • the ability to modify the Python runtime environment directly in code
  • more stable deployments
  • more intuitive DevOps
  • lastly, and most importantly, it’s cheaper :)

And don’t take our word for it; compare the prices yourself.

Price comparison for two of the most widely used GPUs, SageMaker vs. Beam (as of December 2024)

Conclusion:

In conclusion, while SageMaker is a robust tool for many developers, it was not the right fit for Blitzfa’s growing needs. As we scaled VanGO, SageMaker’s limitations became increasingly evident, prompting us to seek out an alternative. Beam Cloud’s flexibility, scalability, and reliability provided the solution we were looking for, and it has now become our go-to platform for deploying custom AI models.

Just a side note: this is in no way intended to defame any service. It is simply a personal experience, shared in keeping with one of our core values — to be transparent and contribute to a healthy community by sharing what we discover!
