QuestionsAnswered.net

What Is a Case Study?

When you’re performing research as part of your job or for a school assignment, you’ll probably come across case studies that help you learn more about the topic at hand. But what is a case study, and why is it helpful? Read on to learn all about case studies.

Deep Dive into a Topic

At face value, a case study is a deep dive into a topic. Case studies can be found in many fields, particularly across the social sciences and medicine. When you conduct a case study, you create a body of research based on an inquiry and related data from analysis of a group, individual or controlled research environment.

As a researcher, you can benefit from the analysis of case studies similar to inquiries you’re currently studying. Researchers often rely on case studies to answer questions that basic information and standard diagnostics cannot address.

Study a Pattern

One of the main objectives of a case study is to find a pattern that answers whatever the initial inquiry seeks to find. This might be a question about why college students are prone to certain eating habits or what mental health problems afflict house fire survivors. The researcher then collects data, either through observation or data research, and starts connecting the dots to find underlying behaviors or impacts of the sample group’s behavior.

Gather Evidence

During the study period, the researcher gathers evidence to back the observed patterns and future claims that’ll be derived from the data. Since case studies are usually presented in the professional environment, it’s not enough to simply have a theory and observational notes to back up a claim. Instead, the researcher must provide evidence to support the body of study and the resulting conclusions.

Present Findings

As the study progresses, the researcher develops a solid case to present to peers or a governing body. Case study presentation is important because it legitimizes the body of research and opens the findings to a broader analysis that may end up drawing a conclusion that’s more true to the data than what one or two researchers might establish. The presentation might be formal or casual, depending on the case study itself.

Draw Conclusions

Once the body of research is established, it’s time to draw conclusions from the case study. As with all social science studies, conclusions from one researcher shouldn’t necessarily be taken as gospel, but they’re helpful for advancing the body of knowledge in a given field. For that purpose, they’re an invaluable way of gathering new material and presenting ideas that others in the field can learn from and expand upon.

Implementing a Large AWS Data Lake for Analysis of Heterogeneous Data

C4ADS users were finding it increasingly difficult to sift through the company’s massive database collection.

ClearScale implemented a data lake with an Amazon Virtual Private Cloud (VPC), designed a web-based user interface, and used AWS Lambda and API Gateway to ingest data.

C4ADS’ new solution can scale as needed without compromising security and is able to meet user demands more effectively.

AWS Services

Amazon Virtual Private Cloud (VPC), Amazon S3, AWS Lambda, Amazon API Gateway, Amazon CloudWatch, AWS CloudTrail, Amazon DynamoDB

Executive Summary

C4ADS (Center for Advanced Defense Studies) is a nonprofit organization based in Washington DC that is dedicated to providing data-driven analysis and evidence-based reporting on global conflict and transnational security issues. In this pursuit, C4ADS focuses on a variety of issues, including threat finance, transnational organized crime, and proliferation networks.

The Challenge

The world is a complex ecosystem of people, economies, competing interests, and political ambiguity. Tracking many different events to determine whether there are patterns that warrant a more critical look and analysis is a difficult task, even under the best conditions. With new regional or political developments each day, sometimes even hour by hour, combing through enormous sets of data is challenging, especially when that data comes from different sources and in various formats.

C4ADS is tasked with just this sort of activity. Their clients require evidence-based and data-driven analysis concerning global conflict and transnational security issues. With a focus on identifying the drivers and enablers of such conflict, this organization has to be absolutely confident in the analysis and assessments they provide. However, the first step to performing any sort of review requires analysts to comb through extensive records from different sources and formats to compile a list of potential hits.

As C4ADS increased the number of datasets it ingested, new challenges arose, specifically in making use of all the data at its disposal. As more and more data became available, its analysts found it difficult to sift through the incoming information quickly. The company approached ClearScale, an AWS Premier Consulting Partner, to see whether it could build on its current processes by using AWS to make the data more user-friendly.

The ClearScale Solution

The challenge put forth by C4ADS was that a solution had to be implemented quickly, provide the ability to scale as needed, and be extremely secure given the nature of the information they were reviewing. With these three criteria in mind, ClearScale reviewed various designs and approaches that they could develop and implement on AWS.

Data Storage with Data Lake Approach

The biggest challenge was finding a way to aggregate multiple file formats (such as PDFs, emails, Microsoft Word and Excel files, logs, and XML and JSON files) while still allowing C4ADS to perform easy searches within a large data repository. It rapidly became clear that, to meet the client’s requirements, ClearScale would have to implement a Data Lake approach within an Amazon Virtual Private Cloud (VPC). Unlike traditional data warehouse methodologies, which require data to conform to a predefined schema, a data lake allows any number of data types to be stored and referenced, so long as there is a consistent approach to querying and retrieving that data.

It was immediately clear that trying to convert all the available file types into a normalized format would be too resource-intensive. Instead, ClearScale chose to tag all uploaded file content with consistent metadata, which in turn would allow for greater visibility and speedier search results. This automated tagging, applied to each file whether uploaded manually or via bulk upload, would mimic the folder structure and schema that the client had already adopted internally. This approach would ensure that the new solution would be easily understood by analysts who were already familiar with the current operational processes.
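As a rough illustration of metadata tagging that mirrors a folder structure, the sketch below derives tags from a file's storage path. It is not ClearScale's implementation: the function name `build_metadata`, the field names, and the example path are all assumptions made for this example.

```python
from pathlib import PurePosixPath


def build_metadata(object_key: str, uploader: str) -> dict:
    """Derive consistent metadata tags from a file's storage path.

    Hypothetical helper: every folder in the path becomes a tag, so the
    metadata mirrors the folder structure analysts already know.
    """
    path = PurePosixPath(object_key)
    return {
        "key": object_key,
        "file_name": path.name,
        # Lower-cased extension without the dot; "unknown" if there is none.
        "file_type": path.suffix.lstrip(".").lower() or "unknown",
        # All path components except the file name itself.
        "folder_tags": list(path.parts[:-1]),
        "uploaded_by": uploader,
    }


example = build_metadata("reports/2017/shipping/manifest.PDF", "analyst1")
```

Because the tags come from the path alone, the same function could serve both single uploads and bulk indexing of an existing folder, and the original files never need to be converted.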

Figure: Data Flow Model

Figure: System Architecture Diagram

Web-Based User Interface (Web UI)

To access and search these records, ClearScale designed and implemented a web-based user interface. Beyond searching the Data Lake, this UI allows complete management of the data sources, including data upload. From a data repository perspective, ClearScale needed to build and deploy a solution that was scalable and responsive to increased demand but also highly secure. To accomplish this, ClearScale combined Amazon S3 for storage of the uploaded data, DynamoDB for storage of the file metadata, and Elasticsearch for the robust search querying that was required.

To ingest the uploaded data, ClearScale used AWS Lambda and API Gateway, which also automated the creation of the file metadata. Both CloudWatch and CloudTrail were put in place to monitor resource usage and serve as triggering mechanisms to scale the environment as required.
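The ingestion step might look roughly like the following Lambda-style handler. This is a sketch under stated assumptions, not the deployed code: it assumes the function is invoked with an S3 object-created notification (the case study does not say how the function is triggered), and the names `records_from_s3_event` and `handler` and the output fields are hypothetical. Persisting the records is left as a comment so the example stays self-contained.

```python
from urllib.parse import unquote_plus


def records_from_s3_event(event: dict) -> list:
    """Turn an S3 event notification into metadata records (hypothetical shape)."""
    items = []
    for record in event.get("Records", []):
        s3 = record["s3"]
        # Object keys arrive URL-encoded in S3 notifications ("+" for spaces).
        key = unquote_plus(s3["object"]["key"])
        items.append({
            "bucket": s3["bucket"]["name"],
            "key": key,
            "size_bytes": s3["object"].get("size", 0),
            "extension": key.rsplit(".", 1)[-1].lower() if "." in key else "",
        })
    return items


def handler(event, context):
    """Lambda entry point: build one metadata record per uploaded object."""
    items = records_from_s3_event(event)
    # A real deployment would persist each item here, for example in DynamoDB.
    return {"indexed": len(items)}


sample_event = {
    "Records": [
        {"s3": {"bucket": {"name": "example-lake"},
                "object": {"key": "inbox/Q3+report.xlsx", "size": 2048}}}
    ]
}
```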

The entire solution was encased in an AWS VPC for robust security, with Cognito providing SAML-based authentication. This approach placed the information behind a robust security layer, with additional work done to encrypt data both at rest and in transit. It also ensured that administrators could grant access to specific document types based on group roles, for both internal and external role types.
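Granting access to document types by group role can be pictured with a small filter like the one below. The role names, the role-to-type mapping, and the function names are invented for illustration; the case study does not describe how ClearScale stored or evaluated these permissions.

```python
# Hypothetical mapping of group roles to the document types they may see.
ROLE_DOCUMENT_TYPES = {
    "internal-analyst": {"pdf", "email", "xlsx", "json"},
    "external-partner": {"pdf"},
}


def allowed_types(roles):
    """Union of the document types permitted by any of the user's roles."""
    allowed = set()
    for role in roles:
        allowed |= ROLE_DOCUMENT_TYPES.get(role, set())
    return allowed


def filter_results(results, roles):
    """Keep only the search results whose file type the user may access."""
    permitted = allowed_types(roles)
    return [doc for doc in results if doc["file_type"] in permitted]


search_results = [
    {"key": "reports/a.pdf", "file_type": "pdf"},
    {"key": "ledgers/b.xlsx", "file_type": "xlsx"},
]
```

Unknown roles map to an empty set, so a user with no recognized group sees nothing rather than everything.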

Screenshot: UI Welcome Screen

Screenshot: Bulk Indexing: Add and Index an Existing S3 Bucket or Folder

Screenshot: Bulk Indexing: Monitoring of Long-Running Backend Tasks

Screenshot: Bulk Indexing: Login and Automatic Error Handling

Screenshot: Multi-tenancy: Agile Access Setup

Screenshot: Metadata: Governance

Screenshot: Cart: Storing and Exploring Results in a Personal Cart

The Benefits

The turnaround time from design to delivery to C4ADS was a mere two months, including deployment of the solution in both a Staging and Production environment as well as training for C4ADS staff on how to use the new solution. The first release provided everything that C4ADS originally asked for: it had to be deployed quickly, it had to have the ability to scale as needed, and it had to be highly secure. Launched in October 2017, the solution has already optimized the analysts’ job activities by giving them the tools necessary to do wide-ranging search profiles and aggregate disparate heterogeneous data types.

Later releases will introduce more robust security measures that will allow C4ADS to extend the service out to their partner organizations. It will also provide multi-lingual support and optical character recognition (OCR) technology to aid in identification of important data markers in the data that is uploaded.

There are plenty of challenges in the business and technology landscape. Finding ways to overcome these challenges is what ClearScale does best. By bringing our own development resources to bear on these complex problems, we can design, build, test, and implement a solution in partnership with your organization, thus allowing you to focus on more pressing matters in running your day-to-day operations.

1CloudHub: Digital Transformation – Advisory | Solutions | Implementation

Case Study: Enterprise data lake on cloud

1CloudHub helped India’s leading television entertainment network bring its scattered big data into a single source of truth, to make advanced analytics affordable.

Cloud Advisory Services

DC, Servers & Data

Project scope

Data lake architecture design

Data transformation and storage in data lake

Customized reports in Power BI

About the client

The client is a leading media production and broadcasting company, subsidiary of a global media conglomerate. They have over 30 television channels, a digital business and a movie production business, reaching over 700 million viewers in India.

Business challenge

As part of their digital strategy, our client wanted to optimise user experience across channels — iOS and Android apps, Fire TV, web, and so on — based on user behaviour and preferences. This required a deeper understanding of customer behavioural patterns across platforms.

At the time, they were using Segment to collect around 6.5 billion records (20 TB of raw data) of behavioural data every month from their 30 million online viewers across sources.

In order to deliver a user-focussed digital viewing experience, the client needed

We, at 1CloudHub, enabled an enterprise data lake for all of the client’s data to reside in one place — preserving accuracy and timeliness of the data.

Leveraging our client’s existing mechanism to collect and feed data into the data lake, we created a pipeline with EMR (Elastic MapReduce) for data crunching or ETL (Extract, Transform, Load) and Power BI for self-service visualisation.

Our approach

01. Understand

04. Transform

05. Completion and reporting

Better

We enabled advanced analytics on up to a year of data, compared with the three months specified in the agreement, to deliver the meaningful insights the business teams sought.

Faster

We crunched over 12 million records in under an hour, running more than 100 VMs concurrently in a cluster.

Cheaper

We delivered each report at a cost of $70. At this cost, we delivered an excellent price-to-performance ratio, driven by the Spot Fleet instances we used and our on-demand or pay-as-you-use cloud model.

A similar on-premises setup in a data centre would have cost the client 12,000 times more.

Looking forward

We are delighted to have helped the client create a centralized, analytics-ready repository for their Big Data and look forward to helping them meet their strategic goals using our cloud capabilities.

AWS Data Lake - Azure Data Lake - ML/AI Case Studies

Case Studies

AWS Data Lake Project, Azure Data Lake Project, ML/AI Projects

Data Engineering

Industry: Consulting

Subject: AWS Cloud Data Lake Development; Cloud Big Data Engineering

Description:

Developing and maintaining data lakes on AWS. Data migration from RDBMS and file sources, loading data into S3, Redshift, and RDS. Designing and developing big data batch solutions using AWS Data Pipeline and AWS Glue and EMR. Developing a massive data warehouse using Redshift and Redshift Spectrum.

Project Task Summary

ETL workflows in Data Pipeline, monitoring and management of ETL pipelines.

Batch RDBMS data migration using AWS DMS.

Batch processing in EMR and Glue using Scala Spark.

Designing and developing data warehouse on Redshift.

DWH data model and table design.

Accessing and processing big data on S3 via SQL using Redshift Spectrum.

Python ML implementation with Pandas, scikit-learn using Jupyter on AWS.

CI/CD development using GitLab and Ansible.

DevOps

Subject: Cloud Data Lake DevOps; AWS DevOps

Provisioning and deployment of big data solutions on AWS. Operationalize cloud data solutions, implementing infrastructure as code (IaC), using CloudFormation templates for resource management. Provisioning and deploying on-demand Redshift cluster and RDS instances using CloudFormation. Development, management, and deployment of Docker images and containers.

Provisioning resources using CloudFormation templates.

Provisioning of Redshift, Data Pipeline, and Glue ETL pipelines.

User account and access management in IAM.

Developing Docker images for batch processing applications and Python ML models, using Amazon Elastic Container Registry (Amazon ECR).

Docker container deployment using Amazon ECS.

CI/CD implementation using GitLab.
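To make the infrastructure-as-code idea concrete, the sketch below builds a minimal CloudFormation template as a Python dictionary and serializes it to JSON. It is only an illustration: it provisions a single encrypted, versioned S3 bucket rather than the Redshift clusters and RDS instances mentioned above, and the function name `data_lake_template` and the logical ID `DataLakeBucket` are assumptions.

```python
import json


def data_lake_template(bucket_logical_id: str = "DataLakeBucket") -> dict:
    """Return a minimal CloudFormation template for one S3 bucket (illustrative)."""
    return {
        "AWSTemplateFormatVersion": "2010-09-09",
        "Description": "Illustrative data lake storage bucket",
        "Resources": {
            bucket_logical_id: {
                "Type": "AWS::S3::Bucket",
                "Properties": {
                    # Keep earlier versions of objects that are overwritten.
                    "VersioningConfiguration": {"Status": "Enabled"},
                    # Encrypt objects at rest with S3-managed keys.
                    "BucketEncryption": {
                        "ServerSideEncryptionConfiguration": [
                            {"ServerSideEncryptionByDefault": {"SSEAlgorithm": "AES256"}}
                        ]
                    },
                },
            }
        },
        # Ref on an S3 bucket resolves to the generated bucket name.
        "Outputs": {"BucketName": {"Value": {"Ref": bucket_logical_id}}},
    }


template_json = json.dumps(data_lake_template(), indent=2)
```

The resulting JSON could be saved to a file and kept in version control, which is what lets the environment be re-created on demand from the template.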

Data Engineering

Subject: Azure Cloud Data Lake Development; Azure Big Data Engineering

Data lake development on Microsoft Azure. Data migration from RDBMS and file sources, data loading into Azure Blob storage and Azure SQL. Design and development of big data batch solutions using Data Factory and Databricks. Massive data warehouse development using Azure SQL Data Warehouse.

Creating ETL workflows in Data Factory, with monitoring and management of Data Factory ETL pipelines.

Batch processing in Azure Databricks using Scala Spark.

Data warehouse design and development using SQL Data Warehouse.

DWH data model design, featuring index and partitioning table design.

Accessing and processing big data in Blob storage via Transact-SQL using PolyBase.

CI/CD development using SBT and GitLab.

DevOps

Subject: Cloud Data Lake DevOps; Azure DevOps

Data Factory and Databricks provisioning and deployment. Operationalization of cloud data solutions and infrastructure as code (IaC) implementation using ARM templates and Azure Python SDK for resource management. Azure SQL data warehouse provisioning and deployment. CI/CD implementation using Azure DevOps tools. Development, management, and deployment of Docker images and containers.

Azure resources (VM and storage account, SQL DB and network) provisioning using Azure Python SDK and ARM template.

SQL data warehouse provisioning with Databricks and Data Factory integration, using Python scripts and ARM templates, with Azure Key Vault for deployment.

User account and role-based (RBAC) access management in Azure Active Directory.

Docker image development for batch processing applications and ML model APIs, using Azure Container Registry for build, storage, and management of images.

Azure container deployment on ACI (Azure Container Instances).

CI/CD implementation via Azure Repos, Azure Artifacts, Azure Pipelines, and Azure Test Plans.

NLP Project

Industry: AI/Automatization

Subject: Implementation of supervised Machine Learning Algorithm for automatic keyphrase extraction.

Implementation of automated Context Tagger for a B2B Marketing automated AI solution. Text classification models are implemented in Python using Python Text Mining, NLP and other ML and data analysis libraries (Python Data Science and ML stack). Text mining, data processing, and feature engineering of a massive dataset in Spark.

Design and implementation of a very fast multi-threaded Akka-based stream (SAX/StAX) processing of XML data for transforming huge XML data to CSV format.

Preprocessing of the data by filtering and normalizing text content and applying spaCy and NLTK.

Text mining and data preprocessing in Spark SQL Scala on Hadoop and S3.

Training of embedding and language models using fastText, Gensim, and GPT-2.

Multi-class, multi-label text classification using CNN and word embedding models using Keras and PyTorch.

Modeling term-to-tag relations in massive graph networks in TigerGraph.

Keyphrase extraction (automatic tagging) using N-grams, Word2vec scoring, and the PageRank algorithm on massive graphs of tag-to-tag relations.

FP-Growth association rules learning.

Distributed CNN training in Docker containers on AWS using GPU instances.
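The PageRank step in the task list above can be illustrated on a toy graph. The sketch below is a standard power-iteration formulation in plain Python, not the graph-database implementation the project used; the tag names and edges are invented, and the damping factor of 0.85 is the conventional default rather than a value taken from the project.

```python
def pagerank(graph, damping=0.85, iterations=100):
    """Power-iteration PageRank over a dict of node -> list of linked nodes."""
    nodes = set(graph) | {t for targets in graph.values() for t in targets}
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Every node receives the same baseline "teleport" share.
        new_rank = {node: (1.0 - damping) / n for node in nodes}
        for node in nodes:
            targets = graph.get(node, [])
            if targets:
                share = damping * rank[node] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A node with no outgoing edges spreads its rank evenly.
                share = damping * rank[node] / n
                for target in nodes:
                    new_rank[target] += share
        rank = new_rank
    return rank


# Hypothetical tag-to-tag relations: an edge means "co-occurs with / points to".
tag_graph = {"aws": ["s3", "glue"], "s3": ["aws"], "glue": ["s3"]}
scores = pagerank(tag_graph)
top_tag = max(scores, key=scores.get)
```

Tags that many other well-ranked tags point to end up with the highest scores, which is what makes the ranking usable for choosing candidate keyphrases.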

Recommendation Prediction Model

Industry: E-Commerce

Subject: ML Model implementation for Recommendation Models

User tracking data is used in training ML models for user-profiling, recommendation, and prediction. RNN and CNN models are developed and trained for enrichment of user-profiles. Classification GBM (Gradient Boosting Machine) on extracted and learned features. Workflow implementation for data engineering and continuous model training implementation in Airflow.

Feature engineering using Spark SQL by joining and aggregating user tracking data.

Keras and TensorFlow implementation for training RNN and CNN models.

Using Spark ML for training gradient boosting classifiers.

Cross-validation, F1-score evaluation, hyperparameter optimization.

Containerized Spark standalone cluster, using Docker Compose for local deployment and AWS container services.
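For readers unfamiliar with the F1-score evaluation listed above, here is a small worked example for a binary classifier. It is a plain-Python sketch with made-up labels, not the Spark ML evaluation used in the project; the function name `f1_score` is simply a label for this example.

```python
def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are zero or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Made-up labels: 2 true positives, 1 false positive, 1 false negative.
y_true = [1, 1, 0, 0, 1]
y_pred = [1, 0, 0, 1, 1]
score = f1_score(y_true, y_pred)  # precision = recall = 2/3, so F1 = 2/3
```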

Serverless Data Lakes on Amazon Web Services (AWS)

Loan Marketplace Harnesses Its Data Using Serverless Data Lakes

WCI Data Solutions was able to help the largest small business loan marketplace in the United States harness their data by designing and developing a serverless data lake service on Amazon Web Services (AWS) with Amazon S3 as the primary storage platform. 

Our customer is the largest small business loan marketplace in the United States. Small business owners utilize their free online service to find financing by browsing multiple loan products from a network of more than 75 lenders. The platform reviews metrics including the business's financial projections, use of funds, industry, and monthly revenue to find the right loan option for borrowers, helping traditionally underserved groups like women and minority-owned small businesses, mom and pop shops, and seasonal businesses. Their matching engine allows small business borrowers the opportunity to comparison shop across a broad range of lenders and loan products, decreasing their time and effort, and optimizing which lenders to send applications to.

Our customer came to us with an interest in the development and deployment of a Data Lake solution on Amazon Web Services. They wanted a system capable of supporting both analysis and reporting across the organization and designed to accommodate future consumers, query methods, and new data sources. They wanted to begin this development with an internally developed system we’ll refer to as their “CRM”. The CRM application is also deployed in AWS and uses Amazon Aurora as a backend. This will be the first data source used by the WCI-developed solution, with other sources to follow. The data will be consumed by the organization using both Domo and Power BI.

The engagement required utilizing a broad array of services on AWS, including:

Amazon Elastic Compute Cloud (Amazon EC2), Amazon Relational Database Service (Amazon RDS), AWS Glue, Amazon Athena, Amazon CloudWatch, Amazon Simple Queue Service (Amazon SQS), AWS Lambda, Amazon Virtual Private Cloud (Amazon VPC) peering, AWS Identity and Access Management (IAM), and Amazon Simple Storage Service (Amazon S3).

Serverless Data Lake Architecture on AWS

WCI assisted the customer in solving this problem by designing and building a serverless data lake service on AWS with Amazon S3 as the primary storage platform. Utilizing an Amazon S3-based data lake architecture allows our customer the means to have a centralized, secure, and highly durable cloud-based storage platform. AWS makes it easy to store data in any format, both securely and at a massive scale with Amazon S3. In an effort to reduce both complexity and cost of development, WCI decided to utilize Amazon Athena as a query service to produce views that could then be used, in conjunction with Domo, to provide valuable business metrics and insights from the CRM data.
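Producing a reporting view with Athena might be sketched as follows. The helper only assembles the request parameters, so it runs without AWS credentials; the database, view, table, and bucket names are invented, and nothing here is taken from WCI's actual solution. The parameter names match those accepted by the boto3 Athena client's `start_query_execution` call.

```python
def athena_view_request(database, view_name, select_sql, output_location):
    """Build the parameters for an Athena query that creates or replaces a view."""
    query = f"CREATE OR REPLACE VIEW {view_name} AS {select_sql}"
    return {
        "QueryString": query,
        "QueryExecutionContext": {"Database": database},
        "ResultConfiguration": {"OutputLocation": output_location},
    }


# Hypothetical names throughout: database, view, table, and results bucket.
request = athena_view_request(
    database="crm_lake",
    view_name="monthly_applications",
    select_sql=(
        "SELECT date_trunc('month', created_at) AS month, count(*) AS applications "
        "FROM loan_applications GROUP BY 1"
    ),
    output_location="s3://example-athena-results/views/",
)

# With AWS credentials configured, the request could be submitted like this:
#   import boto3
#   boto3.client("athena").start_query_execution(**request)
```

A BI tool would then query the view by name, so the reporting layer does not need to know how the underlying files are laid out in S3.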

With the use of WCI’s solution, our customer now has the ability to scale their data analytics platform as they continue to add data from new sources, regardless of whether it be structured or unstructured data. The new cost-efficient and cloud-based architecture provides business end-users a data analytics process that is both expedient and lends itself to being future proof. Using AWS native services like Athena, Glue and Lambda enables the ability to continue improving data analysis without the burden of maintaining infrastructure or complex ETL processes, and with WCI’s assistance, our customer now has a secured end-to-end framework for their data and analytics practice on AWS.

Our customer can now bring in more data from a variety of sources, with options for future integrations and for use in other business-critical functions. They can analyze their data with different analytics tools and services, deriving more insights and providing more value for their customers, suppliers, and partners. The WCI-developed solution lets them gain deeper insights into their data than traditional data silos allow, which in turn helps them continue to support their customers better than any other lending platform.

About WCI Data Solutions

WCI is a consulting services company focused on applying data through Business Intelligence and Data Warehouse technologies to increase business performance for our clients. WCI has served well over 300 companies, drawing on our knowledge of how to make data valuable to decision-makers.
