What is Data as a Service Platform
DaaS Platform is a self-service analytics platform that runs on Kubernetes in the cloud. It simplifies data access, accelerates analytical processing, secures and masks data, curates datasets, and provides a unified catalog across all data sources. Consumers such as Tableau, BI tools, data science platforms, and dashboard tools can therefore treat all data as a single, high-performance data view, reached through ODBC or JDBC, regardless of where the data resides or how it is structured.
DaaS Platform allows a user to create a DaaS Stack in the dpStudio UI by configuring data access options such as data source locations and data structure settings.
dpStudio also allows a user to manage the lifecycle of a DaaS Stack (start, update, terminate) and to monitor its runtime behavior through built-in Observability features, which track data usage and data structures and observe the internal health of the components.
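The stack lifecycle described above (start, update, terminate) can be pictured as a small state machine. The following Python sketch is illustrative only: the DaasStack class, the StackState values, and the setting names are hypothetical and are not part of dpStudio or any snapblocs API. It assumes a stack must be started before it can be updated or terminated, and that a terminated stack cannot be restarted.

```python
from dataclasses import dataclass, field
from enum import Enum


class StackState(Enum):
    """Hypothetical lifecycle states for a DaaS Stack."""
    CONFIGURED = "configured"
    RUNNING = "running"
    TERMINATED = "terminated"


# Assumed rules: which lifecycle actions are permitted in each state.
ALLOWED_ACTIONS = {
    StackState.CONFIGURED: {"start"},
    StackState.RUNNING: {"update", "terminate"},
    StackState.TERMINATED: set(),
}


@dataclass
class DaasStack:
    """Illustrative stand-in for a stack configured in the dpStudio UI."""
    name: str
    data_sources: list[str]
    settings: dict[str, str] = field(default_factory=dict)
    state: StackState = StackState.CONFIGURED

    def _require(self, action: str) -> None:
        # Reject any action that is not valid in the current state.
        if action not in ALLOWED_ACTIONS[self.state]:
            raise ValueError(
                f"cannot {action} stack '{self.name}' while {self.state.value}"
            )

    def start(self) -> None:
        self._require("start")
        self.state = StackState.RUNNING

    def update(self, **changes: str) -> None:
        # Merge new settings into the stack; the stack stays running.
        self._require("update")
        self.settings.update(changes)

    def terminate(self) -> None:
        self._require("terminate")
        self.state = StackState.TERMINATED
```

For example, a stack built with DaasStack(name="demo", data_sources=["s3://example-bucket/sales"]) accepts start(), then update(region="us-east-1"), then terminate(); calling start() again afterwards raises ValueError. Encoding the permitted actions in one table keeps the lifecycle rules in a single place, which is only one of several reasonable ways to model this.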
DaaS Platform includes the following software components:
AWS EKS is used to provision stacks of DaaS Platform using the customer's AWS account.
Google GKE is used to provision stacks of DaaS Platform using the customer's Google account.
snapblocs provisions DaaS Platforms following well-architected guidelines (e.g., the AWS Well-Architected Framework and the Google Cloud Architecture Framework) for provisioning and configuring production-grade Kubernetes clusters and deploying workloads into them. This applies patterns that have been used successfully by many customers in production environments. snapblocs also makes it easy to get started and to configure a stack properly.
Kubernetes is an open-source container-orchestration system for automating application deployment, scaling, and management. It is used to deploy the selected Components.
Dremio is a data lake engine that runs live, interactive queries directly on cloud data lake storage.
Elastic is used to provide observability (monitoring, alerting, APM), answering questions about what is happening inside the system by observing its external outputs.
Grafana is used to build visualizations and analytics for querying, visualizing, and exploring metrics, and to set alerts that quickly identify system problems and minimize disruption to services.
Superset is an open-source data exploration and data visualization tool that handles data at petabyte scale. Superset is fast, lightweight, intuitive, and loaded with options that make it easy for users of all skill sets to explore and visualize their data, from simple line charts to highly detailed geospatial charts.