DataLake

Enabling Accelerated Data Management, Data Engineering and AI Pipelines

Overview

A data lake is a centralized repository that allows you to store all your structured and unstructured data at any scale. Zeblok AI-Data Lake is a high-performance data store that allows you to import, filter, and instantly analyze objects. Our solution is designed for performance, scales up with your data, and can provide industry-grade SSL security and data redundancy for high availability of data at SSD speeds.

Seamless accessibility from almost anywhere

low-latency Zeblok Ai-Data Lake handles the tasks of data pipeline, analysis, and propagation so that you can focus on what matters most. Whether you are ingesting images, video data files coming from IoT sensors at Edge locations through 4G/5G low low-latency network, or uploading large CSV files, Zeblok Ai-Data Lake has you covered, with unprecedented data acceptance, performance, and compatibility.

Easily automate your imports, with large files using various methods such as: Browser-based portal, Zeblok Magic Commands, S3 REST API, Data Lake drag-and-drop native Windows App, and HTTP importing.

Data Browser Portal:

Zeblok Ai-Data Lake provides secure high-availability of your files and objects across a cluster of servers. Authorized users can access and manage all their data in one place through the convenience of the Data Browser Portal. The Browser Portal can securely allow a user to set permissions and upload/download/edit data of various file types and sizes. Users can see a map of files in their bucket without touching a command line.

Zeblok Magic Commands:

Zeblok's Magic Commands are ubiquitously available in every Zeblok notebook instance to allow users easy access to a reliable object storage solution. With Magic Commands, users have an array of options that allow for a quick and easy way to add blazing-fast storage to virtually any network. Magic Commands are intuitive, especially for those familiar with the common Linux/Unix file system. The command line offers users the ability to create new files, delete old ones, create new folders, share access, and more.

REST API:

Inspired by the S3 protocol originally from AWS, Zeblok's Ai-Data Lake provides a secure endpoint that enables users to programmatically upload and manage files using a REST API. The generic S3 REST API offers a unified interface for interacting with any object and provides high-performance uploads and file transfers that can be scheduled or automated between Zeblok notebook instances or external hosts (such as via cron jobs).

HTTP Source:

Zeblok AI-Data Lake also allows users to upload and download files from various HTTP sources without re-downloading files on the local machine. This space-saving method enables users to skip steps between upload and download cycles as desired.

Use the Ai-Data Lake within the Ai-WorkStation Notebook:

Access your data files residing in the Ai-Data Lake within the Ai-WorkStation notebook using the BlazingSQL engine that runs on GPUs

Blazingly fast queries with big data*:

Run big data analytics across Zeblok AI-Data Lake using our query-in-place GPU-enhanced services. Perform blazing fast queries (up to 8x faster*) with SQL expressions and use our Zeblok AI algorithms to analyze data that is stored across your private AI-Data Lake account across various notebooks. Export queries back to Ai-Data Lake or generate customization visual reports with ease.​

Data Lake Security:

The Zeblok AI-Data Lake platform is designed to be scalable and easy to use with a granular access control system where each user only has access to their own files. The platform features implementations for data encryption both in transit and at rest. With recommended SSL certificates installed, the platform offers standard bank-level encryption, which is 256-bit AES, the standard for advanced cryptography. The encryption standards available in the Data Lake platform protect your data from being intercepted by third parties. Furthermore, the REST API endpoints are hardened to only operate on pre-provisioned key/secret pairs.

Last updated

Was this helpful?