Explore Cloudera Self-Guided Tours

Cloudera and Carahsoft have partnered to provide a series of self-guided tours for Cloudera's enterprise-ready Artificial Intelligence solutions. Similar to a live demonstration, these in-depth walkthroughs explore Cloudera's wide array of use cases that can help meet you and your organization’s unique IT needs.


Learn about Cloudera’s Artificial Intelligence solutions by starting a self-guided tour below or schedule time with your dedicated Cloudera representative for personalized insights.


Cloudera AI Self-Guided Tour

Cloudera AI Self-Guided Tour

Cloudera Government Solutions simplifies data acquisition and delivery, offering public sector agencies the insights needed to stay informed and execute timely decision-making. We collaborate with Federal, State and Local Governments, as well as higher education institutions, to ensure compliance with data security and governance requirements. Cloudera’s open source solutions further the modernization of data architecture across various platforms, meeting Zero Trust requirements for data flow. Public sector agencies can leverage Cloudera to advance their mission through expedited data movement and digital transformation.

Want to learn more about Cloudera?
Start a self-guided demo now to learn more about data lifecycle management, threat detection and AI analytics.
1 of 6

Cloudera Data Platform (CDP)

Cloudera Data Platform (CDP) stands as a government-centric data cloud tailored for public sector needs. Within CDP, government entities efficiently oversee and fortify the entirety of the data life-cycle, encompassing data collection, enrichment, analysis, experimentation, and predictive endeavors. This hybrid data platform empowers agencies with freedom of choice, allowing them to leverage any cloud, analytics, or data source to fulfill their mission objectives. CDP ensures faster and simpler data management and analytics across diverse data landscapes, delivering optimal performance, scalability, and security.


  • Integrated data platform promoting agility across departments, enhances IT efficiency and security, ultimately boosting overall organizational productivity.
  • Same data management and analytic capabilities seamlessly across private and public clouds.
  • Shared Data Experience (SDX) ensures consistent data security, governance, and control across the data life-cycle and across all environments.
2 of 6

CDP Machine Learning (CML)

CDP Machine Learning fosters seamless collaboration among data science teams by offering immediate access to enterprise data pipelines, scalable compute resources, and preferred tools. This streamlined approach extends to deploying analytic workloads into production, empowering agencies to effectively manage machine learning use cases across diverse departments and projects. The platform's suite of native and robust tools for deploying, serving, and monitoring models further optimizes ML workflows, ensuring efficient governance and automation of model cataloging. Additionally, CDP Machine Learning facilitates seamless collaboration across CDP experiences, including CDP Data Warehouse and CDP Operational Database, ultimately enhancing data-driven decision-making capabilities.


  • Full visibility from data source to production environment. Enables transparent workflow and collaboration across teams securely.
  • The Data discovery and Visualization feature allows users to discover, query, and visualize data in a single UI.
  • AMPs – projects that can be deployed with one click directly from CML – enables users to go from an idea to a full ML use case. Providing end-to-end framework for building, deploying, and monitoring ready ML applications.
  • Containerized ML workspaces that provide access to project environments and automatically elastic compute resources.
3 of 6

CDP DataFlow (CDF)

Cloudera DataFlow, a cloud-native data service, harnesses the power of Apache NiFi to streamline the entire data movement process, ensuring seamless and secure universal data distribution. With DataFlow Deployments, agencies can gain access to a cloud-native runtime environment that leverages auto-scaling Kubernetes clusters, enabling efficient resource utilization and optimal performance. Moreover, the centralized monitoring and alerting features empower agencies to maintain stringent oversight over their deployments, ensuring compliance with federal regulations and enhancing operational efficiency. This scalable solution allows agencies to independently scale flow deployments as per their requirements, fostering agility and adaptability within their data management infrastructure.


  • CDF streamlines operations for federal agencies by seamlessly converting UI actions into CLI statements, simplifying the deployment of NiFi flows to a single command and enhancing resource allocation efficiency.
  • CDF offers dual auto-scaling for NiFi flows, adjusting based on CPU utilization within set boundaries and enabling Flow Metrics Scaling to predict traffic on initial connections, prioritizing scaling based on backpressure metrics or CPU utilization.
  • DataFlow Functions offers the deployment of NiFi flows as serverless functions on cloud providers like AWS Lambda, Azure Functions, and Google Cloud Functions, catering to use cases that don't require continuous operation, enabling developers to prioritize agency missions over operational management, and adopting a pay-for-value model with serverless architecture.
  • NiFi's processor library facilitates seamless data connections across diverse sources, while users can efficiently monitor, manage, and deploy flows through a unified dashboard, incorporating KPI alerts and ReadyFlows for effective use case addressing.
4 of 6

CDP Stream Processing (CSP)

Cloudera Stream Processing (CSP) leverages the combination of Apache Flink and Kafka to provide users with the ability to analyze streaming data and turn it into actionable insights. By offering enterprise-grade stream management and stateful processing capabilities, CSP ensures robust handling of data streams. This solution integrates Kafka as the storage substrate and Flink as the core processing engine, providing a seamless environment for developers, data analysts, and data scientists. With support for interfaces like SQL and REST, CSP enables the development of complex hybrid streaming data pipelines, facilitating the creation of real-time data products, dashboards, and more.


  • Cloudera's Streaming Analytics offers low-latency stream processing capabilities, enabling users to derive insights from streaming data instantly.
  • Cloudera Streaming Analytics simplifies the development process, allowing users to write streaming applications efficiently.
  • Cloudera Streams Messaging offers tools like Streams Messaging Manager for cluster monitoring and operation, ensuring reliable and efficient data processing.
  • Streams Replication Manager facilitates HA/DR deployments, ensuring data availability and resilience in case of failures or disasters.
5 of 6

CDP Data Hub

CDP Data Hub, part of Cloudera Data Platform (CDP) Public Cloud, facilitates high-value analytics seamlessly from the Edge to AI within a familiar cloud cluster model. Offering diverse analytical workloads – from streaming and ETL to data marts and machine learning – it enables easy migration of existing workloads or direct cloud-based development. Powered by Cloudera Runtime and build on SDX, this cloud-native solution provides a flexible range of cluster shapes, workload options, templates, and configurations. CDP Data Hub ensures an intuitive experience tailored for users accustomed to traditional architectures.


  • Data Hub empowers users with flexibility, scalability, and ease of use, enabling them to rearrange worker roles, configure GPU support, and adjust resource management settings to implement complex, multi-function analytics at scale.
  • Clusters can be quickly provisioned and disposed of, offering pre-built or custom configuration options for infrastructure. Pre-configured cluster definitions and templates allow for rapid deployment of workload clusters tailored to specific use cases, enhancing operational efficiency.
  • Users can provision multiple clusters on shared data, enabling the launch of new applications with full isolation, security, and governance without disrupting existing production applications. This feature ensures smooth integration of new functionalities without compromising on security or performance.
  • Data Hub facilitates the easy migration of legacy workloads to a cloud model, maintaining familiarity in the form factor. Its cloud-based architecture decouples data from compute infrastructure, improving flexibility, agility, data protection, and scalability.
6 of 6

CDP Operational Database

Accelerate the development and deployment of critical applications with CDP Operational Database, a versatile database-as-a-service built on Apache HBase. This solution facilitates rapid creation of future-proof applications designed to accommodate evolving data needs. With features like auto-scale, auto-heal, and auto-tune, CDP Operational Database streamlines and automates database management, empowering developers to focus on innovation. Integrated seamlessly within Cloudera Data Platform (CDP), it offers end-to-end visibility and security through SDX, while enabling effortless integration with other CDP services such as Data Engineering, Machine Learning, and Data Warehouse.


  • CDP Operational Database offers both SQL and NoSQL interfaces along with wide-column support, allowing developers to choose the best storage, indexing, and querying methods for their applications.
  • Features like auto-tune and auto-heal minimize overhead by enhancing database performance and automatically detecting and resolving failures, simplifying database operations management.
  • CDP Operational Database enables seamless active-active architectures across on-premises and multi-cloud environments, facilitating quick and hassle-free migration of applications with real-time replication and compatibility with existing HBase deployments.
  • The platform ensures near-infinite scalability, dynamically adjusting to application needs without the complexity of manual sizing or shard management.

Cloudera's Benefits Snapshot:


  • 100% Open Source: Open compute and open storage ensures zero vendor lock-in and maximum interoperability.
  • Data Security and Compliance: Sustains consistent data security and governance across all environments.
  • Hybrid and Multi-Cloud: Delivers the same data management capabilities across all clouds and data centers.