Conduit Deployment Diagrams
Conduit supports multiple deployment architectures. It is scalable by design: configurations range from a single box to multiple servers deployed across different data centers to ensure high availability.
Additionally, Conduit is designed to interface with different scalable processing engines, e.g. Spark, BlazingSQL with GPU, and Databricks clusters.
HA stands for Conduit running in High Availability mode.
TABLE OF CONTENTS
- 1 Deployment types
- 1.1 1. Single Box Deployment
- 1.2 2. SingleBox + Spark/HDFS cluster attached
- 1.3 3. HA + Cloud Storage
- 1.4 4. HA + HDFS
- 1.5 5. HA + Spark/HDFS cluster attached
- 1.6 6. SingleBox + Cloud Storage (S3, AZBS, GCS)
- 1.7 7. HA + Spark cluster + Cloud Storage
- 1.8 8. Conduit HA + BlazingSQL cluster + Cloud Storage / HDFS
- 1.9 9. Conduit HA + Databricks cluster + Cloud Storage
- 1.10 10. Conduit HA + Cloudera CDH
- 2 Use cases
- 3 Related articles
Conduit can run in High Availability mode in the following environments:
on-premises
Google Cloud
Azure
Please contact us for support regarding installing and running Conduit in High Availability mode.
Deployment types
1. Single Box Deployment
The Data Store is configured to use the local file system on the Conduit VM.
2. SingleBox + Spark/HDFS cluster attached
3. HA + Cloud Storage
3+ VMs for Conduit services
one node runs the Spark driver, chosen by leader election
all nodes are Spark workers
Spark master and master failover on at least 2 nodes
storage in the cloud
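The role layout listed above can be sketched as a small planning function. This is only an illustration, not Conduit's configuration format: the names `plan_ha_roles` and `elect_driver`, the VM names, the role labels, and the "lowest name among live VMs" election rule are all assumptions made for the example. The source does not say how leader election is actually implemented.

```python
def elect_driver(live_vms):
    """Pick the single Spark driver node among the live VMs.

    Assumption: a deterministic "lowest name wins" rule stands in for
    whatever leader election mechanism a real deployment uses.
    """
    if not live_vms:
        raise ValueError("no live VMs to elect a Spark driver from")
    return min(live_vms)


def plan_ha_roles(vms, spark_masters=2):
    """Assign roles to VMs for an HA + cloud storage layout.

    Returns a dict mapping each VM name to the set of roles it hosts.
    Role labels are hypothetical and used only for this sketch.
    """
    ordered = sorted(set(vms))
    if len(ordered) < 3:
        raise ValueError("HA needs at least 3 VMs for Conduit services")
    if not 2 <= spark_masters <= len(ordered):
        raise ValueError("Spark master must run on at least 2 of the VMs")

    driver = elect_driver(ordered)
    plan = {}
    for index, name in enumerate(ordered):
        # Every node runs Conduit services and a Spark worker.
        roles = {"conduit", "spark-worker"}
        # Spark master plus its failover peers go on the first N nodes.
        if index < spark_masters:
            roles.add("spark-master")
        # Exactly one node hosts the Spark driver.
        if name == driver:
            roles.add("spark-driver")
        plan[name] = roles
    return plan


if __name__ == "__main__":
    for vm, roles in plan_ha_roles(["vm-a", "vm-b", "vm-c"]).items():
        print(vm, sorted(roles))
```

If the driver node fails, calling `elect_driver` again on the remaining live VMs yields the replacement under the same assumed rule.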
4. HA + HDFS
3+ VMs for Conduit services
one node runs the Spark driver, chosen by leader election
all nodes are Spark workers
Spark master and master failover on at least 2 nodes
storage on HDFS deployed on all 3 nodes (the only option on-premises)
an HDFS Data Node (DN) on every node
the HDFS Name Node (NN) on one of the VMs
the HDFS standby (Stby) on one of the VMs, different from the VM hosting the NN
each VM has a bounded area with its HDFS components
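The HDFS placement rules above lend themselves to a simple consistency check. The sketch below is a hypothetical helper, not part of Conduit or Hadoop: the function name `validate_hdfs_layout`, the dictionary shape, and the role labels "DN", "NN" and "Stby" (borrowed from the abbreviations in the list) are assumptions made for illustration.

```python
def validate_hdfs_layout(layout):
    """Check an HA + HDFS layout against the placement rules.

    `layout` maps a VM name to the set of HDFS roles it hosts, using the
    hypothetical labels "DN" (Data Node), "NN" (Name Node) and "Stby"
    (standby). Returns a list of problems; an empty list means valid.
    """
    problems = []

    if len(layout) < 3:
        problems.append("need at least 3 VMs")

    # Every node must host a Data Node.
    missing_dn = sorted(vm for vm, roles in layout.items() if "DN" not in roles)
    if missing_dn:
        problems.append("no Data Node on: " + ", ".join(missing_dn))

    # One Name Node and one standby, placed on different VMs.
    nn = [vm for vm, roles in layout.items() if "NN" in roles]
    stby = [vm for vm, roles in layout.items() if "Stby" in roles]
    if len(nn) != 1:
        problems.append("expected exactly one Name Node")
    if len(stby) != 1:
        problems.append("expected exactly one standby")
    if set(nn) & set(stby):
        problems.append("Name Node and standby must be on different VMs")

    return problems


if __name__ == "__main__":
    layout = {
        "vm-1": {"DN", "NN"},
        "vm-2": {"DN", "Stby"},
        "vm-3": {"DN"},
    }
    print(validate_hdfs_layout(layout) or "layout is valid")
```

Returning a list of problems instead of raising on the first one lets a reviewer see every violated rule in a proposed layout at once.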
5. HA + Spark/HDFS cluster attached
only one Spark driver in the Conduit VM farm, chosen by leader election
6. SingleBox + Cloud Storage (S3, AZBS, GCS)
7. HA + Spark cluster + Cloud Storage
8. Conduit HA + BlazingSQL cluster + Cloud Storage / HDFS
9. Conduit HA + Databricks cluster + Cloud Storage
10. Conduit HA + Cloudera CDH
Use cases
OnPremise to Cloud Migration Platform