Skip to content
GigaSpaces Logo GigaSpaces Logo
  • Products
    • Our Products
      • eRAG
        • GenAI Catalyst
        • Instant Data
        • Respond Proactively
        • Act Autonomously
      • Smart DIH
      • XAP
    • Solutions for
      • Pharma
      • Procurement
    • vid-icon

      Conventional RAG Falls Short with Enterprise Databases

      Watch the Webinaricon
  • Solutions
    • Business Solutions
      • Digital Innovation Over Legacy Systems
      • Integration Data Hub
      • API Scaling
      • Hybrid / Multi-cloud Integration
      • Customer 360
      • Industry Solutions
      • Retail
      • Financial Services
      • Insurance Companies
    • vid-icon

      Massimo Pezzini, Gartner Analyst Emeritus

      5 Top Use Cases For Driving Business With Data Hub Architecture

      Watch the Webinaricon
  • How it Works
    • eRAG Technology Overview
      • AI-Ready, IT-Friendly
      • Semantic Reasoning
      • Questions to SQL Queries
      • Asked & Answered in Natural Language
      • Multiple Data Sources
      • Proactive AI Governance
    • vid-icon

      Ensure GenAI compliance and governance

      Read the Whitepapericon
  • Success Stories
    • By Use Case
      • Procurement
      • Operations
      • Budget Management
      • Sales Operations
      • Service Providers
      • Utilities Management
      • Restaurant Management
    • By Industry
      • Logistics
      • Pharma
      • Education
      • Retail
      • Shipping
      • Energy
      • Hospitality
    • vid-icon

      Monkey See, AI Do - All about CUA

      Watch Webinaricon
  • Resources
    • Content Hub
      • Case Studies
      • Webinars
      • Q&As
      • Videos
      • Whitepapers & Brochures
      • Events
      • Glossary
      • Blog
      • FAQs
      • Technical Documentation
    • vid-icon

      Taking the AI leap from RAG to TAG

      Read the Blogicon
  • Company
    • Our Company
      • About
      • Customers
      • Management
      • Board Members
      • Investors
      • News
      • Press Releases
      • Careers
    • col2
      • Partners
      • OEM Partners
      • System Integrators
      • Technology Partners
      • Value Added Resellers
      • Support & Services
      • Services
      • Support
    • vid-icon

      GigaSpaces, IBM & AWS make AI safer

      Read Howicon
  • Book a Demo
  • Products
    • Our Products
      • eRAG
        • GenAI Catalyst
        • Instant Data
        • Respond Proactively
        • Act Autonomously
      • Smart DIH
      • XAP
    • Solutions for
      • Pharma
      • Procurement
    • vid-icon

      Conventional RAG Falls Short with Enterprise Databases

      Watch the Webinaricon
  • Solutions
    • Business Solutions
      • Digital Innovation Over Legacy Systems
      • Integration Data Hub
      • API Scaling
      • Hybrid / Multi-cloud Integration
      • Customer 360
      • Industry Solutions
      • Retail
      • Financial Services
      • Insurance Companies
    • vid-icon

      Massimo Pezzini, Gartner Analyst Emeritus

      5 Top Use Cases For Driving Business With Data Hub Architecture

      Watch the Webinaricon
  • How it Works
    • eRAG Technology Overview
      • AI-Ready, IT-Friendly
      • Semantic Reasoning
      • Questions to SQL Queries
      • Asked & Answered in Natural Language
      • Multiple Data Sources
      • Proactive AI Governance
    • vid-icon

      Ensure GenAI compliance and governance

      Read the Whitepapericon
  • Success Stories
    • By Use Case
      • Procurement
      • Operations
      • Budget Management
      • Sales Operations
      • Service Providers
      • Utilities Management
      • Restaurant Management
    • By Industry
      • Logistics
      • Pharma
      • Education
      • Retail
      • Shipping
      • Energy
      • Hospitality
    • vid-icon

      Monkey See, AI Do - All about CUA

      Watch Webinaricon
  • Resources
    • Content Hub
      • Case Studies
      • Webinars
      • Q&As
      • Videos
      • Whitepapers & Brochures
      • Events
      • Glossary
      • Blog
      • FAQs
      • Technical Documentation
    • vid-icon

      Taking the AI leap from RAG to TAG

      Read the Blogicon
  • Company
    • Our Company
      • About
      • Customers
      • Management
      • Board Members
      • Investors
      • News
      • Press Releases
      • Careers
    • col2
      • Partners
      • OEM Partners
      • System Integrators
      • Technology Partners
      • Value Added Resellers
      • Support & Services
      • Services
      • Support
    • vid-icon

      GigaSpaces, IBM & AWS make AI safer

      Read Howicon
  • Book a Demo
  • Products
    • Our Products
      • eRAG
        • GenAI Catalyst
        • Instant Data
        • Respond Proactively
        • Act Autonomously
      • Smart DIH
      • XAP
    • Solutions for
      • Pharma
      • Procurement
  • Solutions
    • Digital Innovation Over Legacy Systems
    • Integration Data Hub
    • API Scaling
    • Hybrid/Multi-cloud Integration
    • Customer 360
    • Retail
    • Financial Services
    • Insurance Companies
  • How it Works
    • eRAG Technology Overview
      • AI-Ready, IT-Friendly
      • Semantic Reasoning
      • Questions to SQL Queries
      • Asked & Answered in Natural Language
      • Multiple Data Sources
      • Governance
  • Success Stories
    • By Use Case
      • Procurement
      • Operations
      • Budget Management
      • Sales Operations
      • Service Providers
      • Utilities Management
      • Restaurant Management
    • By Industry
      • Logistics
      • Pharma
      • Education
      • Retail
      • Shipping
      • Energy
      • Hospitality
  • Resources
    • Webinars
    • Videos
    • Q&As
    • Whitepapers & Brochures
    • Customer Case Studies
    • Events
    • Glossary
    • FAQs
    • Blog
    • Technical Documentation
  • Company
    • About
    • Customers
    • Management
    • Board Members
    • Investors
    • News
    • Press Releases
    • Careers
    • Partners
      • OEM Partners
      • System Integrators
      • Technology Partners
      • Value Added Resellers
    • Support & Services
      • Services
      • Support
  • Pricing
  • Book a Demo

Make Data-Driven Services Easy with Efficient Data Pipelines

229

Subscribe for Updates
Close
Back

BLOG

Make Data-Driven Services Easy with Efficient Data Pipelines

Dmitry Andreyev
May 22, 2023 /
11min. read

Key Takeaways

Set up data pipelines from any data source – using a mix of your own connectors OR Smart DIH connectors. This lets you:

  • Build digital services over an increasing number of diverse systems
  • Shorten time to value for new service delivery
  • Lower TCO by saving on connector licensing costs

Open Platform Design Empowers Data Professionals

This blog series so far has focused on two angles of Smart DIH’s Open Platform architecture:

The first blog in the series “How Open Platform Architecture is Reflected in the Data Hub Digitization Layer” discussed how support for the OpenAPI specification lets data professionals easily create and deploy APIs with low code. As a result, developers can share and reuse API definitions and build on each other’s work – all leading to faster and more agile service delivery.  

The second blog in the series covered SQL extensibility: How Smart DIH implements SQL to provide composability and flexibility, empowering data professionals to use familiar skill sets to develop data-driven services quickly. 

Quick reminder: Smart DIH is an operational data hub designed to speed up new data driven digital services and enable rapid development of new business apps by delivering the ‘always fresh – always on’ data that modern applications rely on. 

Smart DIH aggregates multiple back-end systems into a low-latency, scalable, high performance data layer exposing APIs and events. By decoupling systems of record from digital applications, Smart DIH lets enterprises drastically shorten the development and deployment of new digital services. With Smart DIH organizations can rapidly scale to serve millions of concurrent users – no matter which IT infrastructure or cloud topologies they rely on – cloud, on-prem or hybrid.

This third blog in our Open Platform series focuses on the flexibility of the Smart DIH ‘Pluggable Connector Framework’. The Pluggable Connector Framework allows data professionals to set up data pipelines rapidly – using connectors already deployed in their organization, or native Smart DIH connectors. This approach is at the heart of open platform design: Many platforms claim to have connectors that cover all data sources. However, this is rarely true. The Pluggable Connector Framework, on the other hand, connects to multiple data sources with a hybrid set of connectors: third party, home grown or native Smart DIH. This approach lowers costs, shortens time to value and eliminates redundant licensing costs.

Data Pipelines in a Nutshell 

Data pipelines are central to the Smart DIH data integration layer.  Data pipelines are well-defined flows that manage the data journey from a data source into the Smart DIH hosting layer. Data sources may be relational databases, no-SQL databases, object stores, file systems, or message brokers. Data may either be structured or semi-structured, and can be integrated as a stream or in batches. 

As noted, Smart DIH relies on its Pluggable Connector Framework to construct data pipelines from diverse underlying data sources using a hybrid approach: you can bring your own connector, or use a Smart DIH connector. This approach also allows the integration of connectors from multiple GigaSpaces partners to create the ultimate ecosystem for connectors to co-exist.

This design provides singular benefits derived from implementing Smart DIH as a fully pre-integrated solution. These include:

  • Standardization of the data journey, regardless of source and choice of connector. 
  • Real-time performance for event-driven updates of stream-based data pipelines, using frameworks such as Kafka, Flink and, of course, GigaSpaces’ own Space.
  • Rapid data load, required in initialization and recovery scenarios, using both Kafka and Space partitions.
  • Built-in continuous update mechanisms (CDC, incremental batch), enabling the addition of new tables and sources while keeping the data up to date for operational services.
  • Built-in data cleansing capabilities including validation and registration for the purpose of observability and governance. 
  • Built-in reconciliation mechanisms, to support various recovery and schema change scenarios

Unlike other integration methods which rely on SDKs and require programming, the Smart DIH Pluggable Connector Framework offers a declarative language approach that allows users to set up a data pipeline very quickly. The configuration file is written in simple syntax and hooks into any connector that supports Kafka. Ease-of-use does not limit functionality: The configuration file can support complex messaging logic, set CDC (Change Data Capture) rules, and define data modeling rules. 

Connect to Any Data Source, Reduce Costs and Speed Up Time to Value

The Open Platform philosophy behind the Pluggable Connector Framework provides many benefits to IT teams. It  contributes effectively to digital innovation projects designed to increase competitiveness and accelerate digital service delivery.  

  • Fast time to value: The ability to create data pipelines quickly with a simple configuration file, without having to rely on dedicated development teams, shortens time to value and allows organizations to better utilize the skill sets of IT teams. Ultimately, data access services can be exposed as soon as the data pipelines are set up – allowing data teams to deliver value fast. 
  • Lower TCO: By being able to hook into any connector that supports Kafka, Smart DIH lowers TCO for IT teams: They can use connectors already deployed in their organization, saving licensing and subscription fees for custom connectors.
  • Open Ecosystem: The Smart DIH Pluggable Connector Framework offers an open ecosystem that can support non-standard messaging formats used in home-grown or proprietary applications. No matter what the data source or message format – IT teams will still be able to easily create the data pipelines needed to power data access services. 

Many enterprises are facing a fierce competitive landscape. They are under great pressure to deliver new revenue streams, offer superior customer journeys and respond quickly to market demands. Digital modernization driven by modern data access services are key to success. Smart DIH, with its open architecture design, offers the flexibility IT teams need: It easily integrates data from diverse legacy systems on the south end, while quickly exposing data services to consuming applications on the north end. 

The Smart DIH Pluggable Connector Framework is key to helping data and IT teams realize this vision: the ability to use a  mix of connectors offers endless opportunities to expand modernization by building new digital services over an increasing number of underlying systems; the simplicity in which this is achieved with the Smart DIH declarative approach cuts development times and speeds up time to market. From here, the path to business agility, fast service roll-out and the rapid launch of new apps is a downhill ride. 

Tags:

data pipeline open platform
Dmitry Andreyev

Dmitry Andreyev is a Cloud and Data Architect at GigaSpaces. Dmitry has over 20 years of in-depth experience as a software engineer, team leader and architect - including analysis, design, architecture, development and management of complex software solutions, both on premises and for the cloud. He is currently a cloud and data architect in the GigaSpaces Innovation Team. Prior to joining GigaSpaces, Dmitry held software architect and software engineering positions at several global software solution providers.

All Posts (1)

Share this Article

Subscribe to Our Blog



PRODUCTS & SOLUTIONS

  • Products
    • eRAG
    • Smart DIH
    • XAP
  • Our Technology
    • Semantic Reasoning
    • Natural language to SQL
    • RAG for Structured Data
    • In-Memory Data Grid
    • Data Integration
    • Data Operations by Multiple Access Methods
    • Unified Data Model
    • Event-Driven Architecture

RESOURCES

  • Resource Hub
  • Webinars
  • Q&As
  • Blogs
  • FAQs
  • Videos
  • Whitepapers & Brochures
  • Customer Case Studies
  • Events
  • Use Cases
  • Analyst Reports
  • Technical Documentation

COMPANY

  • About
  • Customers
  • Management
  • Board Members
  • Investors
  • News
  • Careers
  • Contact Us
  • Book A Demo
  • Partners
  • OEM Partners
  • System Integrators
  • Value Added Resellers
  • Technology Partners
  • Support & Services
  • Services
  • Support
Copyright © GigaSpaces 2026 All rights reserved | Privacy Policy | Terms of Use
LinkedInXFacebookYouTube
Skip to content
Open toolbar Accessibility Tools

Accessibility Tools

  • Increase TextIncrease Text
  • Decrease TextDecrease Text
  • GrayscaleGrayscale
  • High ContrastHigh Contrast
  • Negative ContrastNegative Contrast
  • Light BackgroundLight Background
  • Links UnderlineLinks Underline
  • Readable FontReadable Font
  • Reset Reset
  • SitemapSitemap

Hey
tell us what
you need

You can unsubscribe from these communications at any time. For more information on how to unsubscribe, our privacy practices, and how we are committed to protecting and respecting your privacy, please review our Privacy Policy.

Hey , tell us what you need

You can unsubscribe from these communications at any time. For more information on how to unsubscribe, our privacy practices, and how we are committed to protecting and respecting your privacy, please review our Privacy Policy.

Oops! Something went wrong, please check email address (work email only).
Thank you!
We will get back to You shortly.