IBM InfoSphere DataStage integrates data across multiple and high volumes of data sources and target applications.
It integrates data on demand with a high performance parallel framework, extended metadata management, and enterprise connectivity.
Supports the collection, integration and transformation of large volumes of data, with data structures ranging from simple to highly complex.
Offers scalable platform that enables companies to solve large-scale business problems through high-performance processing of massive data volumes
Supports real-time data integration.
Enables developers to maximize speed, flexibility and effectiveness in building, deploying, updating and managing their data integration infrastructure.
Completes connectivity between any data source and any application
Key Product Highlights:
The powerful ETL solution supports the collection, integration and transformation of large volumes of data, with data structures ranging from simple to highly complex. IBM InfoSphere DataStage manages data arriving in real-time as well as data received on a periodic or scheduled basis.
The scalable platform enables companies to solve large-scale business problems through high-performance processing of massive data volumes. By leveraging the parallel processing capabilities of multiprocessor hardware platforms, IBM InfoSphere DataStage Enterprise Edition can scale to satisfy the demands of ever-growing data volumes, stringent real-time requirements, and ever shrinking batch windows.
Comprehensive source and target support for a virtually unlimited number of heterogeneous data sources and targets in a single job includes text files; complex data structures in XML; ERP systems such as SAP and PeopleSoft; almost any database (including partitioned databases); web services; and business intelligence tools like SAS.
Real-time data integration support operates in real-time. It captures messages from Message Oriented Middleware (MOM) queues using JMS or WebSphere MQ adapters to combine data into conforming operational and historical analysis perspectives.It provides a service-oriented architecture (SOA) for publishing data integration logic as shared services that can be reused across the enterprise. These services are capable of simultaneously supporting high-speed, high reliability requirements of transactional processing and the high volume bulk data requirements of batch processing.
Advanced maintenance and development enables developers to maximize speed, flexibility and effectiveness in building, deploying, updating and managing their data integration infrastructure. Full data integration reduces the development and maintenance cycle for data integration projects by simplifying administration and maximizing development resources.
IBM InfoSphere DataStage is consist of:
Administrator - Administers DataStage projects, manages global settings and interacts with the system. Administrator is used to specify general server defaults, add and delete projects, set up project properties and provides a command interface to the datastage repository. With Datastage Administrator users can set job monitoring limits, user privileges, job scheduling options and parallel jobs default.
Manager - it's a main interface to the Datastage Repository, allows its browsing and editing. It displays tables and files layouts, routines, transforms and jobs defined in the project. It is mainly used to store and manage reusable metadata.
Designer - used to create DataStage jobs which are compiled into executable programs. is a graphical, user-friendly application which applies visual data flow method to develop job flows for extracting, cleansing, transforming, integrating and loading data.
Director - manages running, validating, scheduling and monitoring DataStage jobs. It’s mainly used by operators and testers.
CRMT is reseller and implementatin partner for IBM Infosphere Datastage. Please contact us on info@crmt.com or click here.