Our client requires the services of a Data Engineer/Scientist (Senior) - Midrand/Menlyn/Rosslyn/Home as possible ROLE: Data Engineers are responsible for building and maintaining Big Data Pipelines using GROUP Data Platforms. Data Engineers are custodians of data and must ensure that data is shared in line Unix Big Data PowerShell / Bash ADVANTAGEOUS TECHNICAL SKILLS Demonstrate expertise in data modelling complex data sets. Perform thorough testing and data validation to ensure the accuracy of data transformations
Our client requires the services of a Data Engineer/Scientist (Expert) - Midrand/Menlyn/Rosslyn/Home as possible ROLE: Data Engineers are responsible for building and maintaining Big Data Pipelines using GROUP Data Platforms. Data Engineers are custodians of data and must ensure that data is shared in line Boto3 ETL Docker Linux / Unix Big Data PowerShell / Bash GROUP Cloud Data Hub (CDH) GROUP CDEC Blueprint Knowledge of data formats such as Parquet, AVRO, JSON, XML, CSV. Experience working with Data Quality Tools
Ensures that the required data collection sheets are filled out (master data & customizing) Ensures processes, e.g., Bank Statement Processing, Bank Account Management, In House Bank and Cash Management
Identify and document the mainframe applications and data sets to be migrated. Ensure projects/maintenance issues. Ensure compliance with GROUP regulations and data protection standards during the migration process strategy for migrating mainframe data to the cloud, considering data integrity, security, and minimal best practices for the cloud environment. Ensure data encryption, access controls, and compliance with
understanding of all stock locations and monitor data associated with the locations for anomalies. Also risks. Review data quality relating to stock and look for anomalies in data and missing SMI data, errors in
process partners and other departments and maintaining data consistency across departments and process partners Azure or AWS or SAP BTP knowledge PostgreDB, Python data analysis Power Apps or other low code tools SAP
management and maintenance and preparation of test data. Coordination between development and support environments end (Web), back end (API) and integration. Test data management. Performance, security and load testing elicitation, documentation. Business process modelling, data modelling IT Architecture: Cloud Architecture, On-prem/hybrid
Deployment Understanding GROUP's CA Data Management / “Follow the Data” as a VDI deployment approach. Driving certification Any operating system certification relating to data management Any programming certification Any web
input in terms of benefits and risks. Preparing test data for testing of user stories Execute and/or support process owners Preparing cut-over strategy, e.g., data migration Go-Live preparation and post Go-Live Support
and perform internal testing Preparation of Master Data templates for various objects like Material Master Functional Specifications for them Preparing test data for testing of CR's (Change Requests) Testing CR's