PwC Labs - Data Engineer

Full Time
Remote
Job description
A career in Products and Technology is an opportunity to bring PwC's strategy to life by driving products and technology into everything we deliver. Our clients expect us to bring the right people and the right technology to solve their biggest problems; Products and Technology is here to help PwC meet that challenge and accelerate the growth of our business. We have skilled technologists, data scientists, product managers and business strategists who are using technology to accelerate change. Our team establishes and builds processes and structures based on business and technical requirements to channel data from multiple inputs, route it appropriately, and store it using any combination of distributed (cloud) structures, local databases, and other applicable storage forms as required. We develop the data sets and pipelines that support analysis and model development, including improving data quality, integrating disparate data sources, enhancing and transforming data, and developing performant infrastructure for access and reporting. We also design, build and oversee the deployment and operation of technology architecture, solutions and software to capture, manage, store and utilize structured and unstructured data from internal and external sources.


To really stand out and make us fit for the future in a constantly changing world, each and every one of us at PwC needs to be an authentic and inclusive leader, at all grades/levels and in all lines of service. To help us achieve this, we have the PwC Professional, our global leadership development framework. It gives us a single set of expectations across our lines, geographies and career paths, and provides transparency on the skills we need as individuals to be successful and progress in our careers, now and in the future.

As an Associate, you'll work as part of a team of problem solvers, helping to solve complex business issues from strategy to execution. PwC Professional skills and responsibilities for this management level include but are not limited to:

  • Invite and provide evidence-based feedback in a timely and constructive manner.
  • Share and collaborate effectively with others.
  • Work with existing processes/systems whilst making constructive suggestions for improvements.
  • Validate data and analysis for accuracy and relevance.
  • Follow risk management and compliance procedures.
  • Keep up-to-date with technical developments for business area.
  • Communicate confidently in a clear, concise and articulate manner - verbally and in written form.
  • Seek opportunities to learn about other cultures and other parts of the business across the Network of PwC firms.
  • Uphold the firm's code of ethics and business conduct.


Responsibilities:

As a Data Engineer you will focus on the design and build out of data models, codification of business rules, mapping of data sources to the data models (structured and unstructured), engineering of scalable ETL pipelines, development of data quality solutions, and continuous evaluation of technologies to enhance the broader Innovation group.

Job Requirements and Preferences:

Basic Qualifications:

Minimum Degree Required:
Bachelor Degree

Additional Educational Requirements:

Bachelor's degree or in lieu of a degree, demonstrating, in addition to the minimum years of experience required for the role, three years of specialized training and/or progressively responsible work experience in technology for each missing year of college.

Minimum Years of Experience:
1 year(s)

Preferred Qualifications:

Degree Preferred:
Master Degree

Preferred Fields of Study:
Data Processing/Analytics/Science, Business Analytics, Computer and Information Science, Mathematics, Management Information Systems, Engineering, Computer Engineering, Electrical Engineering, Industrial Engineering, Systems Engineering

Certification(s) Preferred:

  • CCP Data Engineer Exam (DE575)
  • CCA Spark and Hadoop Developer (CCA175)
  • Oracle Certified Professional, Java SE 8 Programmer
  • Certified Professional in Python Programming Level 1 or 2

Preferred Knowledge/Skills:

Demonstrates some abilities and/or a proven record of success, as both an individual contributor and team member, with identifying and addressing business and client needs including:

  • Designing data integrations and data quality frameworks utilizing cloud computing platforms such as AWS, GCP and Azure;
  • Working within relational databases and writing SQL queries;
  • Demonstrating knowledge of Python and experience with data extraction, data cleansing and data wrangling;
  • Working with machine learning toolkits such as SparkML, messaging systems (Kafka) and NoSQL databases (Cassandra, HBase, MongoDB);
  • Building data lakes, performing data analysis to troubleshoot data-related issues, and assisting in their resolution;
  • Demonstrating expertise in object-oriented/object function scripting languages such as Python, R, C/C++, Java, Scala, etc.;
  • Determining the appropriate software packages or modules to run, and how easily they can be modified;
  • Managing large scale structured and unstructured data;
  • Architecting highly scalable distributed data pipelines using open source tools and big data technologies such as Hadoop, Pig, Hive, Presto, Spark, Drill, Sqoop and ETL frameworks;
  • Utilizing Linux shell scripting and containerization technologies (Docker, Kubernetes);
  • Leveraging knowledge and skills in data modeling, data mapping, data governance and the processes and technologies commonly used in this space;
  • Demonstrating proficiency in data integration tools (e.g. Talend, SnapLogic, Informatica) and data warehousing / data lake tools;
  • Working within Agile and Scrum methodologies;
  • Working within Relational SQL, distributed SQL and NoSQL databases including, but not limited to, MSSQL, PostgreSQL, MySQL, MemSQL, CrateDB, MongoDB, Cassandra, Neo4j, AllegroGraph, ArangoDB, etc.;
  • Possessing intermediate knowledge of data modeling tools such as ERWin, Enterprise Architect, Visio, etc.;
  • Demonstrating intermediate knowledge of data pipeline and workflow management tools such as Azkaban, Luigi, Airflow, etc.; and,
  • Demonstrating intermediate knowledge of Tableau, PowerBI, Zoomdata, Pentaho.

Demonstrates some abilities and/or a proven record of success, as both an individual contributor and team member, by:

  • Collaborating with analytics and business teams to improve data models;
  • Implementing processes and systems to monitor data quality, ensuring production data is always accurate and available for key stakeholders and business processes that depend on it;
  • Building enterprise data pipelines and crafting code in SQL and Python;
  • Building batch data pipelines with relational and columnar database engines and understanding their respective strengths and weaknesses;
  • Applying computer science fundamentals such as data structures, algorithms, programming languages, distributed systems, and information retrieval;
  • Transforming and analyzing large data sets and deriving insights using data analytics tools;
  • Working with Graph databases and graph modeling; and,
  • Working with the requirements of data science teams.

At PwC, our work model includes three ways of working: virtual, in-person, and flex (a hybrid of in-person and virtual). Visit the following link to learn more: https://pwc.to/ways-we-work.

PwC does not intend to hire experienced or entry level job seekers who will need, now or in the future, PwC sponsorship through the H-1B lottery, except as set forth within the following policy: https://pwc.to/H-1B-Lottery-Policy.

All qualified applicants will receive consideration for employment at PwC without regard to race; creed; color; religion; national origin; sex; age; disability; sexual orientation; gender identity or expression; genetic predisposition or carrier status; veteran, marital, or citizenship status; or any other status protected by law. PwC is proud to be an affirmative action and equal opportunity employer.

For positions based in San Francisco, consideration of qualified candidates with arrest and conviction records will be in a manner consistent with the San Francisco Fair Chance Ordinance.

For positions in Albany (NY), California, Colorado, Nevada, New York City, Washington State, or Westchester County (NY), please visit the following link for pay range information: https://pwc.to/payrange-v1-productstechassociate2

