The Principal Data Pipeline Engineer is responsible for performing thorough data processing, testing and validation in order to support accuracy of data transformations, and data verification, from flow processing through machine learning models. The Engineer strives to ensure proper data governance and quality across the data and analytics organization, and across the business as a whole.
The ideal candidate should be an analytical and creative thinker, an innovative problem solver, and be self-motivated and proactive. The candidate should also be highly organized and be comfortable handling multiple and simultaneous tasks to meet aggressive deadlines, while demonstrating an exceptional ability to stay calm and composed in the face of adversity.
Maintains data storage of, and access to, multiple disparate datasets, from traditional databases to streaming sources to support data integration and data reporting needs at scale to support CBRE 360 platform
Maintains data dictionary by revising and entering definitions and prepare high-level ETL mapping specifications
Identify, analyze, and interpret trends or patterns in complex data sets. Work with team leads to prioritize business and information needs
Manage storage and organization of data for ML processes, while maintaining compliance with appropriate privacy requirements
Working with application development teams, confirms project requirements by studying user requirements; conferring with others on the AI/ML team.
QUALIFICATIONS & EDUCATION
Bachelor's degree (BA/BS) in a related field such as information systems, mathematics, or computer science, or equivalent work experience. Typically has five to seven years of relevant work experience, with at least three years in an architectural or design capacity for 'large scale' enterprise systems
Demonstrated experience working with large and complex data sets, as well as experience analyzing large volumes of data
Strong working and conceptual knowledge of building and maintaining physical and logical data models and experience with business intelligence tools
Exceptional analytical skills, showing fluency in the use of tools common to a big-data ecosystem, including strong Python, Shell, Java/Scala, Hive/Pig, and SQL programming
Ability to clearly communicate capabilities, opportunities, and recommendations to both technical and nontechnical audiences
Has deep understanding of data architecture & data modeling best practices and guidelines for different data and analytic platforms.
Experience working with AWS, Azure, or similar cloud platform
Internal Number: 18019945
With broader and deeper capabilities than any other company, CBRE is the leading full-service real estate services and investment organization in the world.
CBRE Group, Inc. is the world’s largest commercial real estate services and investment firm, with 2017 revenues of $14.2 billion and more than 80,000 employees (excluding affiliate offices). CBRE has been included in the Fortune 500 since 2008, ranking #214 in 2017. It also has been voted the industry’s top brand by the Lipsey Company for 17 consecutive years, and has been named one of Fortune’s “Most Admired Companies” in the real estate sector for six years in a row. Its shares trade on the New York Stock Exchange under the symbol “CBRE.”
CBRE offers a broad range of integrated services, including facilities, transaction and project management; property management; investment management; appraisal and valuation; property leasing; strategic consulting; property sales; mortgage services and development services.