Big Data

大数据工程师职位描述模板

A Big Data Engineer is a person who creates and manages a company’s Big Data infrastructure and tools, and is someone that knows how to get results from vast amounts of data quickly.

Share

A Big Data Engineer is a person who creates and manages a company’s Big Data infrastructure and tools, and is someone that knows how to get results from vast amounts of data quickly.

该角色的实际定义各不相同,并且经常与 Data Scientist role. Here, 我们将假设这是一个专注于工程的角色, 不需要统计学和强大的机器学习技能.

The world of Big Data has grown significantly during the last decade; therefore, 技能开始变得更加具体. 而在大多数情况下,它是围绕Hadoop构建的, 有许多工具本身已经变得非常重要. 我们在下面的示例描述中介绍了一些常见的情况.

大数据工程师-职位描述和广告模板

复制此模板,并将其修改为自己的模板:

Company Introduction

{{写一段简短而醒目的关于你公司的文字. Make sure to provide information about the company culture, perks, and benefits. Mention office hours, remote working possibilities, 以及所有你认为能让你的公司有趣的东西. Big Data Engineers like to work on huge problems - mentioning the scale (or the potential) can help gain the attention of top talent.}}

Job Description

我们正在寻找一个大数据工程师,将收集工作, storing, processing, 以及对海量数据的分析. The primary focus will be on choosing optimal solutions to use for these purposes, then maintaining, implementing, and monitoring them. You will also be responsible for integrating them with the architecture used across the company.

Responsibilities

  • Selecting and integrating any Big Data tools and frameworks required to provide requested capabilities
  • Implementing ETL process {{如果从现有数据源导入数据是相关的}}
  • Monitoring performance and advising any necessary infrastructure changes
  • 定义数据保留策略
  • {{添加任何其他相关职责}}

Skills and Qualifications

  • 精通分布式计算原理
  • 管理Hadoop集群,包括所有的服务 {{除非你将拥有特定的大数据开发运维角色}}
  • 能够解决集群运行中出现的任何问题 {{除非你将拥有特定的大数据开发运维角色}}
  • 熟练使用Hadoop v2, MapReduce, HDFS
  • Experience with building stream-processing systems, using solutions such as Storm or Spark-Streaming {{如果流处理与角色相关}}
  • 熟悉大数据查询工具,如Pig、Hive、Impala
  • Experience with Spark {{如果您正在包含或计划包含它}}
  • 具有集成多个数据源数据的经验
  • 有使用NoSQL数据库的经验,如HBase, Cassandra, MongoDB
  • 了解各种ETL技术和框架,如Flume
  • 有各种消息传递系统的经验,如Kafka或RabbitMQ
  • 有使用大数据ML工具包的经验,如Mahout, SparkML或H2O {{if you are going to integrate Machine Learning in your Big Data infrastructure}}
  • Good understanding of Lambda Architecture, along with its advantages and drawbacks
  • 有使用Cloudera/MapR/Hortonworks的经验 {{you can specify the distribution you are currently using or planning to use here}}
  • {{列出您正在使用或计划使用的任何其他技术. 大多数大数据工程师都知道下面列出的一些: The Hadoop Ecosystem Table}}
  • {{列出您需要的教育程度或证书}}

Toptal Engineers最近的大数据文章

如何聘请优秀的大数据架构师

大数据是一个非常广泛的领域, 通常由数据科学家组成的混合团队来解决, software engineers, and statisticians. Real expertise in big data therefore requires far more than learning the ins and outs of a particular technology. This guide offers a sampling of effective questions to help evaluate the breadth and depth of a candidate's mastery of this complex domain.

Read Hiring Guide

现在就聘请顶级大数据架构师

Toptal是一个面向顶级大数据架构师的市场. Top companies and startups choose Toptal big data freelancers for their mission-critical software projects.

See Their Profiles

James Cahall

Freelance Big Data Architect
United StatesToptal的自由大数据开发人员 Since October 17, 2016

James is a results-driven, can-do, and entrepreneurial engineer with 15 years of C-level experience and 20+ years of professional engineering. He consistently delivers successful bleeding-edge products to support business goals. He's an architect in innovative tech initiatives that add to and accelerate business revenue streams. 他是OTTera公司的架构师和首席开发人员., a white-label OTT provider servicing 100+ OTT services and 2000+ Linear Channels used by over 500 million users.

Show More

George Kobiashvili

Freelance Big Data Architect
GeorgiaToptal的自由大数据开发人员 Since September 4, 2019

George is a seasoned systems engineer with great breadth and depth knowledge of building and automating complex systems. As an early adopter of cloud technology, he led a team to design and build an on-premise cloud. His 12 years of teaching developed the skill of coaching and communicating complex concepts. George is fluent in C, Go, and Python languages, 对数据科学和人工智能有浓厚兴趣, 专注于提供最高质量的结果. 他渴望处理复杂的问题.

Show More

Sung Jun (Andrew) Kim

Freelance Big Data Architect
AustraliaToptal的自由大数据开发人员 Since June 18, 2020

作为一名拥有20多年经验的高效技术领导者, Andrew专门研究数据:集成, conversion, engineering, analytics, visualization, science, ETL, big data architecture, analytics platforms, and cloud architecture. 他拥有构建数据平台的一系列技能, analytic consulting, trend monitoring, data modeling, data governance, and machine learning.

Show More

Bruno Machado Agostinho

Freelance Big Data Architect
BrazilToptal的自由大数据开发人员 Since June 18, 2020

For over the past decade, Bruno's been working with databases in various fields. He also has an Oracle SQL Expert certification and specializes in optimizing SQL queries and PL/SQL procedures, 但他也使用PostgreSQL和MySQL进行开发. Bruno likes to keep himself up to date, and that's why he's undertaking a Ph.D. in computer science.

Show More

Pieter van Beek

Freelance Big Data Architect
PortugalToptal的自由大数据开发人员 Since September 8, 2014

Pieter has 39 years of programming experience, including time spent as a software product manager. He is a challenger, an independent worker, 在情况需要的时候也要有团队精神, 他在一系列的话题上拥有专业知识和技能, including big data, cryptography, and machine learning.

Show More

Benjamin Li

Freelance Big Data Architect
CanadaToptal的自由大数据开发人员 Since November 3, 2021

Benjamin has over two decades of software and big data development experience, 包括数据建模和数据仓库设计. 他的活跃工具集包括Spark, Python, Scala, AWS, Azure, SQL, Hive, Linux, Microsoft BI solutions, C#.NET, and Java. His orientation to detail and strong analytical and problem-solving skills make him an excellent addition to any team. A kind and intentional communicator, Benjamin always produces high-quality work.

Show More

Daphne Liu

Freelance Big Data Architect
United StatesToptal的自由大数据开发人员 Since June 18, 2020

Daphne is a highly motivated big data analytic architect and SQL/Tableau developer with strong business analytic solution delivery skills and 20 years of progressively responsible OLTP/OLAP database development/architecture experience. She is a frequent seminar speaker and workshop trainer in business intelligence and analytic solutions. Daphne is experienced collaborating with business users in data modeling and business analytic solutions.

Show More

Tafsuth Boumali

Freelance Big Data Architect
FranceToptal的自由大数据开发人员 Since November 1, 2021

Tafsuth is a highly efficient and dedicated professional with a broad software and data engineering skillset. Her career assignments have ranged from building real-time prediction pipelines for startups to leading project teams and designing and maintaining large data lakes for Fortune 500 companies. Tafsuth感兴趣的是帮助企业做出数据驱动的决策, 她喜欢通过指导工程师来分享她的知识.

Show More

Lian Yagoda

Freelance Big Data Architect
IsraelToptal的自由大数据开发人员 Since November 26, 2021

Lian在不同的BI平台上有十年的经验, 作为BI开发人员和技术支持顾问. 她是数据建模专家, querying, manipulation, 以及数据输出的可视化, and she likes to use Sisense, Tableau, Qlik Sense, Power BI, and Looker. Lian enjoys using her skills to contribute to the exciting technological advances daily.

Show More

Piotr Pietruszka

Freelance Big Data Architect
PolandToptal的自由大数据开发人员 Since February 5, 2021

Piotr is a database developer with 12 years of experience in business intelligence projects as a back- and front-end developer. He designed and developed SQL ETL jobs to migrate a financial system from Oracle to SAP at the European Space Agency. Piotr擅长Oracle数据库, SQL, ETL processes development, 并创建高质量的报告. He has been working on big data projects and building data pipelines using Apache Spark technology for over three years.

Show More

Igor Gorbenko

Freelance Big Data Architect
United Arab EmiratesToptal的自由大数据开发人员 Since October 18, 2021

Igor is a data engineer and cloud architect with nearly 13 years of solid experience building high-load reliable systems, DWH, ETL, 以及俄罗斯天然气工业银行的机器学习管道, Stanford, GlaxoSmithKline, Fujitsu, AbbVie, and Royal Mail. He is a cloud-agnostic engineer specializing in Flask, FastAPI, and database integration. Igor is also keen on building GCP-based systems to leverage businesses to work more efficiently, gain more flexibility, and allow a strategic advantage.

Show More

在整个网络中发现更多大数据架构师

Start Hiring

Toptal Connects the Top 3% 世界各地的自由职业人才.

Join the Toptal community.