Data Science 简明教程
Data Science - Careers
有许多工作与数据科学家领域相关或重叠。
There are several jobs available that are linked to or overlap with the field of data scientist.
与数据科学相关的工作清单−
List of jobs related to data science −
以下是与数据科学家相关的工作清单。
Below is a list of jobs that are related to data scientists.
-
Data Analyst
-
Data Scientist
-
Database Administrator
-
Big Data Engineer
-
Data Mining Engineer
-
Machine Learning Engineer
-
Data Architect
-
Hadoop Engineer
-
Data Warehouse Architect
Data Analyst
数据分析师分析数据集,以找出客户相关问题的解决方案。此信息也由数据分析师传达给管理层和其他利益相关者。这些人从事于各种领域的工作,包括商业、银行、刑事司法、科学、医疗和政府。
A data analyst analyses data sets to identify solutions to customer-related issues. This information is also communicated to management and other stakeholders by a data analyst. These people work in a variety of fields, including business, banking, criminal justice, science, medical, and government.
数据分析师是具有专业知识和能力,可以将原始数据转换为可用于做出业务决策的信息和见解的人。
A data analyst is someone who has the expertise and abilities to transform raw data into information and insight that can be utilized to make business choices.
Data Scientist
数据科学家是一个使用分析、统计和编程能力获取海量数据的人。他们有义务利用数据为组织的特定需求创建个性化解决方案。
A Data Scientist is a professional who uses analytical, statistical, and programming abilities to acquire enormous volumes of data. It is their obligation to utilize data to create solutions that are personalized to the organization’s specific demands.
公司在日常运营中越来越依赖数据。数据科学家检查原始数据并从中提取有意义的含义。然后,他们利用这些数据来识别趋势并提供业务增长和竞争所需的解决方案。
Companies are increasingly relying on data in their day-to-day operations. A data scientist examines raw data and pulls meaningful meaning from it. They then utilize this data to identify trends and provide solutions that a business needs to grow and compete.
Database Administrator
数据库管理员负责管理和维护业务数据库。数据库管理员负责执行数据管理策略,并确保企业数据库在发生内存丢失时可操作并得到备份。
Database administrators are responsible for managing and maintaining business databases. Database administrators are responsible for enforcing a data management policy and ensuring that corporate databases are operational and backed up in the case of memory loss.
数据库管理员(有时称为数据库经理)管理业务数据库,以确保信息得到安全维护,并且只有获得授权的人员才能访问。数据库管理员还必须保证这些人能够在他们需要的时间以他们需要的方式访问他们需要的信息。
Database administrators (sometimes known as database managers) administer business databases to ensure that information is maintained safely and is only accessible to authorized individuals. Database administrators must also guarantee that these persons have access to the information they need at the times they want it and in the format they require.
Big Data Engineer
大数据工程师为一家公司创建、测试和维护使用大数据解决方案。他们的工作是从许多不同来源收集大量数据,并确保以后使用数据的人员能够快速轻松地获得这些数据。大数据工程师基本上确保公司的 data pipeline 可扩展、安全且能够为多个用户服务。
Big data engineers create, test, and maintain solutions for a company that use Big Data. Their job is to gather a lot of data from many different sources and make sure that people who use the data later can get to it quickly and easily. Big data engineers basically make sure that the company’s data pipelines are scalable, secure, and able to serve more than one user.
当今制作和使用的数据量似乎是无穷无尽的。问题是如何保存、分析和展示这些信息。大数据工程师处理这些问题的办法和技术。
The amount of data made and used today seems to be endless. The question is how this information will be saved, analyzed, and shown. A big data engineer works on the methods and techniques to deal with these problems.
Data Mining Engineer
数据挖掘是对信息进行分类的过程,以查找企业可用于改善其系统和操作的答案。如果数据没有以正确的方式进行处理和展示,那么它就没有多大用处。
Data mining is the process of sorting through information to find answers that a business can use to improve its systems and operations. Data isn’t very useful if it isn’t manipulated and shown in the right way.
数据挖掘工程师设置并运行用于存储和分析数据的系统。总体任务包括设置数据仓库、组织数据以使其易于查找以及安装数据流经的管道。数据挖掘工程师需要知道数据的来源、使用方法以及谁将使用它。ETL(提取、转换和加载)是一个数据挖掘工程师的关键首字母缩略词。
A data mining engineer sets up and runs the systems that are used to store and analyze data. Overarching tasks include setting up data warehouses, organizing data so it’s easy to find, and installing conduits for data to flow through. A data mining engineer needs to know where the data comes from, how it will be used, and who will use it. ETL, which stands for "extract, transform, and load," is the key acronym for a data mining engineer.
Machine Learning Engineer
机器学习 (ML) 开发人员知道如何使用数据训练模型。然后,这些模型用于自动执行诸如图像分组、语音识别和市场预测之类的事情。
A machine learning (ML) developer knows how to train models with data. The models are then used to automate things like putting images into groups, recognising speech, and predicting the market.
机器学习可用于不同的角色。数据科学家和人工智能 (AI) 工程师的工作往往存在一定的重叠之处,有时这两个工作甚至会相互混淆。机器学习是人工智能的一个子领域,专注于查看数据以查找输入和预期输出之间的联系。
Different roles can be given to machine learning. There is often some overlap between the jobs of a data scientist and an AI (artificial intelligence) engineer, and sometimes the two jobs are even confused with each other. Machine learning is a subfield of AI that focuses on looking at data to find connections between what was put in and what was wanted to come out.
机器学习开发人员确保每个问题都有一个完美的解决方案。只有通过仔细处理数据并选择最适合情况的算法,才能获得最佳结果。
A machine learning developer makes sure that each problem has a solution that fits it perfectly. Only by carefully processing the data and choosing the best algorithm for the situation can you get the best results.
Data Architect
数据架构师通过找到设置和构建它的最佳方式来构建和管理公司的数据库。他们与数据库管理员和分析师合作以确保公司数据易于访问。任务包括创建数据库解决方案、确定需求和编写设计报告。
Data architects build and manage a company’s database by finding the best ways to set it up and structure it. They work with database managers and analysts to make sure that company data is easy to get to. Tasks include making database solutions, figuring out what needs to be done, and making design reports.
数据架构师是一位设计组织的数据策略的专家,其中包括数据质量标准、数据在组织内的流动方式以及数据的安全存储方式。数据管理专业人员看待事物的态度将业务需求转化为技术需求。
A data architect is an expert who comes up with the organization’s data strategy, which includes standards for data quality, how data moves around the organisation, and how data is kept safe. The way this professional in data management sees things is what turns business needs into technical needs.
作为业务和技术之间的关键纽带,对数据架构师的需求正与日俱增。
As the key link between business and technology, data architects are becoming more and more in demand.
Hadoop Engineer
Hadoop 开发人员负责创建和编码 Hadoop 应用程序。Hadoop 是一个开源框架,用于管理和存储处理大量数据并在集群系统上运行的应用程序。基本上,Hadoop 开发人员构建应用程序来帮助公司管理和跟踪其大数据。
Hadoop Developers are in charge of making and coding Hadoop applications. Hadoop is an open-source framework for managing and storing applications that work with large amounts of data and run on cluster systems. Basically, a Hadoop developer makes apps that help a company manage and keep track of its big data.
Hadoop 开发人员负责编写 Hadoop 应用程序的代码。此角色类似于软件开发人员。这两个角色非常相似,但前者属于大数据领域。我们来看看 Hadoop 开发人员必须做什么以更好地了解此角色。
A Hadoop Developer is the person in charge of writing the code for Hadoop applications. This job is like being a Software Developer. The jobs are pretty similar, but the first one is in the Big Data domain. Let’s look at some of the things a Hadoop Developer has to do to get a better idea of what this job is about.
Data Warehouse Architect
数据仓库架构师负责提出数据仓库解决方案并使用标准数据仓库技术来提出计划,以最大程度地帮助企业或组织。在设计特定架构时,数据仓库架构师通常会考虑雇主的目标或客户的需求。然后,该架构可以由工作人员维护并用于实现目标。
Data warehouse architects are responsible coming up with solutions for data warehouses and working with standard data warehouse technologies to come up with plans that will help a business or organization the most. When designing a specific architecture, data warehouse architects usually take into account the goals of the employer or the needs of the client. This architecture can then be maintained by the staff and used to achieve the goals.
因此,就像普通建筑师设计建筑物或海军建筑师设计船舶一样,数据仓库架构师设计和帮助启动数据仓库,根据客户的需求对其进行定制。
So, just like a regular architect designs a building or a naval architect designs a ship, data warehouse architects design and help launch data warehouses, customizing them to meet the needs of the client.
Data Science Job Trends 2022
到 2022 年,对数据科学家的需求将大幅增长。IBM 表示,在 2020 年将创造 364,000 到 2,720,000 个新工作岗位。这种需求将持续增长,很快将出现 700,000 个职位空缺。
By 2022, there will be a big rise in the need for data scientists. IBM says that between 364,000 and 2,720,000 new jobs will be created in the year 2020. This demand will continue to rise, and soon there will be a 700,000 openings.
Glassdoor 表示,其网站上的头号工作是数据科学家。未来,这个职位不会有任何改变。也有人认为数据科学的职位空缺将持续 45 天。这比平均劳动力市场长五天。
Glassdoor says that the top job on its site is for a Data Scientist. In the future, nothing will change about this position. It is also looked at that the job openings in data science are open for 45 days. This is five days longer than the average job market.
IBM 将与学校和企业合作,为有抱负的数据科学家创造一个工作学习环境。这将有助于缩小技能差距。
IBM will work with schools and businesses to create a work-study environment for aspiring data scientists. This will help close the skills gap.
对数据科学家的需求以二次方速度增长。这是因为新的工作和行业已经产生。数据的不断增加和不同类型的数据加剧了这种情况。
The need for data scientists is growing at a rate that is a power of two. This is because new jobs and industries have been created. This is made worse by the growing amount of data and the different kinds of data.
未来,数据科学家只会扮演更多角色,而且数量也会更多。数据科学家的工作包括数据工程师、数据科学经理和大数据架构师。此外,金融和保险行业正成为数据科学家的最大雇主之一。
In the future, there will only be more roles for data scientists and more of them. Data scientist jobs include data engineer, data science manager, and big data architect. Also, the financial and insurance sectors are becoming some of the biggest employers of data scientists.
随着培训数据科学家的机构越来越多,使用数据的人也可能越来越多。
As the number of institutes that train data scientists grows, it is likely that more and more people will know how to use data.