Describe the Snowflake Schema.
A Snowflake Schema is an extension of the Star Schema that resembles a snowflake in shape. It normalizes the dimension tables, splitting their data out into additional tables.
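As a sketch, a snowflaked dimension might look like the following; the table and column names (`dim_product`, `dim_category`, `fact_sales`) are illustrative, with SQLite standing in for a real warehouse:

```python
import sqlite3

# Toy snowflake schema: the product dimension is normalized so that
# category details live in their own table instead of being repeated
# in every product row.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_category (
    category_id   INTEGER PRIMARY KEY,
    category_name TEXT
);
CREATE TABLE dim_product (
    product_id   INTEGER PRIMARY KEY,
    product_name TEXT,
    category_id  INTEGER REFERENCES dim_category(category_id)
);
CREATE TABLE fact_sales (
    sale_id    INTEGER PRIMARY KEY,
    product_id INTEGER REFERENCES dim_product(product_id),
    amount     REAL
);
INSERT INTO dim_category VALUES (1, 'Electronics');
INSERT INTO dim_product  VALUES (10, 'Laptop', 1);
INSERT INTO fact_sales   VALUES (100, 10, 999.0);
""")

# Querying now takes one extra join compared to a star schema.
row = conn.execute("""
    SELECT c.category_name, SUM(f.amount)
    FROM fact_sales f
    JOIN dim_product p  ON f.product_id = p.product_id
    JOIN dim_category c ON p.category_id = c.category_id
    GROUP BY c.category_name
""").fetchone()
print(row)  # ('Electronics', 999.0)
```

The extra join is the trade-off: less redundancy in the dimension tables, slightly more work per query.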
What do you know about FSCK?
File System Check, or FSCK, is a command that HDFS provides. It checks files for inconsistencies and problems and reports them; unlike a disk-level fsck, it does not repair them.
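In practice the check is run from the command line against an HDFS path. A minimal wrapper, assuming the `hdfs` binary is on the PATH and a cluster is running, might be sketched as:

```python
import subprocess

def build_fsck_command(path="/"):
    # -files, -blocks and -locations add per-file and per-block detail
    # to the report; fsck only reports problems, it does not fix them.
    return ["hdfs", "fsck", path, "-files", "-blocks", "-locations"]

def run_fsck(path="/"):
    # Requires a running HDFS cluster with the `hdfs` binary on PATH;
    # here we only construct and invoke the command.
    return subprocess.run(build_fsck_command(path),
                          capture_output=True, text=True).stdout

print(build_fsck_command("/"))
```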
How is a big data solution deployed?
This is one of a few big data engineer interview questions you might encounter. Here's how you can deploy a big data solution:
- Combine data from many sources, including RDBMS, SAP, MySQL, and Salesforce.
- Save the extracted data in a NoSQL database or an HDFS file system.
- Utilize a processing framework to process the stored data.
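The steps above can be sketched end to end as a toy pipeline. Here a CSV string stands in for the source systems and an in-memory SQLite database for the storage layer; both are simplifications, not the real RDBMS/HDFS components:

```python
import csv
import io
import sqlite3

# 1. Ingest: pretend this CSV came from an RDBMS or Salesforce export.
source = io.StringIO("id,amount\n1,10\n2,32\n")

# 2. Store: load the extracted rows into the storage layer
#    (SQLite here stands in for HDFS or a NoSQL store).
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE raw_sales (id INTEGER, amount INTEGER)")
db.executemany(
    "INSERT INTO raw_sales VALUES (?, ?)",
    [(int(r["id"]), int(r["amount"])) for r in csv.DictReader(source)],
)

# 3. Process: aggregate the stored data (a MapReduce/Spark stand-in).
total = db.execute("SELECT SUM(amount) FROM raw_sales").fetchone()[0]
print(total)  # 42
```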
Describe the Star Schema.
A star schema, often known as a star join schema, is the most fundamental type of data warehouse model. It is called a star schema due to its structure. The Star Schema allows for numerous related dimension tables and one fact table in the star’s center. This model is ideal for querying large data collections.
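A minimal star schema can be sketched like this; the table names (`fact_orders`, `dim_date`) are illustrative, with SQLite standing in for a real warehouse:

```python
import sqlite3

# Toy star schema: one central fact table joined directly to a
# denormalized dimension table.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_date (
    date_id INTEGER PRIMARY KEY,
    year    INTEGER,
    month   INTEGER
);
CREATE TABLE fact_orders (
    order_id INTEGER PRIMARY KEY,
    date_id  INTEGER REFERENCES dim_date(date_id),
    revenue  REAL
);
INSERT INTO dim_date    VALUES (1, 2024, 1), (2, 2024, 2);
INSERT INTO fact_orders VALUES (101, 1, 50.0), (102, 1, 25.0), (103, 2, 40.0);
""")

# A single join from the fact table to each dimension is all a
# star-schema query needs.
rows = conn.execute("""
    SELECT d.month, SUM(f.revenue)
    FROM fact_orders f
    JOIN dim_date d ON f.date_id = d.date_id
    GROUP BY d.month ORDER BY d.month
""").fetchall()
print(rows)  # [(1, 75.0), (2, 40.0)]
```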
What does COSHH stand for?
COSHH stands for Classification and Optimization based Scheduler for Heterogeneous Hadoop systems. It schedules tasks at both the application and cluster levels to reduce task completion time.
Describe the attributes of Hadoop.
The following are key attributes of Hadoop:
- Open-source, freeware framework
- Compatible with a wide range of hardware, simplifying access to new hardware within a given node
- Enables faster distributed data processing
- Stores data in the cluster, separate from the other operations
- Allows the creation of…
What happens when Block Scanner finds a faulty data block?
First, the DataNode alerts the NameNode. The NameNode then creates a new replica from a healthy copy of the corrupted block.
The goal is to bring the replication count of the good replicas in line with the replication factor. If a match is found, the corrupted data block won't be removed.
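The bookkeeping behind that decision can be modeled with a toy function; the real logic lives inside the NameNode, and this only mirrors the comparison it performs:

```python
def needs_more_replicas(healthy_replicas, replication_factor):
    """Toy model of the NameNode's repair decision.

    The NameNode schedules new replicas (copied from healthy ones)
    until the count of good replicas matches the replication factor.
    """
    return healthy_replicas < replication_factor

# One healthy replica lost to corruption under a factor of 3:
print(needs_more_replicas(2, 3))  # True
# Back at full strength, no further copies are scheduled:
print(needs_more_replicas(3, 3))  # False
```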
Explain HDFS's Block and Block Scanner.
A block is the smallest component of a data file; Hadoop automatically divides large files into these small, workable segments. The Block Scanner, by contrast, verifies the list of blocks stored on a DataNode, checking them for corruption.
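The block-splitting arithmetic itself is simple and can be sketched directly (128 MB is the default block size in recent Hadoop releases; older versions defaulted to 64 MB):

```python
def split_into_blocks(size_bytes, block_size=128 * 1024 * 1024):
    """Return the sizes of the HDFS-style blocks a file would occupy.

    Files are cut into fixed-size blocks; only the last block may be
    smaller than the block size.
    """
    full, rest = divmod(size_bytes, block_size)
    return [block_size] * full + ([rest] if rest else [])

# A 300 MB file becomes two full 128 MB blocks plus a 44 MB remainder:
mb = 1024 * 1024
print([b // mb for b in split_into_blocks(300 * mb)])  # [128, 128, 44]
```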
Expand on HDFS.
HDFS stands for Hadoop Distributed File System. This file system handles extensive data collection and runs on commodity hardware, i.e., inexpensive computer systems.
Describe streaming in Hadoop.
Hadoop Streaming enables the construction of map and reduce jobs from any executable or script and the submission of those jobs to a particular cluster.
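A streaming job's mapper and reducer are just programs that read lines and emit key/value pairs; the word-count logic at their core can be sketched as plain functions (the submission command in the comment is illustrative, with made-up paths):

```python
from itertools import groupby

def mapper(lines):
    # Emit (word, 1) pairs, one per token; a streaming mapper would
    # write these as tab-separated lines on stdout.
    for line in lines:
        for word in line.split():
            yield word, 1

def reducer(pairs):
    # Streaming delivers mapper output sorted by key, so equal words
    # are adjacent and can be summed in a single pass.
    for word, group in groupby(sorted(pairs), key=lambda kv: kv[0]):
        yield word, sum(count for _, count in group)

counts = dict(reducer(mapper(["big data big"])))
print(counts)  # {'big': 2, 'data': 1}

# On a cluster this would be submitted with something like:
#   hadoop jar hadoop-streaming.jar \
#     -input /in -output /out \
#     -mapper mapper.py -reducer reducer.py
```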