Data Engineering 2021 Kahoots

Date Kahoot Name
24 Nov 2021 Data Visualization
25 Nov 2021 Power BI Recap
7 Dec 2021 Power BI Recap 3
8 Dec 2021 Data Eng 1
9 Dec 2021 Relational And MongoDB
12 Jan 2022 Python Data Basics - Short
13 Jan 2022 Duplicate of Pandas

Kahoot 1

Kahoot Name: Data Visualization
Date: 24 Nov 2021

Question 1

Question: What is data visualisation?

Options: ['Form of communication', 'Method of passing information clearly', 'Making complex data more understandable', 'All answers are correct']

Correct Answer: All answers are correct

Question 2

Question: How many categories are represented in this bar graph?

Options: [40, 6, 0, 5]

Correct Answer: 6

Question 3

Question: Which graph(s) make(s) it easier to determine which investment has greater market share?

Options: ['A', 'B', 'Both', 'None']

Correct Answer: B

Question 4

Question: Line Charts are usually best for:

Options: ['Representing precise values', 'Accurate data comparison', 'Demonstrating trends', 'Presenting background details']

Correct Answer: Demonstrating trends

Question 5

Question: Which of these charts is easy to understand?

Options: [1, 2, 'Both', 'None']

Correct Answer: None

Question 6

Question: What was Steve's trick with the graph?

Options: ['Colours', 'Fonts', '3D Angle', 'Legend']

Correct Answer: 3D Angle

Question 7

Question: What makes Option 2 worse among those Treemaps?

Options: ['Fonts', 'Missing labels', 'Groupings', 'All of the above']

Correct Answer: All of the above

Question 8

Question: Which table more reliably highlights areas of poor performance?

Options: ['Option A', 'Option B', 'Both Are Equal']

Correct Answer: Option A

Question 9

Question: What's wrong with this chart?

Options: ['Bad Scale', 'Not enough points', 'Too many labels', 'Comment Boxes']

Correct Answer: Bad Scale


Kahoot 2

Kahoot Name: Power BI Recap
Date: 25 Nov 2021

Question 1

Question: Which of the following is True?

Options: ['A Power BI report can only contain data from a single source', 'Power BI can load data directly from an Oracle database', 'CSV files are the preferred file format when using Power BI']

Correct Answer: Power BI can load data directly from an Oracle database

Question 2

Question: Which best describes a Power BI "report"?

Options: ['A collection of data', 'A group of visualizations that may be spread across multiple pages', 'A chart or a graph', 'A view of a table of data']

Correct Answer: A group of visualizations that may be spread across multiple pages

Question 3

Question: How do you see a chart's raw data in the Power BI desktop client?

Options: ['The raw data is only available on the "Data" View', 'Click the (...) in the corner of the visualisation, click "Show as a table"', 'You must look at the original source of the data']

Correct Answer: Click the (...) in the corner of the visualisation, click "Show as a table"

Question 4

Question: What is a slicer in Power BI?

Options: ['A drilldown type', 'A tool that allows a user to capture a static image of a visualisation', 'A visualisation that allows viewers to filter the data shown on the page', 'A filtering option only available when loading an Excel file']

Correct Answer: A visualisation that allows viewers to filter the data shown on the page

Question 5

Question: Which best describes a "measure" in Power BI?

Options: ['A Table that returns specified values from a dataset', 'A Column that returns specified values from a dataset', 'A Formula that returns a value that is evaluated as the report is viewed', 'An Editor that allows you to enter a DAX expression']

Correct Answer: A Formula that returns a value that is evaluated as the report is viewed

Question 6

Question: Which best describes a "Calculated Column" in Power BI?

Options: ['A column in the dataset that was read from a data source', 'A column that was not in the original data; it was calculated by Power BI', 'A column that is calculated as the report is viewed']

Correct Answer: A column that was not in the original data; it was calculated by Power BI

Question 7

Question: Which best describes DAX?

Options: ['A formula language used to create custom calculations and measures', 'The Power BI file format', 'An XML Data Schema', 'A database file format']

Correct Answer: A formula language used to create custom calculations and measures


Kahoot 3

Kahoot Name: Power BI Recap 3
Date: 7 Dec 2021

Question 1

Question: What is a Slicer used for?

Options: ['to change the colours of a pie chart', 'to add some borders to the page', 'to filter other visuals on the page', 'to add a title to the page']

Correct Answer: to filter other visuals on the page

Question 2

Question: What file extension is used when saving a Power BI report?

Options: ['.pbix', '.powerbi', '.pb', '.pwbx']

Correct Answer: .pbix

Question 3

Question: Which of the following is NOT a chart type in Power BI?

Options: ['Cards', 'Donut', 'Table', 'Pivot']

Correct Answer: Pivot

Question 4

Question: A donut visual is used to show values as % of total

Options: [True, False]

Correct Answer: True

Question 5

Question: What is a Legend?

Options: ['A type of Chart', 'An extra grouping you can add into some chart types', 'A colour scheme', 'A Chart Title']

Correct Answer: An extra grouping you can add into some chart types

Question 6

Question: Which visual can be used to display single numbers?

Options: ['Column', 'Slicer', 'Table', 'Card']

Correct Answer: Card

Question 7

Question: Which of these charts could you use a secondary axis on?

Options: ['Pie', 'Donut', 'Table', 'Line']

Correct Answer: Line

Question 8

Question: Where do you change the chart title?

Options: ['Change Chart Title icon on the Ribbon', 'Double click on the chart', 'Formatting Section of the Visualizations Pane', 'You cannot change the title - it is set by default']

Correct Answer: Formatting Section of the Visualizations Pane

Question 9

Question: Excel is the only data source you can connect to in Power BI

Options: [True, False]

Correct Answer: False

Question 10

Question: What types of summarization can you use on a value field in a chart?

Options: ['Average', 'Count', 'Sum', 'All of the Above']

Correct Answer: All of the Above


Kahoot 4

Kahoot Name: Data Eng 1
Date: 8 Dec 2021

Question 1

Question: Which is NOT commonly one of the V's of big data?

Options: ['Variety', 'Value', 'Vulnerability', 'Volatility']

Correct Answer: Vulnerability

Question 2

Question: Raw, unstructured Data whose purpose is yet to be determined is commonly stored in

Options: ['Data Warehouse', 'Operational Database', 'Data Lake', 'Oracle']

Correct Answer: Data Lake

Question 3

Question: The full form of OLAP is

Options: ['Online Advanced Processing', 'Online Analytical Processing', 'Online Analytical Performance', 'Online Advanced Preparation']

Correct Answer: Online Analytical Processing

Question 4

Question: A website backend system that is often storing, retrieving and updating data is performing

Options: ['OLAP', 'OLTP']

Correct Answer: OLTP

Question 5

Question: An ETL Job would commonly be used to move data from

Options: ['OLAP system => OLTP Database', 'OLTP Database => OLAP System', 'Data Warehouse => OLTP Database', 'Data Lake => OLTP System']

Correct Answer: OLTP Database => OLAP System
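
The extract, transform, load flow in this question can be sketched with two in-memory SQLite databases. This is only an illustration under assumptions: the table names (orders, daily_sales), the columns and the sample rows are all hypothetical, and a real ETL job would normally use dedicated tooling and a proper warehouse.

```python
import sqlite3

# Hypothetical OLTP side: one row per transaction.
oltp = sqlite3.connect(":memory:")
oltp.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, day TEXT, amount REAL)")
oltp.executemany(
    "INSERT INTO orders (day, amount) VALUES (?, ?)",
    [("2021-12-08", 10.0), ("2021-12-08", 5.0), ("2021-12-09", 7.5)],
)

# Hypothetical OLAP side: one pre-aggregated row per day.
olap = sqlite3.connect(":memory:")
olap.execute("CREATE TABLE daily_sales (day TEXT PRIMARY KEY, total REAL, n_orders INTEGER)")

# Extract: read the raw transactional rows.
rows = oltp.execute("SELECT day, amount FROM orders").fetchall()

# Transform: aggregate the transactions per day.
totals = {}
for day, amount in rows:
    total, count = totals.get(day, (0.0, 0))
    totals[day] = (total + amount, count + 1)

# Load: write the aggregated rows into the analytical table.
olap.executemany(
    "INSERT INTO daily_sales (day, total, n_orders) VALUES (?, ?, ?)",
    [(day, total, count) for day, (total, count) in totals.items()],
)
olap.commit()

summary = olap.execute(
    "SELECT day, total, n_orders FROM daily_sales ORDER BY day"
).fetchall()
print(summary)
```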

Question 6

Question: Commonly Data Normalization results in

Options: ['Duplicated data to make retrieval faster', 'Reduction in duplication of data', 'Faster write of new records', 'Fewer Tables']

Correct Answer: Reduction in duplication of data

Question 7

Question: In a relational model, when a primary key from one table is placed in a column in another table it's called a

Options: ['Secondary Key', 'Foreign Key', 'This is an Error', 'Partition Key']

Correct Answer: Foreign Key
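
A foreign key can be tried out with Python's built-in sqlite3 module. The sketch below assumes two hypothetical tables, customers and orders; note that SQLite only enforces foreign keys after `PRAGMA foreign_keys = ON`, so other databases may behave differently out of the box.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite enforces foreign keys only when this pragma is switched on.
conn.execute("PRAGMA foreign_keys = ON")

conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT NOT NULL)")
conn.execute(
    "CREATE TABLE orders ("
    " id INTEGER PRIMARY KEY,"
    " customer_id INTEGER NOT NULL REFERENCES customers(id),"  # the foreign key
    " item TEXT)"
)

conn.execute("INSERT INTO customers (id, name) VALUES (1, 'Ada')")
# Valid: customer 1 exists in the parent table.
conn.execute("INSERT INTO orders (customer_id, item) VALUES (1, 'keyboard')")

# Invalid: there is no customer 99, so the foreign key constraint rejects the row.
try:
    conn.execute("INSERT INTO orders (customer_id, item) VALUES (99, 'mouse')")
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True

order_count = conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]
print(orphan_rejected, order_count)
```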

Question 8

Question: A "CREATE TABLE" statement is what category of SQL

Options: ['DML', 'DDL', 'DQL', 'DCL']

Correct Answer: DDL
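
As a rough illustration of the categories, the sketch below runs one DDL, one DML and one DQL statement against an in-memory SQLite database. The kahoots table is hypothetical, and DCL statements such as GRANT are left out because SQLite does not support them.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# DDL (Data Definition Language): defines the structure of the database.
conn.execute("CREATE TABLE kahoots (id INTEGER PRIMARY KEY, name TEXT)")

# DML (Data Manipulation Language): changes the rows stored in a table.
conn.execute("INSERT INTO kahoots (name) VALUES ('Data Eng 1')")

# DQL (Data Query Language): reads rows without changing them.
rows = conn.execute("SELECT name FROM kahoots").fetchall()
print(rows)
```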


Kahoot 5

Kahoot Name: Relational And MongoDB
Date: 9 Dec 2021

Question 1

Question: In a relational database, a primary key is

Options: ['Also a Foreign Key', 'Unique and Not Null', 'Always a number', 'Allowed to contain Null values']

Correct Answer: Unique and Not Null

Question 2

Question: In a relational database, if you specify that a column must be unique, what happens if you try to insert a duplicate value?

Options: ['The insert is paused until you fix the data entered.', 'The database ignores the row with the duplicate data', 'The database gives an error for the entire insert', 'Any non-duplicate records in the insert are added']

Correct Answer: The database gives an error for the entire insert
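
The behaviour can be checked with SQLite, assuming its default conflict handling; the users table and the email addresses are made up for the example. Other databases, or statements using options such as an "ignore duplicates" clause, can behave differently.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (email TEXT UNIQUE)")
conn.execute("INSERT INTO users VALUES ('a@example.com')")
conn.commit()

# One statement inserting two rows: the first is new, the second is a duplicate.
try:
    conn.execute("INSERT INTO users VALUES ('b@example.com'), ('a@example.com')")
    insert_failed = False
except sqlite3.IntegrityError:
    insert_failed = True

# The whole statement was backed out, so the new row was not kept either.
emails = [row[0] for row in conn.execute("SELECT email FROM users ORDER BY email")]
print(insert_failed, emails)
```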

Question 3

Question: SELECT count(*) FROM table; counts what?

Options: ['number of columns in the table', 'number of rows with no null values', 'number of rows', 'number of non null values in the first column']

Correct Answer: number of rows
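
A small sketch of the difference between COUNT(*) and COUNT(column), using an in-memory SQLite database. The table t and its values are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (a INTEGER, b TEXT)")
conn.executemany(
    "INSERT INTO t (a, b) VALUES (?, ?)",
    [(1, "x"), (None, "y"), (3, None)],
)

# COUNT(*) counts every row; COUNT(a) counts only rows where a is not NULL.
total_rows, non_null_a = conn.execute("SELECT COUNT(*), COUNT(a) FROM t").fetchone()
print(total_rows, non_null_a)
```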

Question 4

Question: In SQL, the LIKE operator compares values

Options: ['based on how they sound', 'based on similar data types', 'LIKE is not a valid operator in SQL', 'using wildcards "_" and "%"']

Correct Answer: using wildcards "_" and "%"
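
The two wildcards can be compared side by side in SQLite: "%" matches any run of characters (including none) and "_" matches exactly one character. The places table and its names are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE places (name TEXT)")
conn.executemany(
    "INSERT INTO places (name) VALUES (?)",
    [("Queens",), ("Quebec",), ("Qt",), ("Bronx",)],
)

# 'Q%' : a Q followed by anything.
starts_with_q = [
    row[0]
    for row in conn.execute("SELECT name FROM places WHERE name LIKE 'Q%' ORDER BY name")
]

# 'Q_' : a Q followed by exactly one character.
q_plus_one = [
    row[0]
    for row in conn.execute("SELECT name FROM places WHERE name LIKE 'Q_' ORDER BY name")
]
print(starts_with_q, q_plus_one)
```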

Question 5

Question: Which is NOT a common category of NoSQL database?

Options: ['Key-Value Store', 'Random-ordered Database', 'Column-Oriented Database', 'Document-oriented Database']

Correct Answer: Random-ordered Database

Question 6

Question: Vertical Scaling is adding more computers to a cluster to get more work done

Options: [True, False]

Correct Answer: False

Question 7

Question: In MongoDB terminology a collection is equivalent to a ___ in SQL

Options: ['Database', 'Row in a Table', 'Column in a Table', 'Table']

Correct Answer: Table

Question 8

Question: Which operation finds restaurants whose borough field is equal to "Queens" in MongoDB?

Options: ['db.restaurants.find( { "borough" : "Queens" } )', 'select * from restaurants where borough LIKE "Q%";', 'restaurants.select("Queens")', 'database.findAll("restaurants")']

Correct Answer: db.restaurants.find( { "borough" : "Queens" } )


Kahoot 6

Kahoot Name: Python Data Basics - Short
Date: 12 Jan 2022

Question 1

Question: The "Python Interpreter" is?

Options: ['A person who understands Python code', 'A program that executes Python code', 'A program that converts between Python and another language', 'Any web application written in Python']

Correct Answer: A program that executes Python code

Question 2

Question: What is Jupyter?

Options: ['A web application enabling writing Python code in a web browser', 'An application for managing Python data science tools', 'The tool most often used to write Python code', 'A Python library for managing tables of data']

Correct Answer: A web application enabling writing Python code in a web browser

Question 3

Question: What is the usual file extension for a Jupyter notebook?

Options: ['.nb', '.py', '.ipynb', '.csv']

Correct Answer: .ipynb

Question 4

Question: What are the two types of cell most commonly used in a Jupyter notebook?

Options: ['Markup and Markdown cells', 'Pandas and Anaconda cells', 'Markdown and Code cells', 'Python and C++ cells']

Correct Answer: Markdown and Code cells

Question 5

Question: A Jupyter markdown cell can contain HTML code?

Options: [True, False]

Correct Answer: True

Question 6

Question: Which is valid Python to import the pandas library?

Options: ['include pandas as pd', 'pd.import()', 'import pandas as pd', 'pd.read_csv()']

Correct Answer: import pandas as pd

Question 7

Question: Which of the following is a Python tuple?

Options: ['[3]', '{1, 2, 3}', '(1, 2, 3, 4)', 'None of the Above']

Correct Answer: (1, 2, 3, 4)
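
To see why, the three literals from the options can be passed to `type()`; the bracket style decides the type. The dictionary of labels is just a convenience for this sketch.

```python
# Each literal from the question, keyed by how it is written.
candidates = {
    "[3]": [3],                    # square brackets: list
    "{1, 2, 3}": {1, 2, 3},        # braces without key: value pairs: set
    "(1, 2, 3, 4)": (1, 2, 3, 4),  # parentheses with commas: tuple
}

type_names = {label: type(value).__name__ for label, value in candidates.items()}
print(type_names)
```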

Question 8

Question: How do you access items from a list?

Options: ['#A', '#B', '#C', '#D']

Correct Answer: #B

Question 9

Question: How do we create a list in Python?

Options: ['List1 = {}', 'List1 = ()', 'List1 = []', 'List1 = ""']

Correct Answer: List1 = []
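
A short sketch of creating a list and then reading items back by position, which is also what Question 8 asks about. The variable names and values are illustrative only.

```python
list1 = []                      # empty list literal
list1.extend(["a", "b", "c", "d"])

first = list1[0]                # indexing starts at 0
last = list1[-1]                # negative indexes count from the end
middle = list1[1:3]             # slice: items at positions 1 and 2
print(first, last, middle)
```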

Question 10

Question: What is a dictionary in Python?

Options: ['an unordered set of key-value pairs', 'an ordered set of key-value pairs', 'an unordered set of values referenced by an index number', 'an ordered set of values referenced by an index number']

Correct Answer: an unordered set of key-value pairs
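
A minimal sketch of dictionary use: values are looked up by key, not by position. One caveat worth knowing: since Python 3.7 dictionaries do preserve insertion order, so "unordered" here is best read as "not indexed by position". The sample data is made up.

```python
capitals = {"France": "Paris", "Japan": "Tokyo"}
capitals["Kenya"] = "Nairobi"        # add a new key-value pair

tokyo = capitals["Japan"]            # look up a value by its key
missing = capitals.get("Peru")       # .get() returns None for an absent key

# Iterating yields keys in insertion order (Python 3.7+).
keys_in_order = list(capitals)
print(tokyo, missing, keys_in_order)
```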


Kahoot 7

Kahoot Name: Duplicate of Pandas
Date: 13 Jan 2022

Question 1

Question: What is pandas' key data structure called?

Options: ['Array', 'Dataset', 'DataFrame', 'Recordset']

Correct Answer: DataFrame

Question 2

Question: DataFrame in pandas is

Options: ['1D Array', '3D Array', '2D Array', 'None']

Correct Answer: 2D Array

Question 3

Question: Which of the following are correct features of DataFrame?

Options: ['All columns share an index which provides efficient lookups', 'Columns have names and datatypes', 'Can Perform Arithmetic operations on rows and columns', 'All of the above']

Correct Answer: All of the above

Question 4

Question: How do you import a CSV file into a DataFrame, specifying the date column as the index and enabling date/time intelligence?

Options: ["df = pd.read_csv('file.csv', index_col='Date',parse_dates=True)", "df = pd.read_csv('file.csv')", "df = pd.read_csv('file.csv', index_col='Date',parse_dates=False)", "df = pd.read_csv('file.csv', parse_dates=True)"]

Correct Answer: df = pd.read_csv('file.csv', index_col='Date',parse_dates=True)
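
The effect of `index_col` and `parse_dates` can be seen without a file on disk by reading from an in-memory string. The CSV content below is invented sample data standing in for 'file.csv'.

```python
import io

import pandas as pd

# Hypothetical stand-in for the contents of 'file.csv'.
csv_text = (
    "Date,Open,High,Low,Close\n"
    "2000-01-01,10.0,12.0,9.5,11.0\n"
    "2000-01-02,11.0,13.0,10.5,12.5\n"
)

# index_col='Date' makes the Date column the index;
# parse_dates=True asks pandas to parse that index as dates.
df = pd.read_csv(io.StringIO(csv_text), index_col="Date", parse_dates=True)

print(type(df.index).__name__)
print(df.shape)
```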

Question 5

Question: To retrieve the first 5 rows of a DataFrame

Options: ['df.top()', 'df.head()', 'df.first()', 'pd.first(df)']

Correct Answer: df.head()

Question 6

Question: What is a correct syntax to return the row at index '2000-01-01' in a DataFrame?

Options: ['df[first]', "df.loc['2000-01-01']", "df['2000-01-01']", 'df(0)']

Correct Answer: df.loc['2000-01-01']
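
A small sketch of label-based row lookup with `.loc`, assuming a DataFrame whose index holds dates. The prices are invented sample data.

```python
import pandas as pd

df = pd.DataFrame(
    {"Open": [10.0, 11.0], "Close": [11.0, 12.5]},
    index=pd.to_datetime(["2000-01-01", "2000-01-02"]),
)

# .loc selects by index label; here the label is a date given as a string.
row = df.loc["2000-01-01"]
close_on_day = float(row["Close"])
print(close_on_day)
```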

Question 7

Question: Which would get the mean of each of the 'Open' and 'High' columns?

Options: ["df[ 'Open', 'High' ].mean()", "df.loc[ ['Open', 'High'] ].mean()", "df[ ['Open', 'High'] ].mean()", 'for item in df: item.mean()']

Correct Answer: df[ ['Open', 'High'] ].mean()
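
The double brackets matter: the inner list selects several columns at once, and `.mean()` then returns one mean per selected column. The sketch below uses invented prices.

```python
import pandas as pd

df = pd.DataFrame(
    {
        "Open": [10.0, 11.0],
        "High": [12.0, 13.0],
        "Close": [11.0, 12.5],
    }
)

# A list of column names inside the brackets selects a sub-DataFrame;
# .mean() then averages each selected column separately.
means = df[["Open", "High"]].mean()
print(means)
```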