Statıstıcs I (ENG) - İşletme (İngilizce)

Ünite 1

Soru 1

The size of a ......I..... is always less than the size of the ......II....... from which it is taken.
Which of the following terms correctly completes the sentence above?

I- interval
II- Data

I-Data
II-Nominal

I-Ordinal
II-Population

I-Ratio
II-Interval

I-Sample
II-Population

Açıklama:

A population contains all members of a specified group Example: The population may be "ALL people living in the Turkey"
A sample data set contains a part, or a subset, of a population. The size of a sample is always less than the size of the population from which it is taken. Example: The sample may be "SOME people living in the Turkey"

Doğru Cevap: E

Soru 2

I- Students' GPAs
II-Political parties
III- Educational level
What type of variable are the ones described above?

I-Interval
II-Nominal
III-Ordinal

I-Ratio
II-Nominal
III-Interval

I-Nominal
II-Interval
III-Ordinal

I-Interval
II-Ratio
III-Ratio

I-Ordinal
II-Nominal
III-Interval

Açıklama:

Nominal scale is a scale of measurement that is used for identification purposes. A nominal scale describes a variable with categories that do not have a natural order or ranking.
Ordinal Scale involves the ranking or ordering of the attributes depending on the variable being scaled.
Interval scale of data measurement is a scale in which the levels are ordered and each numerically equal distances on the scale have equal interval difference, but no absolute zero.
Ratio scale has all the properties of an interval variable, and also has a clear definition of zero. When the variable equals zero, there is none of that variable.

Doğru Cevap: A

Soru 3

The variables are measured using an .............. scale, which not only shows the order but also shows the exact difference in the value.
Which type of scale should replace the blank space in the above sentence?

Ordinal

Categorical

Nominal

Interval

Ratio

Açıklama:

Unlike ordinal variables that take values with no standardized scale, every point in the interval scale is equidistant. Arithmetic operations can also be performed on the numerical values of the interval variable.
The interval scale of data measurement is a scale in which the levels are ordered and each numerically equal distances on the scale have equal interval difference.

Doğru Cevap: D

Soru 4

150 college students were randomly assigned to played either a violent or nonviolent video game. A short time later, the students who played the violent video game punished an opponent (received a noise blast with varying intensity) for a longer period of time than did students who had played the nonviolent video game.
What type of study is described above?

Experimental

Correlational

Survey

Observational

Qualitative

Açıklama:

Observational study - data are observed and collected on each subject and no manipulation of the subjects’ environment occurs.
Experimental study- Manipulate the subjects’ environment, then measure the response variable.

Doğru Cevap: A

Soru 5

.................... is the combination of statistics, mathematics, programming, problem-solving, capturing data in ingenious ways, the ability to look at things differently, and the activity of cleansing, preparing, and aligning the data.
What is the correct concept for the sentence above?

Big data

Statistics

Data science

Population

Sample

Açıklama:

Dealing with unstructured and structured data, Data Science is a field that comprises everything that related to data cleansing, preparation, and analysis. Data science is the combination of statistics, mathematics, programming, problem-solving, capturing data in ingenious ways, the ability to look at things differently, and the activity of cleansing, preparing, and aligning the data. In simple terms, it is the umbrella of techniques used when trying to extract insights and information from data.

Doğru Cevap: C

Soru 6

Statisticians are the most sought-after professionals these days.
The Statistician's mission is handling, analysing, interpreting and facilitating decisions from data.
The principal reason why Statistics is a key necessity to humankind today is the massive amounts of available information.

Which of the above statements about statistics is correct?

I, II, III

I and II

II and III

I and III

Açıklama:

Statisticians are the most sought-after professionals these days.We are living in the information age, and the Statistician is the king of handling, analysing, interpreting and facilitating decisions from data. The principal reason why Statistics is a key necessity to humankind today is the massive amounts of available information.

Doğru Cevap: A

Soru 7

The main criteria for selecting a sample will be that the sample is _____I_____ of the population and that there’s no or very little ____II_____ in the choice of the sampling units.
Which of the following options fills the blank in the above sentence in the most correct way?

I - representative II - subjectivity

I - representative II - objectivity

I - inclusive II - subjectivity

I - inclusive II - objectivity

I - containing II - objectivity

Açıklama:

The main criteria for selecting a sample will be that the sample is representative of the population and that there’s no or very little subjectivity in the choice of the sampling units.

Doğru Cevap: A

Soru 8

Which of the following is not a nominal variable?

Exam grade

Gender

Region of residence

Field of study

Type of transport

Açıklama:

The easiest form of data is called categorical, or qualitative. Categorical variables and data can be either nominal or ordinal. Exam grade is an ordinal categorical variable, since its categories are ordered: A is better than a B, B is better than a C, and so on. Other examples of nominal categorical variables are gender, region of residence, field of study, type of transport, type of housing, etc.

Doğru Cevap: A

Soru 9

Which of the following is not an ordinal variable?

Type of housing

Income group

Exam grade

Educational level

An attitude question in a survey where possible responses are agree / disagree.

Açıklama:

Categorical variables and data can be either nominal or ordinal. Examples of nominal categorical variables are gender, region of residence, field of study, type of transport, type of housing, etc. There is no ordering in the categories of these variables. . Exam grade is an ordinal categorical variable, since its categories are ordered: A is better than a B, B is better than a C, and so on. Other examples of ordinal categorical variables are income group (If incomes have been categorized), an attitude question in a survey where possible responses are strongly agree/agree/disagree/strongly disagree (these categories have an order).

Doğru Cevap: A

Soru 10

Age is a _______ scale data. Which of the following options fills the blank in the above sentence in the most correct way?

Ratio

Interval

Nominal

Ordinal

Qualitative

Açıklama:

The other main type of data is called continuous, or quantitative, for example data on variables “blood pressure”, “age” and “income”. These are observations of variables on continuous scales, usually rounded in some convenient way. For example, although age is a continuous time variable, and we are getting older all the time by seconds, minutes and hours, someone’s age is almost always rounded to the number of years completed. There is a subtle difference between interval-scale and ratio-scale continuous data, which is worth mentioning here. Age is an interval-scale variable: to compare two children of ages 10 and 12, we would compute the interval difference, .e. 2 years.

Doğru Cevap: B

Soru 11

As a general rule, most data on monetary values and those coming from physical measurements (e.g., lira, gold price, centimeters, kilograms) are _____ variables.
Which of the following options fills the blank in the above sentence in the most correct way?

Interval-scale

Nominal-scale

Ratio-scale

Ordinal-scale

Categorical-scale

Açıklama:

As a general rule, most data on monetary values and those coming from physical measurements (e.g., lira, gold price, centimeters, and kilograms) are ratio-scale variables.

Doğru Cevap: C

Soru 12

Commerce, especially online electronic commerce
Finance, for example share prices on stock markets, all managed electronically
Insurance, all the premiums, incidents, actuarial transactions in an insurance company
Transport, for example in the airline industry, all the flights, all the passengers

Which of the above can be considered for big data?

I, II and III

I, III and IV

I, II and IV

II, III and IV

I, II, III and IV

Açıklama:

What are the “big data” sets today and where do they come from? These are mostly found in the following areas:

Commerce, especially online electronic commerce
Finance, for example share prices on stock markets, all managed electronically
Insurance, all the premiums, incidents, actuarial transactions in an insurance company
Biomedicine, especially in genetics, where information is literally exploding as gene-sequencing reveals and codes the total genetic profile of a person
Transport, for example in the airline industry, all the flights, all the passengers
Climate data, measurements from tens of thousands of weather stations across the world

Doğru Cevap: E

Soru 13

Inflation rate, which compares the prices of a basket of products over time, is a _______ variable.
Which of the following correctly fills the blank above?

Interval-scale

Nominal-scale

Ratio-scale

Ordinal-scale

Categorical-scale

Açıklama:

As a general rule, most data on monetary values and those coming from physical measurements (e.g., lira, gold price, centimeters, and kilograms) are ratio-scale variables.

Doğru Cevap: C

Soru 14

The first role of Statistics is to reduce this mass of complex data to a simpler form in order to facilitate understanding and learning from the data.
Words, SMSs, tweets, social media posts, verbal responses in questionnaires, these can all be treated as data.
Climate data, measurements from tens of thousands of weather stations across the world, is an example of big data.

Which of the above statements related to the data is correct?

I, II and III

I and II

II and III

I and III

III

Açıklama:

The first role of Statistics is to reduce this mass of complex data to a simpler form in order to facilitate understanding and learning from the data. This is often done by making graphical representations of the data. Statistics can make a lot of numerical data easily understandable. The world today abounds in textual data. Words, SMSs, tweets, social media posts, verbal responses in questionnaires, these can all be treated as data. Climate data, measurements from tens of thousands of weather stations across the world, is an example of big data.

Doğru Cevap: A

Soru 15

Which of the following variable is an example of interval scale?

Hours of sleep

Inflation rate

Income group

Exam grade

Educational level

Açıklama:

The other main type of data is called continuous, or quantitative, for example data on variables “blood pressure”, “age” and “income”. These are observations of variables on continuous scales, usually rounded in some convenient way. For example, although age is a continuous time variable, and we are getting older all the time by seconds, minutes and hours, someone’s age is almost always rounded to the number of years completed. There is a subtle difference between interval-scale and ratio-scale continuous data, which is worth mentioning here. Other measures of time are interval-scale variables, for example hours of sleep (on Sundays I sleep an hour longer - I would not say I sleep 14% longer).

Doğru Cevap: A

Soru 16

Which of the following are not done by using statistics?

analyse website traffic

decide if medical treatments are effective

analyse the examination grades

check the financial transactions of a sports club

interpret visitors’ data

Açıklama:

check the financial transactions of a sports club

Doğru Cevap: D

Soru 17

Which of the following is not variable?

price

blue

radio stations

color of hair

age of tooth

Açıklama:

Blue is data, not variable.

Doğru Cevap: B

Soru 18

Which of the following is not data?

face photo

100 kg

50 %

Eskişehir

address

Açıklama:

Address is variable, not data.

Doğru Cevap: E

Soru 19

What type of variable is your weight?

qualitative nominal

qualitative ordinal

quantitative ratio

quantitative interval

continuous regular

Açıklama:

quantitative ratio

Doğru Cevap: C

Soru 20

What type of variable is Aristotle’s logic’s truth which gets values true or false?

qualitative ordinal

qualitative nominal

continuous regular

quantitative interval

quantitative ratio

Açıklama:

qualitative ordinal

Doğru Cevap: B

Soru 21

Is your answer to this question data?

Yes

Maybe

Sometime

It depends on time

Açıklama:

Yes

Doğru Cevap: B

Soru 22

What type of variable is a girl’s answer to a marriage offer?

continuous regular

quantitative interval

quantitative ratio

qualitative ordinal

qualitative nominal

Açıklama:

qualitative nominal

Doğru Cevap: E

Soru 23

Which of the following do not constitute big data?

Anadolu University’s website’s visitor traffic

internet browsing traffic in Anadolu University

Statistics course book

phone call traffic in Anadolu University

Anadolu University’s library’s user traffic

Açıklama:

Statistics course book

Doğru Cevap: C

Soru 24

Which of the following is not variable?

silence

altitude

sound

taste

smell

Açıklama:

silence is data, not variable

Doğru Cevap: A

Soru 25

Which of the following is not data?

Anadolu University

Statistics Department

Statistics course

Anadolu University students

Anadolu University campus

Açıklama:

Anadolu University students is variable, not data

Doğru Cevap: D

Soru 26

What cannot be an example showing that statistics is part of our daily lives?

changing climate records

making weather forecast

understanding climate patterns

displaying weather forecasting

keeping wind direction records

Açıklama:

Statistics is at the heart of understanding climate patterns and making weather forecasts. The first role of Statistics is to reduce this mass of complex data to a simpler form in order to facilitate understanding and learning from the data. This is often done by making graphical representations of the data.

Doğru Cevap: A

Soru 27

Which of the statements below CANNOT be true about Statistics?

Statistics can make a lot of numerical data easily understandable.

We often talk of estimates in Statistics.

Most of the observations can be reduced to some numerical quantity through statistical methods.

We cannot get access to every single data on the topic that we are interested in.

Statistics usually samples from a very small data, not from a large population.

Açıklama:

It may seem that everything may be recorded and stored somewhere. But in reality - unless we somehow centralize and link all the databases in the world, and have free access to them - we can get access to only a small part of whatever data we are interested in. For example, it is impossible to ask the whole population of Turkey what their view on climate change is, whether they believe it is natural or manmade. This is where the most basic concept in Statistics comes into play: sampling from a population.

Doğru Cevap: E

Soru 28

What type of variable are your exam grades such as A, B?

interval-scale variable

nominal categorical variable

ordinal categorical variable

ratio-scale variable

continous interval variable

Açıklama:

Categorical variables and data can be either nominal or ordinal.
The question about climate change, with possible responses “natural”, “manmade” or “don’t know/can’t answer” is a nominal categorical variable, as is the variable “country” - there is no ordering in the categories of these variables. By contrast, exam grade is an ordinal categorical variable, since its categories are ordered: A is better than a B, B is better than a C, and so on.

Doğru Cevap: C

Soru 29

"Comparing the prices of a basket of products over time" is an example of .....?

interval-scale variable

ratio-scale variable

nominal categorical variable

ordinal categorical variable

qualitative variable

Açıklama:

There is a subtle difference between interval-scale and ratio-scale continuous data, which is worth mentioning here. Age is an interval-scale variable: to compare two children of ages 10 and 12, we would compute the interval difference, i.e. 2 years. We would not say the 12-year old is 20% older than the 10-year old. But comparing prices or incomes, for example, we would tend to compute percentage differences, making them ratio-scale variables. A good example is the inflation rate, comparing the prices of a basket of products over time, not as a difference but as a percentage. As a general rule, most data on monetary values and those coming from physical measurements (e.g., lira, gold price, centimeters, kilograms) are ratio-scale variables.

Doğru Cevap: B

Soru 30

"Twitter is an American microblogging and social networking service on which users post and interact with messages known as tweets" (https://en.wikipedia.org)
What type of data are tweets you write on Twitter?

textual data

verbal data

ordinal data

scale data

interval data

Açıklama:

Doğru Cevap: A

Soru 31

What is Biostatistics ?

Statistics in Biology research

Statistics in Medical research

Statistics in Biometry research

Statistics in Biochemistry research

Statistics in Social research

Açıklama:

Medical research is a good context to understand these differences - Statistics in medical research is often called Biostatistics.

Doğru Cevap: B

Soru 32

In a research investigating the effects of playing online games on aggressive behavior, 100 teenagers are observed. What statistical term represents these 100 teenagers in this particular experiment?

population

data

sample

variable

scale

Açıklama:

The way the sample is collected is crucial to obtaining a valid estimate, and this is an important subject which will be dealt with in this course. The main criteria for selecting a sample will be that the sample is representative of the population and that there is no or very little subjectivity in the choice of the sampling units. Sampling is not only conducted by survey researchers on human populations, but also by auditors on a company’s accounts, by agricultural researchers on different pieces of land, and by quality control inspectors on products in a factory, to name only a few examples.

Doğru Cevap: C

Soru 33

In a research investigating the effects of playing online games on aggressive behavior, 40 teenagers at the age of 10 plays 4 hours of violence games and 40 teenagers at the age of 10 does not play any video games. What type of research is this research?

observation

statistical

case

experimental

single-case

Açıklama:

In order to be able to prove that aspirin is the cause of the improvement in health, an experiment needs to be conducted where conditions are controlled between those taking aspirin and those not taking it. Such an experiment might be designed as follows, restricted to men, for example, since the effects are suspected to be different for men and women.

Doğru Cevap: D

Soru 34

To prove that aspirin is the cause of the improvement in health, what type of research needs to be conducted?

case

observation

statistical

experimental

report

Açıklama:

Doğru Cevap: D

Soru 35

Which one below is a term that has the same meaning the with the term Statistics?

Data Science

Database Management

Data Visualization

Computer Science

Analytics

Açıklama:

At the start of this Introduction we asked “What is Statistics? What is Data Science? What is Analytics?” To deal with Analytics first, this is a term now used in business circles as a substitute for the word Statistics, but it really means the same thing. The word Statistics is considered by some people, especially businessmen, as a bit old-fashioned, and sometimes even difficult to pronounce! But don’t be fooled: Analytics is a fancy word for Statistics.
When it comes to Data Science, however, the term does have some different meaning. Data Science is a field that includes Statistics as well as areas such as Computer Science, Database Management and Data Visualization, for example, and has come into being mainly as a result of the spectacular growth in the amount of available data in this new information world that we live in.

Doğru Cevap: E

Soru 36

The main criteria for selecting a sample will be that the sample is representative of the population and that there is no or very little subjectivity in the choice of the sampling units.

Sampling is not only conducted by survey researchers on human populations, but also by auditors on a company’s accounts, by agricultural researchers on different pieces of land, and by quality control inspectors on products in a factory, to name only a few examples.

Data come in the form of numbers as well as text.

All observations can be reduced to some numerical quantity.

The way the sample is collected is crucial to obtaining a valid estimate.

Which of the above are correct?

I and II

I, II and III

III, IV and V

I, II, IV and V

I, II, III, IV and V

Açıklama:

Recommended Correction
Page 3
When 4 is added to 5, the result is exactly 9; when hyrdrogen and nitrogen are synthesized,…
When 4 is added to 5, the result is exactly 9; when hydrogen and nitrogen are synthesized,…
Data come in the form of numbers as well as text, which is something we are discovering more and more. We live in an information world and information is the virtual gold of our society. All observations can be reduced to some numerical quantity, and there are even fields of digital philosophy and digital sociology. (Page 6)
…The key to all of the above is the phrase: “a sample of about 1000 carefully selected people”. The way the sample is collected is crucial to obtaining a valid estimate, and this is an important subject which will be dealt with in this course. The main criteria for selecting a sample will be that the sample is representative of the population and that there is no or very little subjectivity in the choice of the sampling units. Sampling is not only conducted by survey researchers on human populations, but also by auditors on a company’s accounts, by agricultural researchers on different pieces of land, and by quality control inspectors on products in a factory, to name only a few examples. (Page 7)
As also understood from the information given, the correct answer isI, II, III, IV and V.

Doğru Cevap: E

Soru 37

Blood pressure

A verbal response

Number of supermarket visits

Purchased products

Course grade

Which of the above are statistical variables?

I and V

I, II and III

II, IV and V

I, II, IV and V

I, II, III, IV and V

Açıklama:

Doğru Cevap: E

Soru 38

The observations made on the variables constitute the data.

The subjects or individuals or companies on which these observations are made are called cases.

Categorical variables and data can be either nominal or ordinal.

Continuous variables and data can be either interval-scale or ratio-scale

Which of the above are correct?

I and II

I and III

I, II and III

II, III and IV

I, II, III and IV

Açıklama:

The easiest form of data is called categorical, or qualitative, for example data on variables “country” (e.g., the data observation might be Germany) or “question response” (e.g., believe that climate change is manmade) or “exam grade” (e.g., B). Categorical variables and data can be either nominal or ordinal. The question about climate change, with possible responses “natural”, “manmade” or “don’t know/can’t answer” is a nominal categorical variable, as is the variable “country” - there is no ordering in the categories of these variables. By contrast, exam grade is an ordinal categorical variable, since its categories are ordered: A is better than a B, B is better than a C, and so on.
Other examples of nominal categorical variables are gender, region of residence, field of study, type of transport, type of housing, etc.
Other examples of ordinal categorical variables are income group (if incomes have been categorized), an attitude question in a survey where possible responses are strongly agree/agree/disagree/strongly disagree (these categories have an order), social class (with classes usually in an inherent order), terrorist threat levels (in the UK these are low/moderate/substantial/severe/critical), etc.
The other main type of data (see Fig. 1.4) is called continuous, or quantitative, for example data on variables “blood pressure”, “age” and “income”. These are observations of variables on continuous scales, usually rounded in some convenient way. For example, although age is a continuous time variable, and we are getting older all the time by seconds, minutes and hours, someone’s age is almost always rounded to the number of years completed. There is a subtle difference between interval-scale and ratio-scale continuous data, which is worth mentioning here. Age is an interval-scale variable: to compare two children of ages 10 and 12, we would compute the interval difference, i.e. 2 years. We would not say the 12-year old is 20% older than the 10-year old. But comparing prices or incomes, for example, we would tend to compute percentage differences, making them ratio-scale variables. A good example is the inflation rate, comparing the prices of a basket of products over time, not as a difference but as a percentage. As a general rule, most data on monetary values and those coming from physical measurements (e.g., lira, gold price, centimeters, kilograms) are ratio-scale variables.
As also understood from the information given, the correct answer is I, II, III and IV.

Doğru Cevap: E

Soru 39

A group of tourists entering Turkey

Which country they come from

How many days they plan to stay

What their tourism objectives are

Which of the above is/are case(s)?

Only I

I and II

I and III

I, II and III

II, III and IV

Açıklama:

Data come in various forms and are measured in different ways. For example, a doctor measures your blood pressure with an instrument, a survey researcher asks you a question and you give a verbal response, you go on holiday to a particular country, you are at a certain age and have a certain income, you have a grade for a completed university course, or this past week you have gone to the supermarket a certain number of times, and bought a certain basket of products, etc. Blood pressure, question response, country, age, income, course grade, number of supermarket visits and purchased products, these are all statistical variables. The observations made on these variables constitute the data. The subjects or individuals or companies on which these observations are made are called cases. Thus, cases might be a group of tourists entering Turkey, for each of whom we have data on which country they come from, how many days they plan to stay in Turkey, what their tourism objectives are (e.g., cultural events, beach holiday, etc.). Or cases might be hospital patients, on whom we have measured the standard medical indicators such as blood pressure, cholesterol levels, blood sugar, and so on.
As also understood from the information given a group of tourist entering Turkey is case, so the correct answer is A group of tourists entering Turkey. “which country they come from”, “how many days they plan to stay” and “what their tourism objectives are” are statistical variables.

Doğru Cevap: A

Soru 40

Region of Residence

Field of Study

Type of Transport

Type of Housing

Exam Grade

Which of the above are nominal categorical variables?

I and II

I, II and III

II, III and IV

I, II, III and IV

II, III, IV and V

Açıklama:

Doğru Cevap: D

Soru 41

Hours of Sleep

The Inflation Rate

Gold Price

Time to Run 100 meters

Which of the above are interval-scale variables?

I and II

III and IV

I, II and V

II, III and IV

III, IV and V

Açıklama:

The other main type of data (see Fig. 1.4) is called continuous, or quantitative, for example data on variables “blood pressure”, “age” and “income”. These are observations of variables on continuous scales, usually rounded in some convenient way. For example, although age is a continuous time variable, and we are getting older all the time by seconds, minutes and hours, someone’s age is almost always rounded to the number of years completed. There is a subtle difference between interval-scale and ratio-scale continuous data, which is worth mentioning here. Age is an interval-scale variable: to compare two children of ages 10 and 12, we would compute the interval difference, i.e. 2 years. We would not say the 12-year old is 20% older than the 10-year old. But comparing prices or incomes, for example, we would tend to compute percentage differences, making them ratio-scale variables. A good example is the inflation rate, comparing the prices of a basket of products over time, not as a difference but as a percentage. As a general rule, most data on monetary values and those coming from physical measurements (e.g., lira, gold price, centimeters, kilograms) are ratio-scale variables. Other measures of time are interval-scale variables (the word “interval” gives you a clue to that!), for example hours of sleep (on Sundays I sleep an hour longer - I would not say I sleep 14% longer) and time to run 100 meters (e.g., at the 2009 World Championships in Berlin, Usain Bolt shaved more than a tenth of a second off his record, clocking 9.58 seconds - we wouldn’t say he reduced the time from 9.69 seconds by 1.1%).
As also understood from the information given “Age”, “Hours of Sleep” and “Time to Run 100 meters” are interval scale variables, so the correct answer is I, II and V. “The Inflation Rate” and “Gold Price” are ratio-scale variables.

Doğru Cevap: C

Soru 42

Frequently occurring words

The lengths of sentences

The number of words used just once

Verbal responses in questionnaires

Social media posts

Which of the above need recoding in order to create quantitative variables?

I and II

II and III

IV and V

I, II and V

III, IV and V

Açıklama:

The world today abounds in textual data. Words, SMSs, tweets, social media posts, verbal responses in questionnaires, these can all be treated as data. Some recoding will be necessary, since text is not numerical. Frequently occurring words can be counted, the lengths of sentences can be measured, the number of words used just once can be identified, and so on, in order to create quantitative variables from text. Textual data have been used, for example, in identifying the author of threatening letters, in comparing political party manifestos, in classifying respondents in a survey who give answers to open-ended questions.
As also understood from the information given, the correct answer is IV and V. “Verbal responses in questionnaires” and “Social media posts” need recoding in order to create quantitative variables. “Frequently occurring words”, “The lengths of sentences” and “The number of words used just once” are the ones which are recoded in order to create quantitative variables.

Doğru Cevap: C

Soru 43

A study which aims to find some evidence of a result

A study which aims to find causes of a result

A study in which the groups are “balanced” in terms of known factors

A study conducted where conditions are controlled

Which of the above can be given as an example/examples of observational studies?

Only I

I and II

I and IV

II and IV

II, III and IV

Açıklama:

A study that involves Statistics will clearly involve a process of data collection. But there are differences in study objectives which are important to recognize. The main distinguishing factor is whether the study aims to find some evidence of a result or whether it aims to find causes of a result. Medical research is a good context to understand these differences - Statistics in medical research is often called Biostatistics.
It is true that many benefits have been detected of taking a low-dosage of aspirin every day. For example, it has been observed that people who take aspirin regularly (less than 300 mg per day) generally have less health problems such as heart attacks, strokes and cancers. The keyword is “observed”: in a community health study involving thousands of people there will be many that take aspirin regularly for problems such as back pain or headaches. For those people it could be observed that they have less chronic diseases compared to people who don’t take aspirin. This is evidence of a difference, but it does not prove that aspirin is the actual cause of the improved health. People who do not take aspirin daily might be poorer and neglect taking medication, and they suffer from more chronic diseases than people with a higher income, so the real difference is socio-economic. From such an observational study it is not possible to conclude a causal effect, it just gives some tentative evidence of a possible relationship between the treatment and the outcome.
In order to be able to prove that aspirin is the cause of the improvement in health, an experiment needs to be conducted where conditions are controlled between those taking aspirin and those not taking it. Such an experiment might be designed as follows, restricted to men, for example, since the effects are suspected to be different for men and women. Suppose we take a large group of men in the age group 60- 70 years of age that have no history of chronic disease. We divide them into two groups so that the groups are “balanced” in terms of known factors such as age, social class, and so on (we don’t want one group to have older men than the other). By the way, this “dividing into two groups” is not a trivial matter, but that is a subject for later in this course. Assuming the two groups are comparable, then one group is given the daily low dose of aspirin and the other not, and the groups are followed up for a five-year period and then compared for the incidence of health problems that develop during this time. If the group taking aspirin has less health problems, this would indicate a beneficial effect caused by the aspirin.
This type of experiment on people has all sorts of problems, but it is really leading to a conclusion about whether aspirin is the real cause of differences between the two groups. There is the ethical problem of not giving the daily aspirin, which has suspected benefits, to a large group of people. There is also the problem that one group knows it is taking the medication and might change their lifestyle to favour a good outcome by living more healthily. This effect can be eliminated by not telling either group what they are getting and giving the non-aspirin group a so-called placebo, which is medication in the form of an aspirin, so both groups think they are getting the medication.
As also understood from the information given, “A study which aims to find some evidence of a result” is an example of observational studies, so the correct answer is only I. “A study which aims to find causes of a result”, “A study in which the groups are “balanced” in terms of known factors” and “A study conducted where conditions are controlled” are the examples of experimental studies.

Doğru Cevap: A

Soru 44

Statistics

Computer Science

Database Management

Data Visualization

Analytics

Which of the above really mean the same thing?

I and II

I and III

II and III

III and IV

I and V

Açıklama:

To deal with Analytics first, this is a term now used in business circles as a substitute for the word Statistics, but it really means the same thing. The word Statistics is considered by some people, especially businessmen, as a bit old-fashioned, and sometimes even difficult to pronounce! But don’t be fooled: Analytics is a fancy word for Statistics. When it comes to Data Science, however, the term does have some different meaning. Data Science is a field that includes Statistics as well as areas such as Computer Science, Database Management and Data Visualization, for example, and has come into being mainly as a result of the spectacular growth in the amount of available data in this new information world that we live in. The need has been recognized for someone who not only has statistical skills, but also advanced programming skills and knowledge about handling huge data sets, the so-called “Big Data” of today. Thus a new profession has been born, that of the Data Scientist. Again, don’t be fooled: the main skill of a data scientist is knowledge of Statistics,
As also understood from the information given, the correct answer is I and V. “Statistics” and “Analytics” really mean the same thing. Statistics, Computer Science, Database Management and Data Visualization are the areas which Data Science includes. They do not mean the same thing.

Doğru Cevap: E

Soru 45

Commerce, especially online electronic commerce

Finance, for example share prices on stock markets, all managed electronically

Insurance, all the premiums, incidents, actuarial transactions in an insurance company

Biomedicine, especially in genetics, where information is literally exploding as gene-sequencing reveals and codes the total genetic profile of a person

Transport, for example in the airline industry, all the flights, all the passengers

Climate data, measurements from tens of thousands of weather stations across the world

In which of the areas above are big data mostly found?

I, II and III

III, IV and V

IV, V and VI

I, II, III, IV and V

I, II, III, IV, V and VI

Açıklama:

What are the “big data” sets today and where do they come from? These are mostly found in the following areas:

Commerce, especially online electronic commerce

Finance, for example share prices on stock markets, all managed electronically

Insurance, all the premiums, incidents, actuarial transactions in an insurance company

Biomedicine, especially in genetics, where information is literally exploding as gene-sequencing reveals and codes the total genetic profile of a person

Transport, for example in the airline industry, all the flights, all the passengers

Climate data, measurements from tens of thousands of weather stations across the World

As also understood from the list given, the correct answer is I, II, III, IV, V and VI.

Doğru Cevap: E

Soru 46

Which of the following is not done by using statistics?

weather forecast

gain estimation

process time prediction

exam grade classification

password validation

Açıklama:

password validation. pg. 3. Correct answer is E.

Doğru Cevap: E

Soru 47

Which of the following are not variables?

exam grade

words in sentence

your Anadolu University student number

radio waves

class

Açıklama:

Your Anatolian University student number is data, not variable. pg. 14. Correct answer is C.

Doğru Cevap: C

Soru 48

Which of the following is not data?

eyeglass

94.3 Mhz TRT 3 FM

Anadolu University Statistics Department

Anadolu University Yunus Emre Campus

Porsuk river

Açıklama:

eyeglass is variable, not data. pg. 14. Correct answer is A.

Doğru Cevap: A

Soru 49

What type of variable is your heart beat?

qualitative ordinal

qualitative nominal

continuous regular

quantitative ratio

quantitative interval

Açıklama:

quantitative interval. pg. 8. Correct answer is E.

Doğru Cevap: E

Soru 50

What type of variable is your mood which gets values good or normal or bad?

continuous regular

qualitative nominal

qualitative ordinal

quantitative interval

quantitative ratio

Açıklama:

qualitative ordinal. pg. 14. Correct answer is C.

Doğru Cevap:

Soru 51

Is your answer to this question variable?

Maybe

Sometime

It depends on time

Yes

Açıklama:

No, it is data. pg. 14. Correct answer is D.

Doğru Cevap: D

Soru 52

What type of variable is your answer to a job offer?

continuous regular

quantitative ratio

quantitative interval

qualitative nominal

qualitative ordinal

Açıklama:

qualitative nominal. pg. 14. Correct answer is D.

Doğru Cevap: D

Soru 53

Which of the following is not big data?

Eskişehir’s food order website’s visitor traffic

internet traffic of Eskişehir Train Station Free Wi-Fi

phone call traffic in Eskişehir

Anadolu University TV's broadcast

Eskişehir Yunus Emre Hospital's user traffic

Açıklama:

Anadolu University TV's broadcast. pg. 11. Correct answer is D.

Doğru Cevap: D

Soru 54

Which of the following is not variable?

depth

darkness

noise

sense

feeling

Açıklama:

darkness is data, not variable. pg. 14. Correct answer is B.

Doğru Cevap: B

Soru 55

Which of the following is not data?

Eskişehir

Eskişehir train station

Eskişehir's pirate ship

Eskişehir's beach's users

Anadolu University airport

Açıklama:

Eskişehir's beach's users is variable, not data. pg. 14. Correct answer is D.

Doğru Cevap: D

Soru 56

Your body temperature changes through exercises.
What type of variable is it?

A textual variable

A continuous interval-scale variable

An ordinal categorical variable

A nominal categorical variable

A continuous ratio-scale variable

Açıklama:

There is no absolute zero in the temperature measurements. So body temperature is a continuous interval-scale variable.

Doğru Cevap: B

Soru 57

Students in a course get a grade, either "AA", "BA", "BB", or "BC", etc. What type of variable is the grade?

A continuous interval-scale variable

A textual variable

An ordinal categorical variable

A continuous ratio-scale variable

A nominal categorical variable

Açıklama:

Grades like "AA" or "BB" taken from a course is an ordinal categorical variable.

Doğru Cevap: C

Soru 58

Which of the following matches is wrong?

Nominal-Country

Ordinal-Olimpic Gold Medalist

Interval-Temperature

Continuous-Age

Interval-Time to run 100 meters

Açıklama:

Age is an interval-scale variable: to compare two children of ages 10 and 12, we would compute the interval difference, i.e. 2 years. We would not say the 12-year old is 20% older than the 10-year old.

Doğru Cevap: D

Soru 59

Which of the following are not data?

Age

Blood sugar

Cholesterol levels

Gender

Hospital patient

Açıklama:

The subjects or individuals or companies on which these observations are made are called cases. Hospital patients on whom we have measured the standard medical indicators such as blood pressure, cholesterol levels, blood sugar, and so on might be cases.

Doğru Cevap: E

Soru 60

A national sport committee decides to impose stricter conditions for athletes to enter the tournament. One year later, athletes obtain scores that are in general better than those the year before.
Which of the following can be considered as valid conclusions?

The increase in scores might be attributable to the introduction of stricter entry conditions.

The imposition of stricter entry conditions caused the increase in scores.

The imposition of stricter entry conditions could not possibly have caused the increase in scores.

There is not enough information, we need to collect more data.

We cannot conclude anything either way.

Açıklama:

A national sport committe decides to impose stricter conditions for athletes to enter the tournament. One year later, athletes obtain scores that are in general better than those the year before. The increase in scores might be attributable to the introduction of stricter entry conditions.

Doğru Cevap: A

Soru 61

Which of the following cannot be considered as a field of big data?

Biomedicine

Transport

Astrology

Climate data

Electronic commerce

Açıklama:

Bigdata are mostly used in following areas:

Commerce (especially online electronic commerce),
Biomedicine (especially in genetics), climate data,
Transport (for example in the airline industry)
Climate data
Insurance
Finance

Doğru Cevap: C

Soru 62

A survey on married couples asked the following questions: "What is your gender? Woman or Man"
Which type of data requested in the question?

A textual variable

An ordinal categorical variable

A continuous ratio-scale variable

A nominal categorical variable

A continuous interval-scale variable

Açıklama:

Gender is a nominal categorical variable.

Doğru Cevap: D

Soru 63

Which of the following do not constitute “big data”?

All the comments written on Facebook page

All comments posted on Youtube video

All men basketball player statistics in one season

All football teams statistics in Turkish Super League

All data that are very large numbers, every value in the billions and trillions

Açıklama:

Big Data are all forms of data we have obtained from different sources and transformed into meaningful and workable forms. All data that are very large numbers, every value in the billions and trillions do not constitute "big data" because cannot be transformed into meaningful and workable forms.

Doğru Cevap: E

Soru 64

A study was conducted to determine the effects of two different training programs on athletes. Sixty athletes were randomly assigned into two training programs each including 30 athletes. One group is given only coffee and water to drink before trainings while the other group is given only water to drink before trainings. After training programs, an assessment test was applied to the athletes. One month later, the group who had drunk coffee and water before the trainings progressed more than the group who only had drunk water before the trainings.
What do you think is the most acceptable conclusion of this experiment?

The sample size is too small to conclude anything at all; if this was done on bigger samples of people, the evidence one way or the other would be more solid.

This result is definitely a result of random variation and no attention should be paid to it.

We cannot conclude anything either way, since this is done just for two groups of athletes.

This result constitutes tentative evidence of the benefits of coffee to athletes.

Drinking coffee and water on pre-trainings appears to be beneficial to athletes. An experiment with more athletes should be undertaken.

Açıklama:

The paragraph is an example of experimental design in sports science. So, drinking coffee and water on pre-trainings appears to be beneficial to athletes. More experiments with more athletes should be undertaken in other conditions and controls.

Doğru Cevap: E

Soru 65

A market researcher's surveying customers
A teacher's qualitative inquiry of motivation
A psychologist's measure of addiction level

Which of the above is/are basic interest of statistics?

Only II

I and II

I and III

II and III

I, II and III

Açıklama:

Statistics mainly work on numerical data. Qualitative inquiry is not an interest of statistics.

Doğru Cevap: C

Soru 66

Any person have free access to data in the world
Data may be in form of text or numbers
All observations can be reduced to some numerical quantity

Which of the above is true for data?

Only I

I and II

I and III

II and III

I, II and III

Açıklama:

Data come in the form of numbers as well as text, which is something we are discovering more and more. We live in an information world and information is the virtual gold of our society. All observations can be reduced to some numerical quantity, and there are even fields of digital philosophy and digital sociology. It may seem that everything may be recorded and stored somewhere. But in reality - unless we somehow centralize and link all the databases in the world, and have free access to them - we can get access to only a small part of whatever data we are interested in.

Doğru Cevap: D

Soru 67

Assume that you are investigating the smart phone preferences of people living in Turkey and you surveyed 2000 people in Denizli.
What can you say about results?

Can give statistical estimates of preferences of Turkish people

Cannot say anything about people live in Turkey

Should gather more data from Denizli

Can present exact proportions of preferences in Turkey

Can present exact proportions of preferences in Denizli

Açıklama:

A reasonable survey would carefully select people as a sample and gathering data from Denizli would not represent Turkey. Statistics working on samples cannot give absolutely correct information. Sample being carefully selected, it can give a statistical estimate of proportions.

Doğru Cevap: B

Soru 68

Your professor is surveying you and asking your gender, hometown and branch. What form of data is s/he dealing with?

nominal

ordinal

interval

ratio

continuous

Açıklama:

Data such as gender, hometown and branch do not represent a computational score. They can only be used to categorize the sample.

Doğru Cevap: A

Soru 69

A teacher labels students successful, average and unsuccessful based on their exam marks. What form of data does s/he produce?

numerical

ordinal

interval

ratio

continuous

Açıklama:

Data labeled as successful, average and unsuccessful does not represent a computational score but does represent an order. So, it is an ordinal data.

Doğru Cevap: B

Soru 70

A scientist is studying on the yearly average temperatures in Celsius in last 100 years. What measurement type is s/he studying on?

nominal

ordinal

interval

ratio

categorical

Açıklama:

Temperature in Celsius does not have an absolute zero. Talking about 20 degrees and 40 degrees, you cannot say its twice warmer but you can say it is 20 degrees more/less. So, the measurement is interval.

Doğru Cevap: C

Soru 71

Your professor asked you to write an essay and stated that you would be assessed for the count of words in your essay. What form of numerical data would s/he study on?

nominal

ordinal

interval

ratio

qualitative

Açıklama:

Because the difference in the counts of words could be stated in percentages, it is a ratio type of measurement.

Doğru Cevap: D

Soru 72

A researcher wants to investigate the effects of tea on heart diseases. What should the researcher do in order to conclude a causal effect?

Ask a sample if they have a heart disease and how much tea they drink a day

Ask the patients with heart problems whether they are drinking tea

Give tea to a sample and observe what they are doing

Conduct an experiment with control and treatment groups

Survey a sample about their beliefs on the effect of tea on heart diseases

Açıklama:

In order to prove whether tea has an effect on heart diseases, an experiment needs to be conducted where conditions are controlled.

Doğru Cevap: D

Soru 73

Widespread social media platforms you visit everyday are gathering information about your profile, friends, tendencies, behaviors and so. What are they working on?

Identity theft

Security

World peace

Climate change

Big data

Açıklama:

When it comes to Data Science, however, the term does have some different meaning. Data Science is a field that includes Statistics as well as areas such as Computer Science, Database Management and Data Visualization, for example, and has come into being mainly as a result of the spectacular growth in the amount of available data in this new information world that we live in. The need has been recognized for someone who not only has statistical skills, but also advanced programming skills and knowledge about handling huge data sets, the so-called “Big Data” of today.

Doğru Cevap: E

Soru 74

Commerce

Insurance

Transport

In which areas above do big data set could be found?

Only I

Only III

I and II

II and III

I, II and III

Açıklama:

What are the “big data” sets today and where do they come from? These are mostly found in the following areas:
• Commerce, especially online electronic commerce
• Finance, for example share prices on stock markets, all managed electronically
• Insurance, all the premiums, incidents, actuarial transactions in an insurance company
• Biomedicine, especially in genetics, where information is literally exploding as gene-sequencing reveals and codes the total genetic profile of a person
• Transport, for example in the airline industry, all the flights, all the passengers
• Climate data, measurements from tens of thousands of weather stations across the world

Doğru Cevap: E

Soru 75

(I) The voting age is 18 in Turkey. There are 56 million people who are able to vote in Turkey.
(II) Before election, a public opinion poll was conducted with 2000 people.
(III) In this poll, 2000 people stated the party of their choice.
Which terms in the statements I, II and III above correspond to the following concepts in the statistics?

I-Categorical data
II- Variable
III- Population

I-Variable
II- Interval scale
III- Sample

I-Ratio scale
II-Continuous variable
III- Sample

I-Population
II- Sample
III- Data

I-Mean
II- Sample
III- Ordinal scale

Açıklama:

These terms are the basic concepts in statistics which are population, sample and data.
A population data set contains all members of a specified group (the entire list of possible data values). In this question, 56 million people show the entire population that can vote in the country.
A sample contains a part, or a subset, of a population. The size of a sample is always less than the size of the population from which it is taken. Here, instead of asking 56 million people in the process of public opinion poll, 2000 people selected from the entire population. We call this groups as sample.
Data are individual pieces of factual information recorded and used for the purpose of analysis. These 2000 people were asked which party they would vote for and their responses were recorded as data.

Doğru Cevap: D

Soru 76

"The number of applications downloaded to a phone".
What is the level of measurement in this problem?

Interval

Ordinal

Nominal

Textual

Ratio

Açıklama:

This is a ratio measure. It is ratio scale because it has a true value of 0 (no downloads at all). If your answer is incorrect, please review “Measurement Scales” section in the first chapter.

Doğru Cevap: E

Soru 77

What is the level of measurement of times of day as "Dawn, Morning, Noon, Afternoon, Evening, Night"

Interval

Nominal

Ordinal

Textual

Ratio

Açıklama:

The ordinal scales are categorical variables and typically measures of non-numeric concepts. This type of variable classifies according to rank.

Doğru Cevap: C

Soru 78

I- IQ levels (intelligence scale)
II- Weights of newborn babies
III-Monthly Income
IV-Types of blood such as A, B, AB and O
V-Level of Agreement: yes, maybe, no
In which of the variables given above, the level of measurement is ratio?

I - II

I-III

I-V

II-III

IV-V

Açıklama:

Only II and III are ratio variables.
I- IQ (intelligence scale): This is an interval variable. It has values of equal intervals that mean something but there is not an absolute zero. In other words, zero IQ does not mean you do not have any cognitive abilities.
II- Weights of newborn babies: This variable is continuous and ratio. It is continuous because weights (kg, g, mg or etc.) can be broken down into very small amounts.Ratio scales have a clear definition of zero.
III-Income earned in a month: It is also ratio because it has an absolute zero and the ratios are meaningful.
IV-Types of blood as A, B, AB and O. This variable is categorical and nominal: The type of blood tells us something meaningful (dominant or recessive genes etc.) but has no meaningful order.
V-Level of Agreement: yes, maybe, no. This is an ordinal variable and the order of categories cannot quantify-how much better it is. For example, is the difference between “Yes” and “Maybe” the same as the difference between “Maybe” and “No” ? We can’t say.

Doğru Cevap: D

Soru 79

What is the level of a measurement for a variable with outcomes defined as "completely agree", "mostly agree", "neutral", "mostly disagree", "completely disagree" when measuring people's opinion about a particular subject?

Interval

Ratio

Ordinal

Textual

Nominal

Açıklama:

This is an ordinal (categorical) scale because ordinal scales provide an information about the order of choices, such as in a people opinion survey.

Doğru Cevap: C

Soru 80

A researcher believes that playing piano is the explanation for increased mathematics academic success. In order to get an understanding on this, the researcher collected data by conducting a survey, asking elementary students his or her level of math ability (grades) and whether they play a piano or not.
Which of the following statements about the research is wrong?

The elementary students' characteristics cannot be manipulated

The research problem is about a causal relationship

This type of study refers to an observational research

The researcher has no control of the variables

The researcher can only describe the phenomena as they exist

Açıklama:

This is an observational study. Observational research is non-experimental because nothing (variables, participants etc.) is manipulated or controlled, and as such we cannot arrive at causal conclusions using this approach. the goal is to obtain a snapshot of specific characteristics of an individual, group, or setting.

Doğru Cevap: B

Soru 81

What type of measurement level is the reason for the people to travel shown in the figure above?

Interval

Nominal

Ordinal

Ratio

Textual

Açıklama:

The pie graph shows a nominal variable.A nominal scale describes a variable with categories that do not have a natural order or ranking. Here, the frequency of the categories of the nominal variable is shown in percentages. Numbers do not indicate that the variable is a numerical variable, it just refers to the frequency of the categories.

Doğru Cevap: B

Soru 82

A researcher wants to know if field trips will improve science academic achievement. For this purpose, it forms two separate groups from 6th grade students. One of the groups regularly goes to field trips (museum visits, science centers, etc.) once a week for 1 year. Students in the other group only take science lessons in the classroom. Science academic achievements of children are measured and compared periodically throughout the year.
Which of the following statements about this research is correct?

The science course's related variables cannot be manipulated

The researcher has no control of the variables

The research problem is about a causal relationship

This type of study refers to an observational research

The researcher can only describe the phenomena as they exist

Açıklama:

It shows an experimental research. Experiment is a type of study designed specifically to answer the question of whether there is a causal relationship between at least two variables. In other words, whether changes in an independent variable (science course design: with field trip or without field trip) cause a change in a dependent variable (science academic achievement). Experiments have two fundamental features. The first is that the researchers manipulate , or systematically vary, or control the level of the independent variable.

Doğru Cevap: C

Soru 83

Which of the following is a newborn concept relative to others in statistics?

Big data

Geometric mean

Observational research

Median

Experimental Data

Açıklama:

Big data refers to the large, diverse sets of information that grow at ever-increasing rates. It encompasses the volume of information, the velocity or speed at which it is created and collected, and the variety or scope of the data points being covered. Big data often comes from multiple sources and arrives in multiple formats. Other concepts are old terms commonly used in statistics.

Doğru Cevap: A

Ünite 2

Soru 1

Statistical analysis requires that the factual information of interest in a research be collected and organized in a useful manner. Which one below refers to such facts?

Element

Data sets

Data

Variable

Case

Açıklama:

Statistical analysis requires that the factual information of interest in a research be collected and organized in a useful manner. Such facts are described as data.

Doğru Cevap: C

Soru 2

Which statement below is correct about the table above?

"Gender" is a quantitative data.

"Stephen" is a data set.

"Ronnie" is a qualitative data.

"Age" is a qualitative data.

"Susan" is an element.

Açıklama:

A data set is a collection of facts aggregated for a specific purpose. Elements are the entities on which the data are collected. In the table above, an element of the data set is a particular worker. For instance, worker "Ronnie" is an element of the data set. Age is a quantitative variable because it takes on numerical measurements. However, gender is a qualitative variable because its outcomes are nonnumeric. Of the four variables in the data set in the table above, two are qualitative (Name and Gender) and two are quantitative (Age and Weekly Wage).

Doğru Cevap: E

Soru 3

Which statements below are correct about the table above?
I Mark's weekly wage being 320 is a data set.
II Being 56 years old male, Mark's earning 320 dollars per week is a case.
III Mark's being male is a qualitative data.
IV Mark's being 56 years old is a qualitative data.

Only I

II and III

III and IV

II, III and IV

All of the above

Açıklama:

The outcomes obtained on all variables for one element in the data set is called a case. Sometimes a case is defined as a record or observation vector. For example, in the table above the outcomes on the four variables for worker Mark constitutes a case.

Doğru Cevap: B

Soru 4

There are four participants in our research and their names are Tom, Jane, Tim and Beth. We assign arbitrarily 1 for Tom, 2 for Jane, 3 for Tim and 4 for Beth. Which word below describes the meaning of number 4 in our research?

case

nominal data

numeral

observation

ordinal scale

Açıklama:

Nominal scales of measurement classify things or individuals into qualitatively different classes. For example, the variable gender has two categories, female and male. Thus, researches could describe sex of the workers using a nominal scale by categorizing people as female and male. Typically, we can use numerals instead of strings to represent individuals’ genders. For example, we can arbitrarily assign number 0 for females and number 1 for males.

Doğru Cevap: C

Soru 5

We ask the students to number the most important language skill for them in their academic classes as 1 and the least important one as 2. Which option below best describes this measurement?

Ordinal scale

Nominal scale

Interval Scale

Ratio Scale

None of the above

Açıklama:

Ordinal scales of measurement have the property of both classifying and magnitude. Subjects are categorized into different rank ordered groups. Each value on the ordinal scale has a unique meaning, and it has an ordered relationship to every other value on the scale. Suppose we want to measure customers’ preferences for five brands of chocolates, brands A, B, C, D, and E. We could ask each customer to rank order the five brands by assigning number 1 to the most preferred brand, number 2 to the next most preferred brand, and so on.

Doğru Cevap: A

Soru 6

In this type of measurement scale, there is a natural or zero-valued base value that cannot be changed. What is the name of this scale?

Nominal Scale

Ordinal scale

Cardinal Scale

Interval Scale

Ratio Scale

Açıklama:

Ratio scales of measurement, in addition to having all properties of the interval scale, have a natural or zero-valued base value that cannot be changed. For example, an individual’s age, weight, height, systolic blood pressure are ratio scale variables because they have natural base value. For example, John and Mary are 20 and 40 years old, respectively. We can say that Mary is two times older than John.

Doğru Cevap: E

Soru 7

In order to investigate the impact of playing online games on developing English speaking skills, we make a group of students play online games for four hours a day and prevent another group of students playing any English online games. What type of research we are conducting?

Case

Sampling

Observational

Experimental

Interval

Açıklama:

Experimental study is a study in which the researcher manipulates some of the variables and try to determine how the manipulation influences other variables. In an experimental study, one or more independent variables are controlled so as to obtain information about their influence on the dependent variable. However, researchers cannot control all the variables having effects on the dependent variable. In this case, randomization techniques are applied to balance out the influence of any uncontrolled variable that might affect the variable of interest. Suppose we want to investigate the effects of exercise on cold by using an experimental design. For this purpose, we obtain a group of individuals who are the volunteers to participate the study. Then, we randomly assign the participants to the treatment (exercise) and control (no exercise) groups. After a lapse of time, we record the number of colds for each individual from the two experimental groups.

Doğru Cevap: D

Soru 8

Which type of response does the question above require?

Open-ended response

Multiple response

Ranked response

Rated response

Clarity response

Açıklama:

Strongly Agree - Agree - Undecided / Neutral - Disagree - Strongly Disagree
Always - Often - Sometimes - Seldom - Never
Extremely - Very - Moderately - Slightly - Not at all
Excellent - Above Average - Average - Below Average - Very Poor

Doğru Cevap: D

Soru 9

Which one below is one of the limitations of interview method?

The individuals included in the study might alter their behavior.

Direct contact with the responders avoids misunderstanding of the questions.

People will tend to give answers to the question when they are approached personally.

The data collection by interview usually includes irrelevant information from those people who are conducted.

The researcher may select an irrelevant individual about the study, this leads to bias into the results.

Açıklama:

The advantages of the data collection by interview are:
1. People will tend to give answers to the question when they are approached in person or by telephone, so the data collection by interview usually includes usable information from those people who are conducted.
2. Direct contact with the responders avoids misunderstanding of the questions.
On the other hand, the limitations of the interviewing method are:

If the questioner does not obey the rules for selecting individuals or may select an irrelevant individual about the study, this leads to bias into the results.
The questioner may affect the individuals’ opinion about a question and this leads to get incorrect answers.
The questioner may make recording errors.

Doğru Cevap: E

Soru 10

A university student who successfully completed the course filled out the assessment questionnaire about the lecturer. What type of research method is mentioned here?

Observation

Interview

Self-enumeration

Open-ended

Frequency

Açıklama:

In a self-enumeration method, individuals answer the questions printed on a questionnaire paper, or displayed on a computer monitor. In other words, self-enumeration method refers to the completion of survey questionnaires by the respondents themselves. Some of the examples are as follows:

A recent customer checked out from a five-star hotel received a self-enumeration satisfaction questionnaire through the e-mail that request information about the hotel activities.
A university student who successfully completed the course filled out the assessment questionnaire about the lecturer.

Doğru Cevap: C

Soru 11

There is a data set of clinic’s patients above. Which of the following statements about this table is false?

The weight in the table is a qualitative variable

In the table, an element of the data set is a particular patient, for example Ahmet

Age is a variable and takes on different values for different patients

90 kg is the observation on the variable weight for patient Gökhan

In the table, the outcomes on the four variables for patient Elif constitutes a case

Açıklama:

A data set is a collection of facts aggregated for a specific purpose. Elements are the entities on which the data are collected. In the table, an element of the data set is a particular patient. A variable is a characteristic of interest about an element. This characteristic takes on different values for different elements. The Gender and the name in the table are a qualitative variable because their outcomes are nonnumeric. Of the four variables in the data set in table, two are qualitative (Name and Gender) and two are quantitative (Age and Weight). The outcomes obtained on all variables for one element in the data set is called a case.

Doğru Cevap: A

Soru 12

Which scales of measurement have a natural or zero-valued base value that cannot be changed?

Ratio scale

Interval scale

Ordinal scale

Nominal scale

Qualitative scale

Açıklama:

Doğru Cevap: A

Soru 13

Which scales of measurement have the properties of classifying, magnitude, and equal intervals?

Ratio scale

Interval scale

Ordinal scale

Nominal scale

Quantitative scale

Açıklama:

Interval scales of measurement have the properties of classifying, magnitude, and equal intervals. While the ordinal scales of measurement show that individuals have more or less something than the others, interval scales have more precise information indicating how much of something individuals have.

Doğru Cevap: B

Soru 14

Which of the following is an internal data source for the firm?

Reference books

Newspapers

Sectoral magazines

Statistics published by governments

Firm’s accounting records

Açıklama:

We can obtain some data from an internal data source, such as an organization’s operating and accounting records. These routine data are usually saved in computer data files or databases for efficient entry, storage, and retrieval of information. Internal data is obtained from inside the company for successful operations. The information obtained from internal data source is important to determine the company strategies. We usually obtain data from external data sources. External data sources may be a reference book or statistical periodical published by a government agency, a trade association, or a private service company.

Doğru Cevap: E

Soru 15

Experimental study is a study in which the researcher manipulates some of the variables and try to determine how the manipulation influences other variables.
In an observational study, researchers simply collect data based on what is seen and heard and infer based on the data collected.
In an observational study, one or more independent variables are controlled so as to obtain information about their influence on the dependent variable.

Which of the above statements are true?

I, II

I, III

II, III

I, II, III

Açıklama:

Experimental study is a study in which the researcher manipulates some of the variables and try to determine how the manipulation influences other variables. In an observational study, researchers simply collect data based on what is seen and heard and infer based on the data collected.

Doğru Cevap: B

Soru 16

A researcher recorded the observed daily closing prices of several publicly traded common stocks for a financial study.
A sales manager of a company conducted a research about purchases of a specific product of the company.
A university student who successfully completed the course filled out the assessment questionnaire about the lecturer.

What are the correct definitions of the data collection methods given above?

I-Observation method, II-interview method, III-self-enumeration method

I-interview method, II-observation method, III-enumeration method

I-self-enumeration method, II-interview method, III-observation method

I-observation method, II-self-enumeration method, III - interview method

I-interview method, II-enumeration method, III- observation method

Açıklama:

Observation is making direct examination and taking measurements of an ongoing activity. In other words, observation is way of obtaining data by watching behavior, events, or noting physical characteristics in their natural setting. One of the most common methods of collecting data from individuals is interviewing. In an interview procedure, a researcher or observer asks the questions from a questionnaire and records the individual’s answers. In a self-enumeration method, individuals answer the questions printed on a questionnaire paper, or displayed on a computer monitor. In other words, self-enumeration method refers to the completion of survey questionnaires by the respondents themselves.

Doğru Cevap: A

Soru 17

The above table contains the daily sales data of a market. We want to construct grouped frequency distribution table for the data. What is the class width of the data?

100

110

120

Açıklama:

The first step in constructing the grouped frequency distribution table is to determine the number of classes.

Doğru Cevap: A

Soru 18

The above table contains the daily sales data of a market. We want to construct relative frequency distribution table for the data. What is the first class’ relative frequency?

0,32

0,47

0,53

0,60

0,64

Açıklama:

Doğru Cevap: A

Soru 19

The frequency distribution table of the students’ performance scores of a school were constructed as follows. What is the ratio of the students whose score under 80?

0,88

0,84

0,80

0,73

0,70

Açıklama:

The ratio of the students whose score under 80 is 0,88

Doğru Cevap: A

Soru 20

We ask each customer to rank order the three brands by assigning number 1 to the most preferred brand. Customers assigns Number 2 to the next most preferred brand and so on. Which scale is used in this study?

Interval scale

Ratio scale

Ordinal scale

Nominal scale

Quantitative scale

Açıklama:

Doğru Cevap: C

Soru 21

Element
Variable
Concept
Case

Which of the above are the key components of a data set?

I and II

II and III

I, II and III

I, II and IV

I, III and IV

Açıklama:

Several characteristics define a data set’s structure and properties. Element, Variable, Case and Observation are the key components of data sets.

Doğru Cevap: D

Soru 22

Which of the followings could be a measure of a ratio scale?

students' ranking in a class

different classes of same level

degree of attitude towards science class

gender of students in a class

weights of students in a class

Açıklama:

Doğru Cevap: E

Soru 23

Which of the following is the measurement of a magnitude without equal intervals?

Nominal scale

interval scale

ordinal scale

ratio scale

qualitative scale

Açıklama:

Doğru Cevap: C

Soru 24

data is collected based on what is seen or heard

researcher do not intervene to the subjects

variables might be manipulated by the researcher

Which of the above is/are the characteristics of observational studies?

Only I

Only III

I and II

I and III

II and III

Açıklama:

In an observational study, researchers simply collect data based on what is seen and heard and infer based on the data collected. Researchers observe subjects and measure variables of interest without any intervention to the subjects. Experimental study is a study in which the researcher manipulates some of the variables and try to determine how the manipulation influences other variables.

Doğru Cevap: C

Soru 25

the individuals included in the study might be aware of this
observer must record the events correctly
data could be obtained over an extended period of time

Which of the above is/are advantage(s) of observation method?

Only I

Only III

I and II

I and III

II and III

Açıklama:

Data collection by observation procedure has some advantages and limitations. The advantages are:
1. The direct recording of the data avoids problems such as incomplete or distorted recall.
2. Data can be obtained continuously over an extended period of time.
The limitations are:
1. The observer or the instrument to be used for data gathering must be able to record the events correctly. For example, human observers must get through training about the study and the data to be collected and so that different observers will record the same events in the same manner.
2. The individuals included in the study might be aware of this fact and then altered their behavior, decision or answers. This leads to bias in the study.

Doğru Cevap: B

Soru 26

What is the underlying reason of using closed-ended questions rather than open-ended question in many surveys?

Gathering more detailed data

Obtaining higher response rates

Being easier to read

gathering more accurate data

Making responder think deeper

Açıklama:

In many surveys, closed-ended questions are preferred because close-ended questions lead to obtain higher response rates when responders don’t have to type so much.

Doğru Cevap: B

Soru 27

Which of the following is generally used to measure the attitudes of individuals towards a subject?

Open-ended questions

Multiple responses

Ranked responses

Rated responses

Ordered responses

Açıklama:

Likert type of scale is generally used to measure the attitudes of an individual towards a specific subject.

Doğru Cevap: D

Soru 28

Data set is large
Measurements type is ratio scale
Interpretation should be easier

In which condition(s) above, it is more appropriate to use grouped frequency for summarizing data?

Only I

Only II

Only III

I and II

I and III

Açıklama:

When the data set is large or the measurements are obtained using ratio scale, grouped frequency is more appropriate for summarizing the data.

Doğru Cevap: D

Soru 29

Which of the following is true about relative frequency distribution table?

It could be easier or clearer to interpret the table when using percentage of the frequency

It provides information of how many observations occurred for each value

It is more appropriate to use when data set is too large

It is better to use if measurement is obtained using ratio scale

It is used to determine the number of elements that falls above or below a particular value

Açıklama:

Interpretation of the frequency distribution table can be easier or clearer when we use the percentage of the frequency.

Doğru Cevap: A

Soru 30

Which of the following is true about cumulative frequency distribution table?

It could be easier or clearer to interpret the table when using percentage of the frequency

It consists of classes and the number of elements in these classes

It is more appropriate to use when data set is too large

It is better to use if measurement is obtained using ratio scale

It can be used to determine the number of elements that falls above or below a particular value

Açıklama:

A frequency distribution table provides information of how many observation or elements occurred for each value or group of values of a variable. Cumulative frequency is used to determine the number of elements that falls above or below a particular value in a given class interval.

Doğru Cevap: E

Soru 31

Which of the following is not an component of data sets?

Element

Variable

Case

Observation

Analysis

Açıklama:

Analysis is not a component of data sets. Analysis can be done over datasets.

Doğru Cevap: E

Soru 32

Which of the following is a nominal scale type data?

Age

Height

Weight

Gender

Income

Açıklama:

Nominal scale classifies things into qualitative different classes. Gender is qualitative.

Doğru Cevap: D

Soru 33

Which of the following scales have a natural or zero-valued base that cannot be changed?

Ratio scale

Interval scale

Nominal scale

Ordinal scale

Cardinal scale

Açıklama:

That type is ratio scale. An example of this type scales is age. The age of a person can be zero at minimum. So if one person is 60 years old, he/she is 3 times older than a person who is 20 years old.

Doğru Cevap: A

Soru 34

A researcher goes to a bus station and takes record of the number of people getting into a bus in that station everyday. Which type of data collection method is the researcher using?

Interview

Observation

Self-Enumeration

Questionnaire

Sampling

Açıklama:

The researcher is making an observation.

Doğru Cevap: B

Soru 35

"Which type of transportation vehicle do you use most often?
Answer:.............................................................."
Which type of questionnaire question is the one above?

Multiple Response

Single Response

Open-Ended

Closed-Ended

Ranked

Açıklama:

It's a open-ended question type because the respondent can freely answer whatever he thinks of. He doesn't choose among or rank the given options.

Doğru Cevap: C

Soru 36

How can we determine whether respondents are interpreting questions as intended and whether the order of questions may influence responses?

By conducting a pretest over a small sample of survey population.

By carefully reviewing the survey questionnaire.

By analyzing the results of the survey.

By discussing the questionnaire questions with an experienced statistician.

We can never determine this.

Açıklama:

A pretest over a small sample can help us in determining whether questions are clearly understood and whether the order of questions cause a difference in results.

Doğru Cevap: A

Soru 37

An advertisement company conducts a survey on parfume preferences of adults in a city in which the half of the population is female. The researcher collects a sample of 200 people of whom 170 are male.
What type of error does the researcher make?

Error in population spesification

Error in measurement

Error in sampling

Error in modelling

Error in analyzing

Açıklama:

Since almost half of the population is female, he is making a mistake in sampling. The parfume preferences of males and females could be different and his sample is male biased.

Doğru Cevap: C

Soru 38

A researcher collects data about the weight of pupils in a school. There are 500 students in that school whose weight differs from 20kgs to 40kgs.
If this researcher wants to constitute a grouped frequency distribution table, what is the class width for this case?

0.04 kg

0.5kg

0.89 kg

1 kg

2 kg

Açıklama:

Class Width=Range/Number of classes
where Number of Classes=√n, where n=number of observations
Thus Number of Classes=√500=22.36
But when the number of classes is larger than 20 we take it as 20. So in this case:
Class Width=(40-20)/20=20/20=1kg

Doğru Cevap: D

Soru 39

The number of workers in a certain factory is given below. What is the cumulative frequency of of workers whose age is less than 50?

Age Range	Frequency
20-24	50
25-29	40
30-34	35
35-39	20
40-44	25
45-49	20
50-54	10

0.65

0.675

0.775

0.8

0.95

Açıklama:

There are 200 workers in factory and in total 190 them are below 50 years old. Thus the cumulative frequency is 190/200=0.95 (95 %)

Doğru Cevap: E

Soru 40

Which of the following terms stands for errors caused by unknown and unpredictable factors?

Systematic error

Random error

Measurement error

Specification error

Sampling error

Açıklama:

The definition corresponds to random error. Random errors are caused by unknown and unpredictable factors that randomly affect measurement of the variable across the sample.

Doğru Cevap: B

Soru 41

According to the data set given in the following table, what constitutes the case for John?

Male

52, 4300

Female, 52

Male, 52, 4300

Male, 45, 10000

Açıklama:

The outcomes obtained on all variables for one element in the data set is called a case. In this table, the outcomes on the four variables for John constitutes a case. name,gender, age, salary= John, Male, 52, 4300

Doğru Cevap: D

Soru 42

Which is TRUE about the table below specifying types of books a particular customer prefers?

It has the properties of classifying, magnitude, and equal intervals.

It has a natural or zero-valued base value that cannot be changed.

It is important that there is no any particular order or ranking for classes.

The consecutive categories do not represent equal differences of the measured attribute.

It is considered a nominal scale of measurement.

Açıklama:

Ordinal scales of measurement have the property of both classifying and magnitude. Subjects are categorized into different rank ordered groups. Each value on the ordinal scale has a unique meaning, and it has an ordered relationship to every other value on the scale. From the table, we can conclude that the customer prefers historical fiction to science fiction, science fiction to detective fiction, detective fiction to romance. However, even though the differences in the consecutive numbers of the ranks are equal, we cannot say that how much the customer prefers one type of book over another type. That is, consecutive categories do not represent equal differences of the measured attribute.

Doğru Cevap: D

Soru 43

An English teacher wants to look into the effect of a certain teaching strategy on learning. Which is NOT TRUE about her study?

The researcher will conduct an observational study.

The researcher will manipulate some of the variables.

The researcher will apply randomization techniques.

The researcher will record the effectiveness of the new strategy.

The researcher will have experimental and control groups.

Açıklama:

In observational studies, researchers observe subjects and measure variables of interest without any intervention to the subjects. In this case, the teacher wants to learn if the new strategy affects the learning. The researcher needs to manipulate some of the variables and try to determine how the manipulation influences other variables.

Doğru Cevap: A

Soru 44

An employee is given a survey where he is given three options like "Agree", "Undecided", and "Disagree". What type of responses does he need to give?

Close-ended responses

Ranked responses

Rated responses

Multiple responses

Single responses

Açıklama:

Rated responses generally include three-point, five-point, and seven-point scales. A rating scale should provide more than two options. The mostly used rating scale is five-point Likert (1932) type scale. Likert type scale can be designed in the following forms.
• Strongly Agree - Agree - Undecided / Neutral - Disagree - Strongly Disagree
• Always - Often - Sometimes - Seldom - Never
• Extremely - Very - Moderately - Slightly - Not at all
• Excellent - Above Average - Average - Below Average - Very Poor

Doğru Cevap: C

Soru 45

Which one is TRUE about error in sampling?

It occurs when the researcher determines an inappropriate population from which to collect data.

It can be described as any discrepancy between the actual result obtained and
the correct result that would be provided by an ideal procedure.

It is caused by unknown and unpredictable factors that randomly affect measurement
of the variable across the sample.

It arises from problematic, poor calibrated or incorrectly used equipment.

It arises from not representing the targeted population and the results yield biased or inaccurate information.

Açıklama:

Sample in statistics means a small part of the targeted population. A sample must be representative of the population. Sampling methods must be used to achieve a representative sampling. Otherwise, the sampling does not represent the targeted population and the results yield biased or inaccurate information.
Error in population specification occurs when the researcher determines an inappropriate population from which to collect data.
Error in measurement can be described as any discrepancy between the actual result obtained and the correct result that would be provided by an ideal procedure. From a statistical point of view any observation is composed of the true value plus some random error value. However, all error is not random. The error component of any observation can be divided into two subcomponents, random error and systematic error.
Random errors are caused by unknown and unpredictable factors that randomly affect measurement of the variable across the sample. For example, a school teacher conducted a particular survey on the students to measure their performances. Some students may be feeling in a good mood and others may be depressed. This may artificially deflate performance scores of the depressed students. Random error does not have any consistent effects across the entire sample. Instead, it affects observed scores up or down randomly. Random error adds variability to the data and it is sometimes called noise.
Systematic errors are reproducible inaccuracies that shift measurements from their true value by the same amount and consistently in the same direction. This type error arises from problematic, poor calibrated or incorrectly used equipment. For example, an industrial scale showed heavier weights than it should be for a particular product because it was not calibrated properly and thereby provided incorrect measurements.

Doğru Cevap: E

Soru 46

Customers asked to respond to the following statement. “The picture quality of your TV is satisfactory”. Customers responded to the statement as 1=Strongly Disagree, 2=Disagree, 3=Undecided, 4=Agree, and 5=Strongly Agree.
If the measurement categories and the number of responses within a given measurement category are used, what method of data organization is implemented in the case specified above?

Raw data

Grouped Frequency Distribution Table

Frequency Distribution Table

Cumulative Frequency Distribution Table

Relative Frequency Distribution Table

Açıklama:

Sometimes raw data in a frequency distribution table yield more useful information. To construct a frequency distribution table, the measurement categories and the number of responses within a given measurement category are used.

Doğru Cevap: C

Soru 47

There are 85 people interviewed on their weekly salary. If the frequency for Class 2 is 21 what is the percentage of relative frequency?

0,247

24,7

21,3

2,13

0,85

Açıklama:

Interpretation of the frequency distribution table can be easier or clearer when we use the percentage of the frequency. Percentage representation of frequency can also be displayed in the frequency distribution table. Percentage of frequency is called the relative frequency and the table is called relative frequency distribution table. Relative frequency can be used for both quantitative and qualitative variables. The relative frequency for a class is calculated as follows: Relative Frequency = fi/ n where, fi is the frequency for class i and n is the sample size. In this case, fi is 21and n is 85. The percentage of relative frequency is 24,7.

Doğru Cevap: B

Soru 48

In a salary data example, the frequency table shows that there are 20 workers of whose weekly salaries are between 415 and 525 dollars. Which of the below should we use if we need to know, for example, how many workers earn under 525 dollars?

Class Interval

Frequency

Cumulative Relative Frequency

Cumulative Frequency

The Percentage of Cumulative Relative Frequency

Açıklama:

We can obtain such information by using cumulative frequency. Cumulative frequency is used to determine the number of elements that falls above or below a particular value in a given class interval. The cumulative frequency of a class is calculated by adding its frequency to the sum of all predecessor class frequencies. Consequently, the last value must be equal to the sample size.

Doğru Cevap: D

Soru 49

Which is NOT TRUE about the table below?

There are three categorical variables.

It is a contingency table of the variables gender and excel knowledge.

We may conclude that number of males who have technical knowledge is greater than those in female.

It is used to determine if one categorical variable is related to another categorical variable.

It excludes relative frequencies or percentages.

Açıklama:

In this table, there are two categorical variables. One is the variable gender which has two categories, female and male. The other categorical variable is the variable excel knowledge which has two categories, yes and no. Thus, there are two categorical variables.
Ifa data set includes two different categorical variables, we use a two-way table (contingency table) todemonstrate the relationship and interaction of the two categorical variables. A two-way table of counts organizes data about two categorical variables measured from the same set of individuals. A contingency table is a special type of frequency distribution table, where two variables are shown simultaneously and
it is used to determine if one categorical variable is related to another categorical variable.

Doğru Cevap: A

Soru 50

According to the contingency table below, what are the percentage of male and female respectively within the people who do not have technical knowledge (No)?

75%, 25%

35%, 65%

50%, 50%

40%, 60%

20%, 80%

Açıklama:

When we create the contingency table with row and column percentages, within the people who do not have technical knowledge (No) the percentage of male and female are 50% and 50%, respectively.

Doğru Cevap: C

Soru 51

What are the gray, orange and green highlighted places in the dataset table above, called respectively?

Case-Element-Variable

Observation-Case-Data

Data-Variable-Element

Variable-Observation-Case

Element-Case-Variable

Açıklama:

The statements given in the first row of the table show the variables in the data set. Variable is a characteristic, number, or quantity that increases or decreases over time, or takes different values in different situations (e.g. income, age, weight, etc., and “occupation”, “industry”, “disease”, etc.).
The outcome about a single variable for an element in the data set is called an observation.
The outcomes obtained on all variables for one element in the data set is called a case. Sometimes a case is defined as a record or observation vector.
Variable-Observation-Case

Doğru Cevap: D

Soru 52

Self-reported data for high school students is presented in the table above. In which of the following correctly refers to the measurement levels of the variables "career plans and age"?

Career Plans: Nominal scale
Age: Ratio scale

Career Plans: Interval scale
Age: Interval scale

Career Plans: Ordinal scale
Age: Interval scale

Career Plans: Ordinal scale
Age: Ratio scale

Career Plans: Nominal scale
Age: Ordinal scale

Açıklama:

Nominal variables are used to “name,” or label a series of values.
Ordinal scales provide good information about the order of choices, such as in a customer satisfaction survey.
Interval scales give us the order of values and the ability to quantify the difference between each one.
Ratio scales give us the ultimate-order, interval values, plus the ability to calculate ratios since a “true zero” can be defined.
Career Plans: Nominal scale
Age: Ratio scale

Doğru Cevap: A

Soru 53

Self-reported data for high school students is presented in the table above. How many interval scales are there in the data set above?

None

Açıklama:

Only grade point average (GPA) refers to a variable on interval scale. Gender and career plans are nominal variables, grade level and language proficiency level are on ordinal scale, and age is on ratio scale.
1

Doğru Cevap: B

Soru 54

Self-reported data for high school students is presented in the table above.
How many ordinal variables are in the dataset table above?

None

Açıklama:

Grade level and language proficiency level are on ordinal scale. Grade point average (GPA) refers to a variable on interval scale, gender and career plans are nominal variables, and age is ratio type variable.
2

Doğru Cevap: C

Soru 55

What are dependent (DV) and independent (IV) variables in a study to determine whether how long a student sleeps, studies, and solves the number of questions affects exam scores?

DV: Exam score and the length of time spent sleeping
IV: Number of questions solved

DV:The length of time spent studying
IV: Exam score, number of questions solved

DV: Exam score
IV: Number of questions solved, the length of time spent sleeping, and studying

DV:The length of time spent studying and number of questions solved
IV: Exam score

DV:Number of questions solved, the length of time spent sleeping
IV: The length of time spent studying and exam score

Açıklama:

Independent variables are controlled inputs. Dependent variables represent the output or outcome resulting altering these inputs. In other words, you can consider the independent variable as the cause and the dependent variable as the effect.
The independent variable s are the length of time spent sleeping, studying and the number of questions solved while the dependent variable is the exam score.

Doğru Cevap: C

Soru 56

Researchers collected data from 100 men aged 40 of whom 50 have been smoking a pack of cigarettes a day for 5 years while the other 50 have been smoke free for 5 years. They measured their lung capacity for each of the 100 men, analyzed, and drawn conclusions from the collected data. According to the study described above, what type of study is it?

True-experimental study

Quasi-experimental study

Longitudinal study

Observational study

Correlational study

Açıklama:

This study refers to an observational study in which researchers observe subjects (participants) and measure variables of interest without assigning treatments to the subjects. The treatment that each subject receives is determined beyond the control of the investigator.

Doğru Cevap: D

Soru 57

Researchers wanted to evaluate the effectiveness of eKampüs Anadolum system at the end of the semester with a questionnaire. In this questionnaire, students were asked to indicate how often they used the system by marking on the following scale:
(1) none
(2) 1-2 times in the semester
(3) 1-2 times per month
(4) 1-2 times per week
(5) Every day
According to the description given above, what type of question is this?

Open-ended

Ranked responses

Multiple responses

Rated Responses

Single response

Açıklama:

Doğru Cevap: D

Soru 58

Suppose that researchers collected a random sample of 2500 people from the general Turkish adult population to gauge their entertainment preferences. Then, upon analysis, found it to be composed of 75% males.
What type of error was made if this sample would not be representative of the general adult population and would influence the data?

Measurement error

Nonresponse error

Population specification

Analysis error

Sampling error

Açıklama:

Sampling error is affected by the homogeneity of the population being studied and sampled from and by the size of the sample. In the Turkish population, the female to male ratios are almost equal. In order to avoid this error you can increase the size of your sample so you get more survey participants.

Doğru Cevap: E

Soru 59

The frequency distribution of the students’ weights was constructed as the following table. Consider the grouped data table, what is the ratio of the students whose weights are above 65?

0,30

0,37

0,43

0,49

0,52

Açıklama:

0,43

Doğru Cevap: C

Soru 60

The contingency table constructed between the smoking status and age of the participants is as follows.
Within the non-smokers, what is the percentage of participants under 30?

20%

30%

40%

50%

60%

Açıklama:

Non-smokers under 30= 20 people
Non-smokers 30 & over= 30 people
Total non-mokers: 20+30=50
Within smokers, (20x100)/50= 40% age under 30 years.
(30x100)/50= 60% age <30 years.
40% under 30 years old.
60% of them 30 & over.

Doğru Cevap: C

Ünite 3

Soru 1

The graphic above shows the frequency distribution of five continents being visited by 15 people.
What kind of a graphic is used to show the frequency distribution in the figure above?

Pie chart

Line chart

Scatter plot

Bar Chart

Dot plot

Açıklama:

The pie chart of the continent data is shown in the figure above. As it can be seen from the figure, each slice of pie chart corresponds to a continent, showing a percentage of people who travelled to this continent.The figure 3.13a illustrates each slice coloured differently in a two dimensional space. The correct option is A.

Doğru Cevap: A

Soru 2

Suppose that 100 students were asked what type of transportation they use to travel home. Using the student responses given in the table below, construct the pie chart of this data. Which of the response do you think take the smallest portion?

By car

By bike

By bus

By tram

On foot

Açıklama:

The number of students who travel home by car is the smallest so travelling by car will have the smallest portion on the pie chart. The correct answer is A.

Doğru Cevap: A

Soru 3

"It is almost the easiest of the graphs. It can be drawn by hand easily while collecting data. It will be very useful when the number of objects in our study is rather small such as up to 50 observations. It is generally used to investigate univariate (quantitative) data, but sometimes it is used to compare two variables. Essentially it is a one-dimensional scatterplot of observed values of a variable." Which type of graphic is described in the paragraph above?

Pie chart

Line chart

Dot plot

Histogram

Bar chart

Açıklama:

Dot plot is almost the easiest of the graphs. It can be drawn by hand easily while collecting data. Dot plot will be very useful when the number of objects in our study is rather small such as up to 50 observations. Dot plot is generally used to investigate univariate (quantitative) data, but sometimes it is used to compare two variables. Essentially a dot plot is a one-dimensional scatterplot of observed values of a variable. The correct answer is C.

Doğru Cevap: C

Soru 4

Which pie chart is the correct one to analyze the data presented in the table below?

Açıklama:

According to the table, Amanda gets the largest and Liam gets the smallest portion. Therefore, the correct answer is D.

Doğru Cevap: D

Soru 5

In the graphic above, the number of activities each student did is presented. What kind of a graphic is used to present this data?

Line chart

Grouped bar chart

Histogram

Stacked bar chart

Steam and leaf display

Açıklama:

The information about several subgroups of each category can be shown by a grouped bar chart. It can be plotted in horizontal or vertical directions similar to simple bar chart. In grouped bar chart, for each main category there are different sub-categories. In this chart the main categories are the names of the students and the subcategories are the activity types. The correct answer is B.

Doğru Cevap: B

Soru 6

In the graphic above, the number of activities that five students did during a semester is present in a grouped bar chart. According to this data, which student attended the most concerts?

Jack

Mary

Liam

Amanda

George

Açıklama:

The number of concerts that the students attended is shown by the color green. Amanda has the highest green bar. The correct answer is D.

Doğru Cevap: D

Soru 7

Which of the following is FALSE about histogram ?

It is very similar to a bar chart.

It shows continuous data.

It is drawn for qualitative data.

It can tell about the peaks and extreme values.

It helps us to identifty the symmetry of the data.

Açıklama:

Histogram is a graph that is very similar to a bar chart except that bar charts are drawn for qualitative data but histograms are drawn for continuous data. The correct answer is C.

Doğru Cevap: C

Soru 8

"It is often used to display the trends in a continuous data over a period of time. It also works well with discrete (ordered) or categorical types of data. It is constructed by intersecting the points by lines on the x-axis. Some of them are used to draw in two or three dimensions. Additionally, some of them are very helpful to show the relationships for multivariate data."
What kind of a graphic is described in the paragraph above?

Bar chart

Pie chart

Histogram

Dot plot

Line chart

Açıklama:

Line chart is often used to display the trends in a continuous data over a period of time. Line chart also works well with discrete (ordered) or categorical types of data. The chart is constructed by intersecting the points by lines on the x-axis. Some line charts are used to draw in two or three dimensions. Additionally, some of the line charts are very helpful to show the relationships for multivariate data.The correct answer is E.

Doğru Cevap: E

Soru 9

What type of a graphic is used to investigate the relationship between two variables?

Dot plot

Pie chart

Histogram

Scatter plot

Line chart

Açıklama:

Scatter plot is used to investigate the relationship between two variables. They are also very helpful indicating the minimum, maximum or outliers of the variables.The correct answer is D.

Doğru Cevap: D

Soru 10

I. the shape of the distribution of the data
II.the relation between two sets of data
III.the most repeating observations
Which of the things above a properly created graph can report in a visual form?

I and II

II and III

I, II and III

Açıklama:

A properly created graph can report various information of the data in a visual form. For instance, the shape of the distribution of the data, the relation between two sets of data, the most repeating observations, outliers, peaks, summary statistics (minimum, maximum, range, mean, median) etc. can be identified from graphics. The correct answer is E.

Doğru Cevap: E

Soru 11

Which of the following is not true about the dot plot?

A dot plot is a one-dimensional scatterplot of observed values of a variable.

In order to create a dot plot one needs to identify the lowest and the highest value of the data set.

If there are repeating observations (multiple occurrences), the dots are stacked up vertically.

Dot plots tend to be useful to determine a vague point for location of center.

Dot plot will be very useful when working with large number of observations.

Açıklama:

Dot plot will be very useful when the number of objects in our study is rather small such as up to 50 observations. It is especially easy to identify the distribution of a set of data from a dot plot for small and moderate sample sizes. A dot plot is generally not useful for large sizes of data as it may not be possible to display all of the individual values with large datasets.

Doğru Cevap: E

Soru 12

Which of the following is not true about stem and leaf display?

A stem-and-leaf display was invented by Tukey (1977) as a method of displaying data.

A stem-and-leaf display is a type of graph for listing the numerical data.

A steam and leaf display doesn't give information about the grouped frequency distribution of the data.

A stem and leaf display is useful for assessing the location and spread of the distribution of the data.

A stem and leaf display can be useful to figure out the range, outliers, the most frequent values and the shape of the data.

Açıklama:

One of the important advantages of the stem-and-leaf display is that it gives the researcher a chance to create a grouped frequency distribution of the data without using any formula.

Doğru Cevap: C

Soru 13

Which of the following is not correct about bar charts?

The information about several subgroups of each category can be also shown by a grouped bar chart.

A bar chart is the graphical representation of frequencies by rectangles (or bars) with lengths (or heights) proportional to the frequencies of observations.

A simple bar chart is used to represent continuous values for each category.

A stacked bar chart is a bar chart where each bar is divided into subgroups proportional to the contribution a subgroup makes to an associated bar.

Bar graphs are typically used to compare counts, frequencies, the number of categories, objectives, amounts.

Açıklama:

Simple bar chart is used to represent discrete values for each category for a given variable on x-axis (horizontal).

Doğru Cevap: C

Soru 14

Which of the following is not true about histogram?

A histogram is drawn for continuous data.

In order to draw the histogram of the data, a large sample is needed.

A histogram gives information about the centre, shape and symmetry of the data.

A histogram can also be used to check out the normality.

A histogram doesn't give information about the peaks and extreme values.

Açıklama:

Histograms will help us to identify the centre, shape and symmetry of the data. A histogram can tell us about the peaks and extreme values..

Doğru Cevap: E

Soru 15

Which of the following is not true about the pie chart?

A pie chart is usually used for categorical
data.

In the pie chart components or outcomes of a total frequency is shown as sectors of a circle.

In the pie chart, the categories are divided into slices/sectors.

A pie chart usually shows the actual values.

The drawing of a pie chart involves the calculation of angles for each slice/sector.

Açıklama:

A pie chart usually does not show the actual values, therefore, it may easily become a misleading chart.

Doğru Cevap: D

Soru 16

Which of the following can be used to display the trends in continuous data over a period of time?

Pie chart

Line chart

Frequency polygon

Bar chart

Stem and leaf display

Açıklama:

Line chart is often used to display the trends in a continuous data over a period of time. Line chart also works well with discrete (ordered) or categorical types of data.

Doğru Cevap: B

Soru 17

Which of the following is used to investigate the relationship between two variables?

Scatter plot

Lina chart

Bar chart

Histogram

Pie chart

Açıklama:

A scatter plot is used to investigate the relationship between two variables. They are also very helpful indicating the minimum, maximum or outliers of the variables. One of the reasons that the scatter plots may be drawn is that the scatter plot gives a good indication of the correlation between two variables.

Doğru Cevap: A

Soru 18

Which of the following is more useful to discover the overall shape of the data?

Pie chart

Histogram

Dot plot

Line chart

Frequency polygon

Açıklama:

The frequency polygons are useful to discover the overall shape of the data (Is it symmetric or is there any asymmetry?). In order to create the frequency polygon, we use the midpoints of the bins (classes) in histogram vs the frequency of each bin. The midpoints are marked by a dot within each class interval. A straight line is used to connect the dots and so that lines are connected to each other.

Doğru Cevap: E

Soru 19

In which of the following components and outcomes of a total frequency is shown as sectors of a circle?

Line chart

Pie chart

Bar chart

Histogram

Scattered plot

Açıklama:

A pie chart is usually used for categorical data. In pie chart components or outcomes of a total frequency is shown as sectors of a circle. The shape resembles a pie, hence the name of the chart.

Doğru Cevap: B

Soru 20

Which of the following is not true about the scatter plot?

Scatter plot is used to investigate the change of a variable over a time.

To construct a scatter plot, two data sets or variables are needed, usually, these two data sets or variables are named as X and Y.

The pair of the data point for a specific observation, (X, Y), is represented by a dot or a symbol of convenience.

Scatter plot gives a good indication of the correlation between two variables.

Scatter plot is a good indicator of the value of the correlation coefficient.

Açıklama:

Scatter plot is used to investigate the relationship between two variables. They are also very helpful indicating the minimum, maximum or outliers of the variables.

Doğru Cevap: A

Soru 21

What type of data presentation method is described in the sentences below?

It is useful to determine a vague point for location of center and spread of data.

It is not useful for large sizes of data.

To use this type of plot, one needs to identify the lowest and the highest value of the data set first.

Dot Plot

Bar Chart

Scatter Plot

Histogram

Pie Chart

Açıklama:

Doğru Cevap: A

Soru 22

The figure above is a dot plot showing Mathematics grades in class A by gender. Which of the statements is not correct depending on the information above?

The highest grade of males is 70.

The highest grade of females is 90.

There are more female students than the male students in class A.

There are groupings around 60 and 70 in male students.

There are groupings around 60 and 75 in female students.

Açıklama:

There are 3 female students who got 70 and 3 who got 75, which is the highest number of females around these grades.

Doğru Cevap: E

Soru 23

Which one below gives us the difference between Dot plot and Stem-and-Leaf Display?

A stem-and-leaf display is a type of graph for listing the numerical data.

In stem-and-leaf display the original numbers are kept.

Stem-and-leaf display are drawn for continuous data.

Stem-and-leaf display is usually used for categorical
data.

Stem-and-leaf is often used to display the trends in a continuous data over a period of time

Açıklama:

A stem-and-leaf display is a type of graph for listing the numerical data and very similar to dot plot. If you remember in dot plot, dots are used to represent the each observation in our data. In stem-and-leaf display the original numbers are kept and a visual representation of data is created.
A pie chart is usually used for categorical data. Line chart is often used to display the trends in a continuous data over a period of time.

Doğru Cevap: B

Soru 24

“It gives the researcher a chance to create a grouped frequency distribution of the data without using any formula”
Which of the data presentation visual is mentioned in this sentence?

Dot Plot

Histogram

Pie Chart

Stem-And-Leaf Display

Scatter Plot

Açıklama:

One advantage of the stem-and-leaf display is that it gives the researcher a chance to create a grouped frequency distribution of the data without using any formula.

Doğru Cevap: D

Soru 25

Which one below is a type of bar chart where each bar is divided into subgroups proportional to the contribution a subgroup makes to associated bar?

Grouped Bar Chart

Stacked Bar Chart

Simple Bar Chart

Horizontal Bar chart

Vertical Bar Chart

Açıklama:

The stacked bar chart is a bar chart where each bar is divided into subgroups proportional to the contribution a subgroup makes to associated bar.

Doğru Cevap: B

Soru 26

Which one below is NOT correct about histograms?

Histograms will help us to identify the center, shape and symmetry of the data.

A histogram can tell us about the peaks and extreme values.

A histogram can be used to check out the normality.

You can think histograms as bar plots of grouped frequency distributions.

Histograms are drawn for qualitative data but not for continuous data.

Açıklama:

Histogram is a graph that is very similar to a bar chart except that bar charts are drawn for qualitative data but histograms are drawn for continuous data. Histograms will help us to identify the center, shape and symmetry of the data. A histogram can tell us about the peaks and extreme values, whether the distribution of data is skewed to the left, skewed to the right, bell-shaped, uniform or bimodal. A histogram can also be used to check out the normality.

Doğru Cevap: E

Soru 27

An investor needs to decide on that the power of Turkish Lira to make an investment in Turkey and wants to analyze the tendency of Turkish Lira versus a foreign currency. Which type of display is best for the investor?

A simple line chart

Pie chart

Histogram

Stem-And-Leaf Display

Scatter Plot

Açıklama:

In economics, the tendency of a foreign currency versus Turkish Lira may also be analyzed by a simple line chart, which may show a long term increase in the value of Turkish Lira against the foreign currency of interest. Therefore, an investor may decide that the power of Turkish Lira is increasing and it is high time to make an investment in Turkey.

Doğru Cevap: A

Soru 28

Which type of plot is used to display the relationship between two sets of variables or to make comparisons between two sets of data points?

Line chart

Pie chart

Histogram

Stem-And-Leaf Display

Scatter Plot

Açıklama:

Scatter plot is used to investigate the relationship between two variables. They are also very helpful indicating the minimum, maximum or outliers of the variables. One of the reasons that the scatter plots may be drawn is that scatter plot gives a good indication about the correlation between two variables.

Doğru Cevap: E

Soru 29

The chart below shows the relation between humidity and temperature for a certain period of time. What type of relationship is there between temperature and humidity on the days the data recorded according to this chart?

Both the humidity and temperature are increasing.

Both the humidity and temperature are decreasing.

While the temperature is decreasing and humidity is increasing.

There is a positive correlation between humidity and temperature.

There is no correlation between humidity and temperature.

Açıklama:

In this figure, the scattered data points indicate a negative correlation between two sets of data, here the values of y-axis are decreasing but as it decrease the values of the y-axis variable increases.

Doğru Cevap: C

Soru 30

The chart below shows the relationship between humidity and temperature for 25 days. Which option is correct depending on the information on the chart?

The higher the temperature, the higher the humidity.

The days with low humidity levels never have temperatures above 5 C degrees.

The highest humidity levels are at temperatures above 10 C degrees.

Total humidity levels for 25 days is the highest at temperatures between 0 C degrees and 2,5 C degrees.

The days with high humidity tend to have temperatures at 2 C degrees, 5 C degrees and 8 C degrees.

Açıklama:

This pattern indicates that the days with high humidity tend to have temperatures at 2 C degrees, 5 C degrees and 8 C degrees.

Doğru Cevap: E

Soru 31

The relation between two sets of data
The most repeating observations
Summary statistics (min, max, mean, median etc.)

Which of the above could be identified using graphics?

Only III

I and II

I and III

II and III

I, II and III

Açıklama:

Doğru Cevap: E

Soru 32

Can be drawn by hand easily while collecting data
Useful when observation count is less than 50
Generally used to investigate univariate data

Which of the following data display types do these features identify?

Dot plot

Stem-and-Leaf

Bar chart

Histogram

Pie chart

Açıklama:

Doğru Cevap: A

Soru 33

Which of the following is not true for dot plots?

Before creating a dot plot, lowest and highest values should be identified first

It is a simple chart that keeps each observation as a dot along horizontal axis

Repeating occurrences are represented as dots along horizontal axis

The data itself is kept within the graph so, its value could easily be identified by looking at the graph

It helps the researcher to quickly order the data

Açıklama:

In order to create a dot plot one needs to identify the lowest and the highest value of the data set first, then a horizontal axis is drawn and scaled so that it covers the lowest and highest values. A dot plot, essentially, is a simple chart where each observation is presented by a dot along the horizontal axis. If there are repeating observations (multiple occurrences), the dots are stacked up vertically. The dot plots will produce a simple graph of data but at the same time the data itself is never lost, you can easily identify the value of any data point in the dot plot. This is the most powerful aspect of the dot plots. It allows the researcher to show the data in a pictorial form without losing the original information/data. It also gives an opportunity to the researcher to quickly order the data.

Doğru Cevap: C

Soru 34

Divides values as greatest digits and remaining digits
Useful to visualize the range, outliers and most frequent values
Useful to assess the spread of the distribution of the data

Which of the following display types does these features belong to?

Dot plot

Stem-and-leaf

Bar chart

Pie chart

Histogram

Açıklama:

In stem-and-leaf display the original numbers are kept and a visual representation of data is created. Basically, stem-and-leaf display divides the values into a stem and leaf using a vertical line. The “stem” represents the greatest digits on the left of the line where the right of this line displays the “leaf ” with the remaining digits. This graph can be useful to figure out the range, outliers, the most frequent values and the shape of the data. It is also useful for assessing the location and spread of the distribution of the data.

Doğru Cevap: B

Soru 35

Which one of the following stem-and-leaf plot represents the data displayed in the Table above?

Açıklama:

If you want to create a stem-and-leaf display of this data, you need to decide what the stem should be, it is easy to see that all these numbers are the multiples of ten, so the numbers should be 14, 15, 16, and 17. Stems and the leaves should be ordered on each stem from smallest to largest. A vertical line/axis is drawn and on the left hand side of the line/axis stem values are shown as a new row, next we start putting each observation to the right hand side of vertical line according to trailing digits. At last step in each row, the numbers on the right hand side is ordered. Thus the stem-and-leaf plot should be

Doğru Cevap: A

Soru 36

Choose the appropriate bar chart for the given frequency table?

Açıklama:

Simple bar chart is used to represent discrete values for each category for a given variable on x-axis (horizontal). The y-axis (vertical) shows the actual numbers that are the bar heights for the corresponding category. Thus, the graph should be;

Doğru Cevap: B

Soru 37

Choose the appropriate pie chart for the bar graph above?

Açıklama:

DeA pie chart is usually used for categorical data. In pie chart components or outcomes of a total frequency is shown as sectors of a circle. The shape resembles to a pie, hence the name of the chart. In pie chart, the categories are divided in to slices/sectors. Each slices’ size is proportional to the total number of objects. The drawing of a pie chart involves the calculation of angles for each slice/sector. Find the calculation table and the chart below.

Doğru Cevap: E

Soru 38

Typically used to compare counts, frequencies, categories etc.
Represents the frequencies with rectangles by their lengths
Depending on the variable type, it is possible to create many types of it

Which of the following do these features represent?

Stem-and-leaf

Dot plot

Histogram

Bar chart

pie chart

Açıklama:

Bar chart or sometimes called bar graphs are typically used to compare counts, frequencies, total number of categories, objectives, amounts etc. It is used for the graphical representation of the qualitative data. Bar chart is the graphical representation of frequencies by rectangles (or bars) with lengths (or heights) proportional to the frequencies of observations. Depending on the variable type and grouping, it is possible to create many types of bar chart

Doğru Cevap: D

Soru 39

Which of the following is used to represent discrete values for each category for a given variable on horizontal axis while vertical axis show the actual numbers of each category?

Stem-and-leaf

Simple bar chart

Stacked bar chart

Grouped bar chart

Pie chart

Açıklama:

Doğru Cevap: B

Soru 40

Which of the following is used for Likert type items to visualize the contribution of each subgroup of a category?

Histogram

Stem-and-leaf

Stacked bar chart

Simple bar chart

Line chart

Açıklama:

The stacked bar chart is a bar chart where each bar is divided into subgroups proportional to the contribution a subgroup makes to associated bar. Likert type items are often represented by stacked bar chart.

Doğru Cevap: C

Soru 41

Used to represent continuous data
Usually used when the sample is large
Columns are adjacent to each other

Which of the following visualization types do these features represent?

Stem-and-leaf

Bar chart

Histogram

Pie chart

Line chart

Açıklama:

Histogram is a graph that is very similar to a bar chart except that bar charts are drawn for qualitative data but histograms are drawn for continuous data. In order to draw the histogram of the data, we usually need to have a large sample. If you remember from previous chapters, the data was classified in to grouped frequency distributions, basically you can think histograms as bar plots of grouped frequency distributions. If you create a grouped frequency distribution of the data, you can easily create the histogram of the same data. Similarly, by looking at a histogram one may easily create the grouped frequency distribution of the data. In bar charts the columns/bars are separated from each other by a convenient distance whereas in histogram the columns/bars are adjacent to each other.

Doğru Cevap: C

Soru 42

Usually used to represent categorical data

Divides categories as sectors

Each sector's size shows the proportion of each category to the total

Which of the following do these features belong to?

Simple bar chart

Stacked bar chart

Grouped bar chart

Pie chart

Line chart

Açıklama:

A pie chart is usually used for categorical data. In pie chart components or outcomes of a total frequency is shown as sectors of a circle. The shape resembles to a pie, hence the name of the chart. In pie chart, the categories are divided in to slices/sectors. Each slices’ size is proportional to the total number of objects.

Doğru Cevap: D

Soru 43

Which of the following is preferably used to visualize a trend of continuous data over time?

Histogram

Stem-and-leaf

Bar chart

Pie chart

Line chart

Açıklama:

Line chart is often used to display the trends in a continuous data over a period of time.

Doğru Cevap: E

Soru 44

Consider the pie chart constructed based on the data presented above. What is the angle of the studying?

120 degrees

90 degrees

60 degrees

45 degrees

30 degrees

Açıklama:

The total time for the activities is 24 hours. 360 degrees represent 24 hours. So, one hour represented with 360/24 degree=15 degree. The angle of studying equals to 15x4=60 degrees.

Doğru Cevap: C

Soru 45

The graph represents the data of the number of vehicles registered in traffic by year. Purple line represents the 2017 data and maroon line represents the 2018 data. According to this graph, which of the following is the month when most vehicles are registered to traffic in 2018?

Açıklama:

According to graphic, in 2018, during the second month of the year near 60 thousand vehicles, the fifth and seventh month of the year 100 thousand vehicles, and the tenth month of the year near 40 thousand vehicles are registered. However, during the first month near 120 thousand vehicles are registered. Thus, first month of the year is the month that the most vehicles are registered.

Doğru Cevap: A

Soru 46

2nd

10th

4th

2nd and 4th

1st and 2nd

Açıklama:

According to this graph, 2nd and 4th months are the months that equal number of vehicles are registered both in 2017 and 2018. During 2nd month nearly 60 thousand vehicles and during 4th month nearly 100 thousand vehicles were registered.

Doğru Cevap: D

Soru 47

According to the stem-and-leaf plot above, what is the range of the data?

402

909

Açıklama:

According to the plot, the maximum value of the data is 97 and the minimum value is 48. Range is the difference between the minimum and the maximum. Thus the rang is 97-48=49.

Doğru Cevap: B

Soru 48

Retrieved from https://ourworldindata.org/quality-of-education
The following plot represents the relation between the PISA reading scores and the United Nations' Human Development Index (HDI) for a select group of countries.
According to the scatterplot above and considering straight line that indicates relationship, which one of the following is true?

The straight line tends to have positive slope

The straight line tends to have negative slope

No relationship can be claimed between two sets of data

Straight line to indicate the negative correlation between two sets of data.

The is not any outlier data for statistical analysis.

Açıklama:

Scatter plot can also be used with a straight line to indicate the correlation between two sets of data. A regression line is added to scatter plot will show a very good indication about the direction of the relationship between two variables. The values of both variables are increasing, hence there is a positive relationship between these two variables. Thus, the straight line tends to have positive slope.

Doğru Cevap: A

Soru 49

Include the relative frequency of each bin (class) to your grouped frequency distribution
Determine the sample size
Draw a rectangular for each bin
Create the grouped frequency distribution of the data

To create a histogram for continuous data, sort the steps written above?

II, I, IV, III

IV, II, III, I

II, I, IV, III

I, II, III, IV

II, IV, I, III

Açıklama:

To create a histogram for continuous data, the following steps may be used:
1. Determine the sample size,
2. Create the grouped frequency distribution of the data
3. Include the relative frequency of each bin (class) to your grouped frequency distribution
4. Draw a rectangular for each bin.

Doğru Cevap: E

Soru 50

The graph shows the electricity, water and ADSL bills of a family. Which one of the following can be deduced, based on the data?

The highest bill is for March.

The highest bill is for January.

There is a negative relationship between electricity consumption and water consumption.

ADSL bill is rising from January to March.

Lowest bill is for ADSL.

Açıklama:

The graph indicates that higher consumption of electricity tend to have lower water consumption through months.

Doğru Cevap: C

Ünite 4

Soru 1

Mode is the.......value in a data set.
Which of the following correctly fills in the blank in the sentence above?

Minimum

Maximum

Average

Most frequent

Least frequent

Açıklama:

Mode is the most frequent value in a data set.

Doğru Cevap: D

Soru 2

What happens to arithmetic average of a data set if all observations are increased by 4?

It doesn't change.

It increases by 4.

It decreases by 4.

It increases by 2.

It decreases by 2.

Açıklama:

Suppose there are n observations; x₁, x₂, x₃, x₄.......x_n, and the arithmetic mean is k. Thus the sum of observations is n*k=(x₁+x₂+x₃+x₄.......+x_n). If we increase all observations by 4, the new values will be 4+x₁, 4+x₂, 4+x₃, 4+x₄.......4+x_n. Thus the new sum will be (4+4+...+4)+(x₁+x₂+x₃+x₄.......+x_n)=4n+nk=n(4+k). If we divide this sum by n then the new arithmetic mean will be 4+k.

Doğru Cevap: B

Soru 3

Which of the following is true for a left-skewed distribution as shown in figure?

Mean=Mode=Median

Mean=Median

Mean

Mean>Median>Mode

Mean

Açıklama:

For left-skewed distributions Mean

Doğru Cevap: C

Soru 4

The ages of children in a park are given as 10, 4, 8,10, 5, 6, 4, 5, 8,9. What is the median age of this group of children?

Açıklama:

First we have to reorder the ages in an increasing or decreasing order. Let's do it in an increasing order. Then the ages will be:
4,4,5,5,6,8,8,9,10,10
Since there are 10 observations the median will be the average of the 5th and 6th observations. Thus,
median=(6+8)/2=7

Doğru Cevap: E

Soru 5

The data on weight distribution of a certain group of men is given on the table above. What is the mean weight for this group?

Açıklama:

The mean can be calculated by multiplying the weight with their frequencies and summing them up.
Thus average (mean) weight=60*(30/100)+70*(40/100)+80*(20/100)+90*(10/100)=18+28+16+9=71

Doğru Cevap: A

Soru 6

The exam scores of 5 students taking the statistics course are given above. The weight of midterm exam is 40% and the weight of final is 60%. What will be the weighted average score of this group?

Açıklama:

The weighted average score is calculated by multiplying the weight of an exam and the score taken in that exam. Thus the scores of students are calculated as follows:
Ahmet: (40*0.4)+(80*0.6)=64
Mehmet: (80*0.4)+(60*0.6)=68
Ali: (40*0.4)+(90*0.6)=74
Asya: (70*0.4)+(90*0.6)=82
Suzan: (40*0.4)+(60*0.6)=52
Then the average score of the group is equal to (64+68+74+82+52)/5=68

Doğru Cevap: D

Soru 7

What is the geometric mean of the data set A=(6, 8, 9, 16, 36)?

Açıklama:

Geometric mean is found by the formula GM=(x₁x₂x₃....x_n)^1/n . So in this case
GM=(6*8*9*16*36)^1/5=12

Doğru Cevap: B

Soru 8

Bicycle types and their prices in a certain shop are given above. What is the midrange of the bicycle prices?

275

400

550

700

900

Açıklama:

Midrange is the average of the maximum and minimum value of observations. Thus for this bicycle prices the midrange is (900+200)/2=550

Doğru Cevap: C

Soru 9

A researcher wants to calculate the average income for a large group of people but he notices that there are a few extremely low and extremely high income levels. Thus he decides to find the average after eliminating the extreme values. In this case the researcher computes ........
Which of the following correctly fills in the blank in the sentence above?

Trimmed mean

Geometric mean

Arithmetic mean

Harmonic mean

Midrange

Açıklama:

The trimmed mean is an arithmetic mean of a data set without extreme values.

Doğru Cevap: A

Soru 10

Which of the following statements is true?

Mean of a left-skewed distribution is larger than the mode of it.

Median of a left-skewed distribution is larger than the mode of it.

Mean of a right-skewed distribution is smaller than the mode of it.

Mode and median of a normal distribution are larger than its' mean.

Mean, mode and median of a normal distribution are equal.

Açıklama:

Only the statement in E is true. For left-skewed distributions meanmedian>mode. But for normal distributions all these measures are same.

Doğru Cevap: E

Soru 11

What is the mode of the following data set?
10, 19, 11, 26, 26, 26, 18, 20, 35, 99, 11, 14, 18, 18, 26, 20, 48

Açıklama:

26 : occurs most often, it is repeated 4 times

Doğru Cevap: B

Soru 12

What is the median of the following data set?
4, 4, 4, 5, 7, 8, 10, 14, 18, 20, 26, 30

Açıklama:

9 : median = (12 + 1) / 2 = 6.5 th observation ; then average of 6th and 7th observations : (8 + 10) / 2 = 9

Doğru Cevap: B

Soru 13

What is the midrange of the following data set?
10, 19, 11, 26, 26, 26, 18, 20, 35, 98, 11, 14, 18, 18, 26, 20, 48

Açıklama:

Midrange = (x_min + x_max) / 2 = (10 + 98) / 2 = 54

Doğru Cevap: D

Soru 14

What is the arithmetic mean of the following data set?
4, 4, 6, 7, 8, 10, 11, 12, 18, 32

9.5

11.2

Açıklama:

arithmetic mean = sum of numbers / no of numbers = (4 + .. + 32) / 10 = 112 / 10 = 11.2

Doğru Cevap: D

Soru 15

What is the arithmetic mean of the following data set with weights in paranthesis?
20 (10 %), 40 (20 %), 60 (30 %), 80 (40 %)

Açıklama:

(20 x 10 % + 40 x 20 % + 60 x 30 % x + 80 x 40 %) / 4 = (2 + 8 + 18 + 32) / 4 = 60 / 4 = 15

Doğru Cevap: A

Soru 16

What is the geometric mean of the following data set?
8, 27, 64

Açıklama:

(8 x 27 x 64)^-3 = 2 x 3 x 4 = 24

Doğru Cevap: B

Soru 17

What is the arithmetic mean of the following data set with frequency in paranthesis? 25 (3), 35 (3), 45 (2)

33.75

37.5

82.5

Açıklama:

(25 x 3 + 35 x 3 + 45 x 2) / 8 = 270 / 8 = 33.75 . Correct answer is A.

Doğru Cevap: A

Soru 18

What is the mode of the following data set with frequency in paranthesis?
15 (2), 30 (4), 35 (3), 40 (3), 95 (2)

110

Açıklama:

30 : occurs most often, it is repeated 4 times

Doğru Cevap: A

Soru 19

What is the midrange of the following data set with frequency in paranthesis?
14 (2), 34 (4), 38 (3), 44 (3), 96 (2)

Açıklama:

Midrange = (x_min + x_max) / 2 = (14 + 96) / 2 = 55

Doğru Cevap: B

Soru 20

What is the median of the following data set with frequency in paranthesis?
12 (2), 32 (4), 35 (3), 43 (3), 92 (2)

Açıklama:

35 : median = (14 + 1) / 2 = 7.5 th observation ; then average of 7th and 8th observations : (35 + 35) / 2 = 35

Doğru Cevap: D

Soru 21

For an observation that results in values 6,7,3,4,3,5,6,6 what is the mode of this group?

Açıklama:

In order to find the mode of this raw data, first, let’s order the data from smallest to largest as follows 3,3,4,5,6,6,6,7
The number 3 is repeated 2 times, 4 is repeated 1 time, 5 is repeated 1 time, 6 is repeated 3 time, 7 is repeated 1 time. Therefore, for this data set, it can be said that the mode is 6. The answer is B.

Doğru Cevap: B

Soru 22

For an observation that results in values 4,2,6,7,3,4,3,5,6,6,3 what is the mode of this group?

3 and 4

3 and 6

Açıklama:

In order to find the mode of this raw data, first, let’s order the data from smallest to largest as follows 2,3,3,3,4,4,5,6,6,6,7
The number 2 is repeated 1 time, 3 is repeated 3 times, 4 is repeated 2 time, 5 is repeated 1 time, 6 is repeated 3 time, 7 is repeated 1 time. Therefore, for this data set, it can be said that there are two modes and these modes are 3 and 6. The answer is E.

Doğru Cevap: E

Soru 23

For an observation that results in values 6,7,3,4,3,5,6,6 what is the median of this group?

5,5

6,5

Açıklama:

In order to find the median of this raw data, first, let’s order the data from smallest to largest as follows 3,3,4,5,6,6,6,7
The location of the median in this ordered data is (8+1)/2 = 4,5th observation. Therefore, we
need to identify the 4th and 5th observations’ values in the ordered data, these are 5 and 6 respectively.
The Median is equal to (5+6)/2 = 5,5
The answer is B.

Doğru Cevap: B

Soru 24

For an observation that results in values 6,7,3,4,3,5,6,6 what is the arithmetic mean of this group?

4,5

5,5

Açıklama:

There are 8 observation in this data set. The sum of all the observation result's values is 6+7+3+4+3+5+6+6 = 40
If we divide this value by the number of observation, which is 8, we will find 40/8 = 5 which is the arithmetic mean of this data set. The answer is C.

Doğru Cevap: C

Soru 25

What is the mode of the following data set?
14, 17, 22, 2, 25, 4, 14, 6, 35, 93, 11, 14, 25, 18, 18, 25, 20, 14, 14, 48, 18, 25, 48

Açıklama:

14 : occurs most often, it is repeated 5 times. pg. 77. Correct answer is A.

Doğru Cevap: A

Soru 26

What is the median of the following data set?
2, 4, 2, 6, 7, 9, 11, 17, 18, 14, 26, 30, 2, 2

Açıklama:

14 : median = (14 + 1) / 2 = 7.5 th observation ; then average of 7th and 8th observations : (11 + 17) / 2 = 14. pg. 81. Correct answer is C.

Doğru Cevap: C

Soru 27

What is the midrange of the following data set?
11, 18, 10, 25, 25, 81, 26, 18, 20, 35, 14, 11, 18, 18, 25, 20, 48

Açıklama:

Midrange = (x_min + x_max) / 2 = (11 + 81) / 102 = 46. pg. 96. Correct answer is D.

Doğru Cevap: D

Soru 28

What is the arithmetic mean of the following data set?
2, 4, 4, 6, 7, 8, 10, 11, 14, 22, 28, 4

Açıklama:

arithmetic mean = sum of numbers / no of numbers = (2 + .. + 28) / 12 = 120 / 12 = 10 . pg. 85. Correct answer is C.

Doğru Cevap: C

Soru 29

What is the arithmetic mean of the following data set with weights in paranthesis?
60 (10 %), 50 (15 %), 70 (25 %), 90 (30 %), 80 (40 %)

Açıklama:

(60 x 10 % + 50 x 15 % + 70 x 25 % + 80 x 30 % x + 60 x 40 %) / 5 = (6 + 7.5 + 17.5 + 27 + 32) / 5 = 90 / 5 = 18. pg. 85. Correct answer is E.

Doğru Cevap: E

Soru 30

What is the geometric mean of the following data set?
64, 125, 216

105

120

128

144

Açıklama:

(64 x 125 x 216)^1/3 = 4 x 5 x 6 = 120. pg. 92. Correct answer is C.

Doğru Cevap: C

Soru 31

What is the arithmetic mean of the following data set with frequency in paranthesis?
20 (3), 32 (4), 47 (2), 10 (3)

32.5

40.5

Açıklama:

(20 x 3 + 32 x 4 + 47 x 2 + 10 x 3) / 12 = 312 / 12 = 26. pg. 87. Correct answer is B.

Doğru Cevap: B

Soru 32

What is the mode of the following data set with frequency in paranthesis?
11 (3), 24 (2), 27 (3), 33 (2), 39 (4), 42 (3), 97 (2)

108

Açıklama:

39 : occurs most often, it is repeated 4 times. pg. 81. Correct answer is C.

Doğru Cevap: C

Soru 33

What is the midrange of the following data set with frequency in paranthesis?
15 (2), 25 (4), 38 (3), 44 (3), 64 (3), 105 (2)

105

Açıklama:

Midrange = (x_min + x_max) / 2 = (15 + 105) / 2 = 60. pg. 96. Correct answer is C.

Doğru Cevap: C

Soru 34

What is the median of the following data set with frequency in paranthesis? 16 (3), 33 (1), 36 (2), 48 (2), 66 (4), 92 (4)

Açıklama:

57 : median = (16 + 1) / 2 = 8.5 th observation ; then average of 8th and 9th observations : (48 + 66) / 2 = 57. pg. 81. Correct answer is E.

Doğru Cevap: E

Soru 35

I. Most frequent value(s) is the arithmetic mean
II. Middle number in ordered data by magnitude is the median
III. Average number of values is the mode
Which of the definitions about central tendency measures is true?

Only I

Only II

I and II

I and III

II and III

Açıklama:

Most frequent value(s) is the mode. Middle number in ordered data by magnitude is the median. Average number of values is the arithmetic mean. The answer is B

Doğru Cevap: B

Soru 36

I. It is defined as the tendency of data to cluster around some random variable value.
II. Some central tendency measures are arithmetic mean, median and mode
III. Central tendency measures can tell us details about every piece of data.
What can be said to be true about central tendency measures?

Only I

Only II

I and II

I and III

II and III

Açıklama:

I. It is defined as the tendency of data to cluster around some random variable value. (True)
II. Some central tendency measures are arithmetic mean, median and mode. (True)
III. Central tendency measures can tell us details about every piece of data. (False, Central tendency measures do not tell us details about every piece of data.)
The answer is C.

Doğru Cevap: C

Soru 37

What is the geometric mean of the numbers 4, 10, and 25?

Açıklama:

The geometric mean is found by taking the n^th root of all the observation's multiplication. For 4, 10, 25 their multiplication is 4*25*10=1000 and the cubic root of 1000 is 10. The answer is A.

Doğru Cevap: A

Soru 38

For an observation that results in values 6,7,3,4,3,5,6 what is the midrange arithmetic mean of the group?

Açıklama:

Midrange is an arithmetic mean of the extremes in both end of the data set. It only needs the smallest
value and the largest value to be given. The arithmetic mean of the maximum and minimum value of the set gives the midrange. For the set 6,7,3,4,3,5,6 the minimum value is 3 and the maximum value is 7 therefore the midrange would be equal to (7+3)/2 = 5. The answer is A.

Doğru Cevap: A

Soru 39

For an observation that results in values 6,7,3,4,3,5,6 what is the 30% Winsorized mean of the group?

Açıklama:

The Winsorized mean can be calculated with the remaining values after replacing a certain number or the proportion of the values at the low and high end of the sorted data. First the data set is ordered from smallest to largest 3,3,4,5,6,6,7 then we will find how many data points will be replaced, k, since the winsorizing is 30% and there are 7 observations, the value of k is k = np = 7×0.30 = 2,1 which is nearly 2. Therefore, 2 observation from each end of the sorted data will be replaced with those next in magnitude, these observations are as follows, 3 is replaced with 4 and 7 and 6 is replaced with 6. The set yields to 4,4,4,5,6,6,6. The 30% Winsorized mean of the data set is (4+4+4+5+6+6+6)/7= 5. The answer is D.

Doğru Cevap: D

Soru 40

For an observation that results in values 6,7,2,4,3,5,6 what is the %30 trimmed (truncated) mean of the group?

Açıklama:

The trimmed or as it may sometimes be called as the truncated mean is a slightly modified version of the
arithmetic mean. Trimmed or truncated mean is calculated after a certain number or proportion of the lowest and highest observations from the sorted data are removed (trimmed) from the calculations. First, the data is ordered from smallest to largest, as follows, 2,3,4,5,6,6,7. Then we will find how many data points will be discarded, k, since the trimming is 30% and there are 7 observations, the value of k is k = np = 7×0.30 = 2,1 which is nearly 2. Therefore two 2,3,6 and 7 is removed from the set, which yields to 4,5,6. The sum, 15 is divided to n − 2k which is equal to 7 - 2*2=3 this results in the value 5.The answer is D.

Doğru Cevap: D

Soru 41

Which one of the following is an example of central tendency measures?

Variance

Mode

Range

Standard deviation

Correlation coefficient

Açıklama:

Central tendency is defined as “the tendency of data to cluster around some random variable value”. The position of the central value is measured by using central tendency measures such as arithmetic mean, median and mode. There are several names used to refer to central tendency in statistics such as “center of the distribution”, “central location”, “representative values”, “central position”, or “measures of location”.

Doğru Cevap: B

Soru 42

What is mode?

Middle number when the data is ordered from smallest to largest

Interval scale

Middle number when the data is ordered from largest to smallest

Ratio Scale

Most frequent value(s)

Açıklama:

Doğru Cevap: E

Soru 43

Which level of measurement is not suitable for Median?

Continuous

Discrete

Interval

Nominal

Ratio

Açıklama:

Doğru Cevap: D

Soru 44

Which one of the following is an average number of values?

Probability distribution

Arithmetic mean

Variance

Standard deviation

Binomial number

Açıklama:

Doğru Cevap: B

Soru 45

What is the disadvantage of arithmetic mean?

Influenced by outliers

Usually it is a big number

Usually it is a small number

Can not be calculated for ratio level of data

Can not be calculated for interval level of data

Açıklama:

Doğru Cevap: A

Soru 46

What is the mode of following data set: 1, 2, 2, 2, 3, 3, 3, 4, 4, 4, 4, 4, 4, 4, 4, 5, 5, 6, 6, 7, 9?

Açıklama:

The most repeated value is 4, therefore the mode is 4.

Doğru Cevap: D

Soru 47

What is the mode of the following frequency distribution created for Age variable?

Açıklama:

The highest frequency (18) is observed for 45, therefore mode is 45.

Doğru Cevap: D

Soru 48

How do we define a data set which has two modes?

Median

Bimodal

TripleModal

Nominal mode

Minimum mode

Açıklama:

If a data set has more than one mode, it refers to a multimodal distribution, indicating the frequencies of several observations with the similar highest frequencies. Specifically, when two values have the highest frequency in the data as in 1, 2, 2, 3, 4, 4, 5, 6, it is called as bimodal (2 and 4) distribution.

Doğru Cevap: B

Soru 49

What is the median of the following data: 2, 4, 4, 4, 5, 6, 7, 7, 8, 10?

2.5

4.5

5.5

8.5

10.5

Açıklama:

The data is already ordered. There are 10 observations:
(10+1)/2 = 5.5
so the median is the arithmetic mean of 5th and 6th observations:
median =(5 +6)/2=5.5

Doğru Cevap: C

Soru 50

What is the arithmetic mean of the following data: 2, 4, 5, 6, 7, 8, 9, 9, 9, 14?

3.3

4.3

5.3

7.3

10.3

Açıklama:

mean = total / n
mean = 73 / 10
mean = 7.3

Doğru Cevap: D

Soru 51

Which of the following refers to 'the average of a set of numbers in the data'?

Standard deviation

IQR

Mean

Correlation

z-value

Açıklama:

'Mean' refers to the average of a set of numbers in the data. The correct answer is C.

Doğru Cevap: C

Soru 52

What is the middle value of an ordered dataset called?

Median

Ratio

Tendency

Mode

Mean

Açıklama:

The median is the middle value of an ordered dataset. The correct answer is A.

Doğru Cevap: A

Soru 53

Which measure of central tendency is the best for nominal data sets?

Mode

Median

Arithmetic mean

Geometric mean

Trimmed mean

Açıklama:

The mode is the main centrality measure for nominal scales. The correct answer is A.

Doğru Cevap: A

Soru 54

What is 'the most frequent value in the entire data set' called?

Trimmed mean

Median

Geometric mean

Mode

Arithmetic mean

Açıklama:

Mode is the most frequent value in the entire data. The correct answer is D.

Doğru Cevap: D

Soru 55

6, 7, 8, 3, 3, 6, 9, 7, 3
What is the mode of the given data set?

Açıklama:

The mode is 3 because it is the value that occurs most often. The correct answer is A.

Doğru Cevap: A

Soru 56

28, 16, 14, 22, 12, 48, 10 What is the median of the given data set?

Açıklama:

10, 12, 14, 16, 22, 28, 48
16 is the middle value of the set. The correct answer is B.

Doğru Cevap: B

Soru 57

150, 95, 35, 42, 110, 60
What is the arithmetic mean of the given data set?

Açıklama:

150 + 95 + 35 + 42 + 110 + 60 = 492
492/6= 82
The correct answer is D.

Doğru Cevap: D

Soru 58

It is the best measure of central tendency for nominal data.

It is the middle value of an ordered dataset.

It can be preferred when the distribution is skewed.

Which of the given above about median is true?

Only II

I & II

I & III

II & III

I, II & III

Açıklama:

Median cannot be used for nominal data sets.
It is the middle value of an ordered data set and preferred when there are outliers and the distribution is skewed.
The correct answer is D.

Doğru Cevap: D

Soru 59

Which of the following can be estimated by using ogive curve?

Correlation

z-value

Median

Truncated Trimmed mean

Hypothesis value

Açıklama:

The median can also be estimated by using ogive curve (cumulative frequency polygon). The correct answer is C.

Doğru Cevap: C

Soru 60

Which of the following is TRUE about symmetric bell-shape distributions?

Only the mean and median are the same value.

The mean, median, and mode are all equal.

The median falls between the mean and mode.

The mean is smaller than median.

The mode is larger than the mean.

Açıklama:

In symmetric distributions like bell-shape and rectangular (or uniform)
ones, the mean and median are the same value. The correct answer is B

Doğru Cevap: B

Ünite 5

Soru 1

In a data set, what is the difference between the largest and smallest values called?

Range

Frequency

Variance

Skewness

Standard Deviation

Açıklama:

The range of a data set, shown as R, is the difference between the largest and smallest values and calculated as follows:
R = Largest Value - Smallest Value

Doğru Cevap: A

Soru 2

During a sales season, the fifteen salesmen in a computer company sold the following numbers of computers: 7, 11, 5, 12, 17, 6, 13, 9, 8, 4, 23, 12, 7, 6, 11. What is the range of the number of sold computers?

Açıklama:

The range of a data set, shown as R, is the difference between the largest and smallest values and calculated as follows: R = Largest Value - Smallest Value
Largest Value is 23
Smallest Value is 4
So, R=23-4 = 19.
The correct answer is B.

Doğru Cevap: B

Soru 3

The measures of ________ are another kind of descriptive statistics and give information about the shape of distribution of the observations.
Which option completes the blank in the description above?

Skewness

Variance

Standard Deviation

Interquartile Range

Box Plot

Açıklama:

Doğru Cevap: A

Soru 4

I. The mean is pulled in the direction of the tail.
II. The median falls between the mode and the mean.
III. The mean, median, and mode are all the same.
If the distribution is left skewed, having a long tail in negative direction and a single peak, which of the statements above are true?

Only I

I and II

I and III

II and III

I, II and III

Açıklama:

The measures of skewness are another kind of descriptive statistics and give information about the shape of distribution of the observations. A data set which is not symmetrically distributed is called skewed. The mainly observed shapes of distribution are symmetric, left skewed (negatively skewed), and right skewed (positively skewed). If the distribution is unimodal symmetric, the mean, median, and mode are all the same. If the distribution is left skewed, having a long tail in negative direction and a single peak, the mean is pulled in the direction of the tail, and the median falls between the mode and the mean. The correct answer is B.

Doğru Cevap: B

Soru 5

I. The central tendency measure of the distribution
II. A measure of the variability of the observations
III. The information of the distribution shape
Which pieces of information given above can be obtained from a box-plot?

Only I

Only II

I and II

II and III

I, II and III

Açıklama:

What information can we obtain from a box plot? The central tendency measure of the distribution is indicated by the median line in the box plot. A measure of the variability of the observations is given by the length of the box. Also, by examining the relative location of the median line, we can obtain about the information of the distribution shape. The correct answer is E.

Doğru Cevap: E

Soru 6

Which of the following statements is true according to the box-plot given below?

The median is 4 cigarettes.

The interquartile range is 2 cigarettes.

The shape of the distribution is right-skewed.

The values greater than 4 are the outliers.

50% of the workers consume less than 4 cigarettes.

Açıklama:

The median is 2 cigarettes. This means that 50% of the workers consume less than 2 cigarettes, and 50% of the workers consume more than 2 cigarettes. The 25th and 75th percentiles are 1 and 4 cigarettes, respectively. This means that 25% of the workers consume is either less than or equal to 1 cigarettes and 75% of the workers consume leither 4 or fewer than 4 cigarettes a day. The interquartile range is 4-1=3 cigarettes. This means that 50% of the workers consume 3 cigarettes. The highest and the lowest values within the upper and the lower boundaries for outliers are 8 and 0, respectively. There are outliers in the data. The values greater than 8 are the outliers. The shape of the distribution is right skewed, because the median line is closer to the 25th percentile and also the whisker over the 75th percentile is longer than the other whisker.The correct answer is C.

Doğru Cevap: C

Soru 7

Which option gives the correct terms to complete the blanks given in the graph in the correct order ( from 1 to 4)?

Left skewed- mode - median - mean

Left skewed - mean - median - mode

Right skewed - mode- median- mean

Left skewed - median - mean - mode

Right skewed - median - mean - mode

Açıklama:

1. Right skewed
2. Mode
3. Median
4. Mean
The correct answer is C.

Doğru Cevap: C

Soru 8

Which of the following is the most important and widely used measure of variability in statistics?

Range

IQR

Standart Deviation

Box Plot

Skewness

Açıklama:

In order to avoid the disadvantages and misleading of the range and the IQR, we need a measure of variability that is based on including all measurements in a data set. The most important and widely used measure of variability in statistics is the standard deviation. To determine the variation of a data set in terms of the amounts, we need to measure the how much each observation deviates from the mean. The variability of a data set is small if the observations are close to their mean, and large if the observations are deviated widely about their mean. The correct answer is C.

Doğru Cevap: C

Soru 9

The number of houses sold by each of the 10 estate agents during a particular 6 months period are 1, 4, 7, 6, 2, 10, 10, 3, 8, 9. What is the range of the data?

Açıklama:

The range of a data set, shown as R, is the difference between the largest and smallest values and calculated as follows: R = Largest Value - Smallest Value
R = 10-1 = 9. The correct answer is B.

Doğru Cevap: B

Soru 10

During a festival, the five bar tenders in the festival area sold the following number of beers in three days : 9540, 3600, 5566, 12345 and 8745. What is the range of the beers sold during the festival?

12345

9540

8745

5566

3600

Açıklama:

The range of a data set, shown as R, is the difference between the largest and smallest values and calculated as follows: R = Largest Value - Smallest Value
R= 12345 -3600 = 8745

Doğru Cevap: C

Soru 11

Which of the following is not true about range?

The range of a data set is shown as R.

Range is the difference between the largest and smallest values.

It depends only on the highest and lowest observations.

Range is heavily affected by these extremes values.

Range informs about the variability of the observations.

Açıklama:

The disadvantage of the range is that it depends only on the highest and lowest observations and it tells us nothing about the variability of the observations which fall between the two extremes.

Doğru Cevap: E

Soru 12

Which of the following is not true about percentiles?

The percentiles generally are demonstrated as P(m)

The percentiles take numbers between 0 and 100.

25th percentile is the second quantile.

75th percentile is the third quantile.

100th percentile is the fourth quantile.

Açıklama:

25th percentile is the first quantile.

Doğru Cevap: C

Soru 13

Which of the following is not true about box plot?

A measure of the variability of the observations is given by the length of the box.

The relative location of the median line gives information about the shape of the distribution.

If the median line is closer to the 25th percentile than the 75thpercentile, the distribution is right-skewed.

If the median line is closer to the 75th percentile than the 25thpercentile, the distribution is left-skewed.

If the median line is in the centre of the box, the distribution is not symmetric.

Açıklama:

If the median line is in the center of the box then we can conclude that the distribution is symmetric.

Doğru Cevap: E

Soru 14

The data set: [20,20,20,25,25,30,35,35,40,40,45,45] What is the interquartile range of the data set above?

Açıklama:

IQR = Q3 - Q1 = P(75) - P(25)=40-20=20

Doğru Cevap: D

Soru 15

Which of the following is not true about skewness?

A negative value near -3 shows that the distribution is considerably right-skewed.

The skewness gives information about the shape of the distribution.

If the distribution is unimodal, the mean, median and mode are the same.

If the distribution is left-skewed, having a long tail in a negative direction and a single peak

Pearson’s coefficient of skewness (PCS) can take values between -3 and 3.

Açıklama:

Pearson’s coefficient of skewness (PCS) can take values between -3 and 3. A negative value near -3 shows that the distribution is considerably left skewed and a positive value near 3 shows that the distribution is considerably left-skewed. If the PCS is near zero, this indicates that the distribution is symmetric because in this case the mean, the median, and the mode are similar.

Doğru Cevap: A

Soru 16

A sample data set is given as follows; [14,15,15,16,17,18,20,21,22]
Which of the following is the Pearson’s coefficient of skewness (PCS) value of the sample data set given above?

0.584

3.423

0.412

4.987

0.1151

Açıklama:

Median=17
Mod=15
Mean=17.56
Standard deviation= 2.877
PCS = X − mode /s
PCS= 3(X − Median)/s
PCS= 3*(17.56-17)/2.877=0.584

Doğru Cevap: A

Soru 17

The number of the books have been read by the students in a year are given as follows; 10, 11, 11, 12, 14, 17, 18, 19, 20.
What is the range of the number of books have been read by students in this study?

Açıklama:

R = Largest Value - Smallest Value
20-10=10

Doğru Cevap: C

Soru 18

The exam points of the students in a math course are as follows;
45, 53, 67, 74, 88, 88, 95, 95, 100.
Which of the following is the sample standard deviation of the data?

20.137

18.379

12.197

19.723

18.419

Açıklama:

= 19.723

Doğru Cevap: D

Soru 19

The height of the students in a classroom are as follows;
155, 156, 157, 160, 162, 165, 170, 180.
Which of the following is the median of the given data?

160

161

162

163

164

Açıklama:

There are 8 observations, we need the find the middle value, the average of 4th and 5th observations is going to give us the result therefore (160+162)/2=161 is what we are looking for.

Doğru Cevap: B

Soru 20

The box plot is given above shows the data about the number of hours spent on watching tv a day by students in a school.
Which of the following is not correct about the box plot data?

The median is two hours.

The interquartile range is three hours.

50% of the people watch three hours of TV a day.

The shape of the distribution is left-skewed.

There are outliers in the data.

Açıklama:

The shape of the distribution is right-skewed because the median line is closer to the 25th percentile and also the whisker over the 75th percentile is longer than the other whisker.

Doğru Cevap: E

Soru 21

During a sales season, the five salesmen in a mobile phone company sold the following numbers of mobile phone: 9, 15, 8, 24, 30. What is the range of the number of sold mobile phones?

Açıklama:

The range for grouped frequency distribution is calculated to be the difference of the upper limit of the highest value and the lower limit of the first class.
R=30-8=22

Doğru Cevap: A

Soru 22

A farming company collected data about the amount of wheat (tons) sold in the last week to different companies. The data is as follows, 12, 35, 14, 25, 24, 40, 25. What is the mean of this data set?

Açıklama:

Doğru Cevap: A

Soru 23

A farming company collected data about the amount of wheat (tons) sold in the last week. The data is as follows, 12, 35, 14, 25, 24, 40, 25. What is the standard deviation of the data set?

10.13

12.51

14.85

16.82

18.28

Açıklama:

Doğru Cevap: A

Soru 24

The frequency distribution table of the students’ performance scores of a school were constructed as follows. What is the sample mean of the data?

52,4

54,5

56,8

Açıklama:

Doğru Cevap: A

Soru 25

The frequency distribution table of the students’ performance scores of a school were constructed as follows. What is the sample standard deviation of the data?

19,4

21,3

25,6

27,9

30,2

Açıklama:

Doğru Cevap: A

Soru 26

The standard deviation of the students’ performance scores data is 20. What is the variance of the data?

350

360

370

380

400

Açıklama:

Doğru Cevap: E

Soru 27

A bakery employs 10 people. The number of years’ experience that the employees of the company have is the following: 0, 0, 2, 4, 5, 8, 15, 17, 20, 24. What is the 50^thpercentile of the data?

6,5

7,5

8,5

9,5

10,5

Açıklama:

Doğru Cevap: A

Soru 28

Which class interval does contain the median?

0 up to 20

20 up to 40

40 up to 60

60 up to 80

80 up to 100

Açıklama:

To determine the interval that contains the median, we must find the first interval for which the cumulative relative frequency exceeds 0.50. This interval is the one containing the median. For the following data, the interval from 40 up to 60 is the first interval for which the cumulative relative frequency exceeds 0.50, so this interval contains the median.

Doğru Cevap: C

Soru 29

A bakery employs 10 people. The number of years’ experience that the employees of the company have is the following: 0, 0, 2, 4, 5, 8, 15, 17, 20, 24. What is the 80^thpercentile of the data?

18.5

19.5

20.5

21.5

22.5

Açıklama:

Doğru Cevap: A

Soru 30

A bakery employs 10 people. The number of years’ experience that the employees of the company have is the following: 0, 0, 2, 4, 5, 8, 15, 17, 20, 24. What is the interquartile range of the data?

Açıklama:

Doğru Cevap: B

Soru 31

Which of the followings is not true about "range"?

In order to find the range of a data set, researcher need to identify only two characteristics, the largest and smallest values.

Although the range is easy to calculate and to understand, it is generally not a very useful measure of variability.

The disadvantage of the range is that it depends only on the highest and lowest observations.

The range of a data set is the difference between the largest and smallest values.

In grouped frequency distribution, the value of the range will only be the lowest value.

Açıklama:

In grouped frequency distribution, the value of the range will only be an approximate value.

Doğru Cevap: E

Soru 32

Which of the following is the most important and widely used measure of variability in statistics ?

Range

Percentiles

Standard Deviation

Interquartile Range

Mode

Açıklama:

VARIANCE AND STANDARD DEVIATION
In order to avoid the disadvantages of the range, we need a measure of variability that is based on including all measurements in a data set. The most important and widely used measure of variability in statistics is the standard deviation.

Doğru Cevap: C

Soru 33

Which of the following is generally demonstrated as P(m), where m is the number taking values between 0 and 100?

Range

Percentiles

Interquartile Range

Variance

Skewness

Açıklama:

The percentiles generally are demonstrated as P(m), where m is the number taking values between 0 and 100. Intuitively, the P(m) percentile of a set of n measurements, arranged in order of magnitude, is the value such m percent of the measurements are less than or equal to that corresponding value.

Doğru Cevap: B

Soru 34

Which of the following statement is not true about "percentiles"?

The percentiles generally are demonstrated as P(m), where m is the number taking values between 0 and 100.

Intuitively, the P(m) percentile of a set of n measurements, arranged in order of magnitude, is the value such m percent of the measurements are less than or equal to that corresponding value.

Some of the specific percentiles frequently used as variability measures are 25th, 50th, and 75th percentiles.

Various methods may be used for the calculation of percentiles.

In order to calculate the percentiles, the sample measurements must be sorted from largest to smallest.

Açıklama:

In order to calculate the percentiles, the sample measurements must be sorted in ascending order (from smallest to largest).

Doğru Cevap: E

Soru 35

Which of the following is the differences between the third and the first quartiles?

Percentiles

Range

Interquartile Range

Box Plot

Skewness

Açıklama:

The second variability measure is the interquartile range (IQR). The interquartile range is the differences between the third and the first quartiles.

Doğru Cevap: C

Soru 36

Which of the following information cannot be obtained from a box plot?

We can obtain the difference between the largest and smallest values.

The central tendency measure of the distribution is indicated by the median line in the box plot.

By examining the relative location of the median line, we can obtain about the information of the distribution shape.

We can obtain additional information about skewness from the lengths of the whiskers.

A general assessment can be made about the presence of outliers by examining the number of observations.

Açıklama:

BOX PLOT
The range of a data set is the difference between the largest and smallest values.

Doğru Cevap: A

Soru 37

Which of the followings is a kind of descriptive statistics and give information about the shape of distribution of the observations?

Quartiles

Variance

Standard Deviation

Skewness

Box Plot

Açıklama:

Doğru Cevap: D

Soru 38

During a sales season, the fifteen salesmen in a computer company sold the following numbers of computers: 7, 11, 5, 12, 17, 6, 13, 9, 8, 4, 23, 12, 7, 6, 11.
Which of followings is the range of the number of sold computers?

R = 23 - 4 = 19

R = 4 - 23 = 19

R = 19 - 7 = 23

R = 11 - 5 = 19

R = 12 - 8 = 23

Açıklama:

The range for grouped frequency distribution is calculated to be the difference of the upper limit of the highest class and the lower limit of the first class. Therefore, it will be an approximate value.

Doğru Cevap: A

Soru 39

The sample standard deviation and the sample variance for a frequency distribution can be calculated as follows.
s= ∑k fi(xi−X)2 i=1 n−1
s=∑k fi(xi−X)2 2 i=1 n−1
Which of the followings is not true related to this formula?

"k" is the number of classes/categories in the frequency distribution

"x" is the value of the i th class/category

"X" is the final result

"n" is the total frequency, n = ∑i=1 fi

ƒi is the frequency of the i th class/category

Açıklama:

VARIANCE AND STANDARD DEVIATION
X is the arithmetic mean

Doğru Cevap: C

Soru 40

A petrol company collected the data about the amount of fuel (tons) sold on two cities in a given Saturday. In each city the company has ten gas stations. The data is as follows.
City A: 3, 5, 6, 2, 7, 9, 8, 1, 4, 8
City B: 1, 4, 6, 2, 7, 19, 8, 1, 2, 18
Which of the followings is the sample mean of the city A?

1.3

2.3

3.3

4.3

5.3

Açıklama:

VARIANCE AND STANDARD DEVIATION
X =3+5+!+8=53=5.3 A 10 10

Doğru Cevap: E

Soru 41

Data : 4, 4, 5, 5, 5, 15, 15, 18, 19, 15, 4, 5, 14, 14, 17, 17, 26, 26, 22, 22. What is the range of this data ?

Açıklama:

22 = 26 - 4. pg. 109. Correct answer is E.

Doğru Cevap: E

Soru 42

Data : 0.2, 0.5, 0.17, 0.22, 0.26, 0.34, 0.39, 0.44, 0.55, 0.87, 1.1, 1.14, 2.5, 2.9, 4.4, 5.6. What is the 80th percentile of this data ?

2.9

2.5

4.4

3.45

1.14

Açıklama:

16 numbers ; k = 16 x 80 / 100 = 12.8 not integer ; (⟦k⟧ + 1) = 12 + 1 = 13 ; P(13) = 2.5. pg. 117. Correct answer is B.

Doğru Cevap: B

Soru 43

Data format : (class, class interval, frequency, cumulative frequency) ; data : (1, 100 up to 200, 5, 5), (2, 200 up to 300, 10, 14), (3, 300 up to 400, 16, 30), (4, 400 up to 500, 20, 50), (5, 500 up to 600, 10, 60) ; what is the 40th percentile of these data ?

355

357.5

360

362.5

365

Açıklama:

k = 60 x 40 / 100 = 24 ; k is in the 3rd class ; P(k) = 300 + (100 / 16) (24 - 14) = 300 + 62.5 = 362.5 . pg. 117. Correct answer is D.

Doğru Cevap: D

Soru 44

Data : 1.000, 1.000, 1.200, 1.200, 1.200, 1.400, 1.400, 1.500, 1.500, 1.500, 1.600, 1.600, 1.800, 1.800, 1.800, 2.000, 2.000, 2.500, 2.500, 2.500. What is the IQR of these data ?

500

600

700

800

900

Açıklama:

The interquartile range IQR = P(75) - P(25) ; k1 = 20 x (75 / 100) = 15 ; P(75) = (1.800 + 2.000) / 2 = 1.900 ; k2 = 20 x (25 / 100) = 5 ; P(25) = (1.200 + 1.400) / 2 = 1.300 ; IQR = 1.900 - 1.300 = 600 . pg. 120. Correct answer is B.

Doğru Cevap: B

Soru 45

Data : 100, 120, 120, 180, 200. What is the sample standard deviation of this data ?

5.525^1/2 / 3

6.625^1/2 / 3

8.275^1/2 / 3

7.475^1/2 / 2

9.250^1/2 / 2

Açıklama:

arithmetic mean : m = 720 / 5 = 145 ; s = ((45² + 25² + 25² + 35² + 55²) / 4)^1/2 = ((2.025 + 625 + 625 + 1225 + 3.025) / 4)^1/2 = (7.475 / 4)^1/2. pg. 111 . Correct answer is D.

Doğru Cevap: D

Soru 46

Data format : (class interval, frequency) data : (100 up to 120, 1), (120 up to 180, 2), (180 up to 200, 1), (200 up to 250, 1). What is the sample standard deviation of these data ?

4.850^1/2 / 3

7.525^1/2 / 2

8.875^1/2 / 3

6.675^1/2 / 2

9.625^1/2 / 3

Açıklama:

arithmetic mean : m = 850 / 5 = 170 ; s = ((60² + 20² + 20² + 10² + 55²) / 4)^1/2 = ((3.600 + 400 + 400 + 100 + 3.025) / 4)^1/2 = (7.525 / 4)^1/2. pg. 111. Correct answer is B.

Doğru Cevap: B

Soru 47

Data : 20, 12, 16, 12, 10 ; what is the Pearson’s coefficient of skewness of these data ?

0.80

-1.75

0.90

1.50

-1.20

Açıklama:

median : 12 ; arithmetic mean : m = 70 / 5 = 14 ; s = ((4² + 2² + 2² + 2² + 6²) / 4)^1/2 = 4 ; Pearson’s coefficient of skewness PCS = 3 (14 - 12) / 4 = 1.5 . pg. 122. Correct answer is D.

Doğru Cevap: D

Soru 48

Data : 20, 12, 16, 12, 10 ; what is the standard coefficient of skewness of these data ?

55 / 32

65 / 34

75 / 42

85 / 44

95 / 48

Açıklama:

a = 5 / (4 x 3) = 5 / 12 ; arithmetic mean : m = 70 / 5 = 14 ; s = ((4² + 2² + 2² + 2² + 6²) / 4)^1/2 = 4 ; standard coefficient of skewness SCS = (5 / 12) x ((4³ + 2³ + 2³ + 2³ + 6³) / 4³) = (5 / 12) ((64 + 24 + 216) / 64) = (5 / 12) (304 / 64) = (5 / 12) (76 / 16) = (5 / 12) (19 / 4) = 95 / 48. pg. 122. Correct answer is E.

Doğru Cevap: E

Soru 49

Which one of the following ones is included in a box plot ?

75th percentile

50th percentile

mean

standard deviation

mode

Açıklama:

75th percentile. pg. 121. Correct answer is A.

Doğru Cevap: A

Soru 50

Data : 12, 16, 10, 20, 12 ; what is the mean deviation of these data ?

3.2

4.6

5.2

4.4

3.8

Açıklama:

arithmetic mean : m = 70 / 5 = 14 ; s = (4 + 2 + 2 + 2 + 6) / 5 = 16 / 5 = 3.2 ; . pg. 112. Correct answer is A.

Doğru Cevap: A

Ünite 6

Soru 1

Which of the followings cannot be given as an example for random experiments?

Rolling a fair die.

Customers arriving at a particular store during some time interval.

Airplanes taking off in a given time interval at some airport.

Some particular customer requests in a bank.

Surveying an elemanatry level english classroom.

Açıklama:

A random experiment is any process that leads to two or more possible outcomes, without knowing exactly which outcome will occur.
For example, when a fair die is rolled we know that one of the six faces will show up but we will not be able to say exactly which face will actually show up. Thus, rolling a fair die is an example of a random experiment. Some further examples of random experiments are:
• Customers arriving at a particular store during some time interval.
• Airplanes taking off in a given time interval at some airport.
• Some particular customer requests in a bank.
Note that in a random experiment, although we do not know which outcome will occur, we are able to list or describe all of the possible outcomes.The set of all possible outcomes is important to understand the random experiment. Terefore, it is called a sample space, which is restated below as a definition.

Doğru Cevap: E

Soru 2

According to classical probability, when rolling a fair dice, what is the probability of obtaining the number 5?

5/6

4/6

3/6

2/6

1/6

Açıklama:

In the classical probability approach to assign a probability to an event, the assumption is that all the outcomes have the same chance of happening. Pick up a six-sided fair die, there are six numbers on each face of the die as 1, 2, 3, 4, 5, and 6. Classical probability says that each side of the die has the same chance to come face up if this die is thrown. Since there are 6 possible outcomes of throwing a six-sided fair die probability of obtaining any number represented on the faces of this six-sided fair die is 1/6. We can formularize this by following equation:
Probabilityof an Event = Thenumber of timestheevent can happen
/Total number of possibleoutcomes

Doğru Cevap: E

Soru 3

According to classical probability, when a fair dice is rolled, what is the probability of obtaining a number more than 4?

0.167

0.267

0.333

0.433

0.555

Açıklama:

Using the same idea, we can try another example, what is the probability of obtaining a number more than 4 if we throw a six-sided fair die? Remember on a six-sided fair die, all the outcomes have the same chance to appear, in this example the question makes a restriction of observing values more than 4. ere are only two outcomes to satisfy this restriction; those are the numbers 5, and 6; therefore, the probability we are looking for is
P(Morethan 4) = 2/6 = 1/3 = 0.333
The correct answer is C.

Doğru Cevap: C

Soru 4

Which probability approach uses the relative frequencies to assign the probabilities to the events?

Classical probability

Objective probability

Subjective probability

Positive probability

Empirical probability

Açıklama:

The empirical probability uses the relative frequencies to assign the probabilities to the events. The empirical probability is based on experiments. In order to find the probability of a specific event, the experiments are repeated many times and the observed outcomes of the event we are interested in is counted. The correct answer is E.

Doğru Cevap: E

Soru 5

According to classical probability approach, what is the probability of obtaining a number 6 when you throw a six-sided fair die?

0.167

0.267

0.333

0.467

0.533

Açıklama:

Classical probability says that each side of the die has the same chance to come face up if this die is thrown. Since there are 6 possible outcomes of throwing a six-sided fair die probability of obtaining any number represented on the faces of this six-sided fair die is 1/6. We can formularize this by following equation
Probability of an Event = The number of times the event can happen/ Total number of possible outcomes
P(Number 6) = 1 6 = 0.167
The correct answer is A.

Doğru Cevap: A

Soru 6

According to empirical probability, which of the following statements is not true?

It uses the relative frequencies to assign the probabilities to the events.

It is based on experiments.

To find the probability of a specific event, the experiments are repeated many times.

It states that all the outcomes have the same chance of happening.

In empirical probability, the past information becomes very important.

Açıklama:

Doğru Cevap: D

Soru 7

In how many different ways can the letters in U-S-U-A-L-L-Y be arranged?

Açıklama:

Note that there are five different letters, namely U, S, A, L, Y and that the letters U and L are used twice. If all the seven letters in the given word were different, the total number of arrangements would be 7!. Since all the arrangements of the two letters U1 and U2 , and all the arrangements of the two letters L1 and L2 should be counted only once, it follows that the answer is 7!/ 2!2! =15. The correct answer is C.

Doğru Cevap: C

Soru 8

A manager in a company has two assistant directors. The probability that the older assistant director comes late to work on a given day is 0.07, whereas for the younger assistant director this probability is 0.05. In addition, the probability that both assistant directors come late to work on given day is 0.03. What is the probability that on a given day one or both assistant directors come late to work?

0.87

0.97

1.07

1.17

1.27

Açıklama:

Note that this event corresponds to (O+Y -) , ( - O+Y - ) , ( - O+Y - ), and that (O+Y -) , ( - O+Y - ) , ( - O+Y - ) , (O+Y ) = S.
Therefore, P((O+Y - ) , ( - O+Y -) , ( -O+Y-)) = 1 - P(O+Y- ) = 1 - 0.03 = 0.97 The correct answer is B.

Doğru Cevap: B

Soru 9

A shop selling mobile phones has purchased four new mobile phones of the same brand and model. It is known that a mobile phone of this brand and model works without any problem for at least 2 years with probability 0.95. What is the probability that all three mobile phones will work without any problem for at least 2 years?

0.95

0.90

0.85

0.80

0.75

Açıklama:

Here, it is natural to assume that a failure of a mobile phone is independent from a failure of another mobile phone. Therefore, if Ai(i=1,2,3,4) denotes the event that the i-th mobile phone will work without any problem for at least 2 years, then the probability is given by
P(A1+A2+A3+A4) = P(A1)P(A2)P(A3)P(A4) = (0.90).
The correct answer is B.

Doğru Cevap: B

Soru 10

A manager of a café in a university campus assesses its customers as student, academic staff or visitor. She estimates that of all its customers 50% are students, 30% are academic staff and that 20% are visitors. It is known that purchases are made by 70% of student customers, by 60% of academic staff and by 30% of visitors. If a randomly chosen customer makes a purchase, what is the probability that this customer is a student?

0,74

0,54

0,59

0,95

0,059

Açıklama:

Define the following events
S: Customer is a student
A: Customer is academic staff
V: Customer is a visitor
B: Customer makes a purchase
Then we need to find P(S⏐B), which is given by, P(S⏐B)=P(B⏐S)P(S)/P(B⏐S)P(S)+ P(B⏐A) P(A)+ P(B⏐V)P(V) = 0.35/0.59 ≅ 0.593

Doğru Cevap: C

Soru 11

What is the probability of obtaining a number more than 3 if we throw a six-sided fair dice?

0,5

0,6

0,7

0,8

0,9

Açıklama:

Probability of an Event = The number of times the event can happen / Total number of possible outcomes
P (more than 3) = 3 / 6 = 0,5

Doğru Cevap: A

Soru 12

A six-sided fair dice has been thrown 1000 times and the occurrence of number 2 is 156. What is the empirical probability of obtaining a number 2 when you throw a six-sided fair dice?

0,156

0,159

0,240

0,300

0,500

Açıklama:

Empirical Probability of an Event = The number of times the event happens/Total number of observations
P (Number 2)= 156 / 1000 =0,156

Doğru Cevap: A

Soru 13

In ___________ approach, the researcher assigns a suitable value as the probability of the event.
Which of the following fills the blank correctly?

Mutual Probability

Classic Probability

Empirical Probability

Objective Probability

Subjective Probability

Açıklama:

Sometimes it may not possible to observe the outcomes of events; therefore, the researcher may assign a probability to an event. In subjective probability approach, the researcher assigns a suitable value as the probability of the event. Therefore, a personal judgment comes in to play to assign the probability. This approach is not favorable method to assign probability, but sometimes if there’s no previous knowledge on the subject then the researcher may assign a subjective probability as a starting point.

Doğru Cevap: E

Soru 14

If there are 20 different departments and 5 different elective courses, in how many different ways can a student be classified?

100

150

200

250

300

Açıklama:

Consider any two experiments of which the first experiment can result in n₁ and the second can result in n₂ possible outcomes. Then, considering both experiments together, there are n₁n₂ possible outcomes.
P=20x5=100

Doğru Cevap: A

Soru 15

In how many different ways can the letters in T H A N K S be arranged?

700

720

740

760

780

Açıklama:

All the six letters in the given word were different, the total number of arrangements would be 6!.
6!=720

Doğru Cevap: B

Soru 16

A company works with two supplier firms for the same raw material. The first firm's late delivery probability on a given day is 0.05, whereas for the second firm this probability is 0.07. In addition, both firms' late delivery probability on given day is 0.04. What is the late delivery probability of at least one of the firms to company on given day?

0.05

0.06

0.07

0.08

0.09

Açıklama:

P(OUY) = P(O) + P(Y) - P(O∩Y) = 0.05 + 0.07 - 0.04 = 0.08

Doğru Cevap: D

Soru 17

At a local district, there are three different pizza restaurants denoted here by A, B, and C. It is known that 40% of all customers give an order from company A, whereas 35% give an order from company B, and 25% give an order from company C. It is also known that 10% of the motorcycles from company A, 20% of the motorcycles from company B, and 5% of the motorcycles from company C need to a checkup before the next pizza delivery. What is the probability that a motorcycle returned to the restaurant needs a checkup before the next pizza delivery?

0,12

0,14

0,16

0,18

0,20

Açıklama:

P(A)=0.40 P(U/A)=0.10 P(U/A).P(A)=0.04
P(B)=0.35 P(U/B)=0.20 P(U/B).P(B)=0.07
P(C)=0.25 P(U/C)=0.05 P(U/C).P(C)=0.0125
P(U)=0,04+0,07+0,0125=0,12

Doğru Cevap: A

Soru 18

At a local district, there are three different pizza restaurants denoted here by A, B, and C. It is known that 40% of all customers give an order from company A, whereas 35% give an order from company B, and 25% give an order from company C. It is also known that 10% of the motorcycles from company A, 20% of the motorcycles from company B, and 5% of the motorcycles from company C need to a checkup before the next pizza delivery. If a motorcycle returned to the company needs a checkup before the next pizza delivery, what is the probability that this motorcycle was owned by company B?

0,58

0,62

0,74

0,81

0,93

Açıklama:

P(A)=0.40 P(U/A)=0.10 P(U/A).P(A)=0.04
P(B)=0.35 P(U/B)=0.20 P(U/B).P(B)=0.07
P(C)=0.25 P(U/C)=0.05 P(U/C).P(C)=0.0125
P(U)= 0,04+0,07+0,0125=0,12
P(B/U)=0,07/0,12=0,58

Doğru Cevap: A

Soru 19

At a local district, there are three different pizza restaurants denoted here by A, B, and C. It is known that 40% of all customers give an order from company A, whereas 35% give an order from company B, and 25% give an order from company C. It is also known that 10% of the motorcycles from company A, 20% of the motorcycles from company B, and 5% of the motorcycles from company C need to a checkup before the next pizza delivery. If a motorcycle returned to the company needs a checkup before the next pizza delivery, what is the probability that this motorcycle was owned by company A?

0,33

0,40

0,52

0,60

0,74

Açıklama:

P(A)=0.40 P(U/A)=0.10 P(U/A).P(A)=0.04
P(B)=0.35 P(U/B)=0.20 P(U/B).P(B)=0.07
P(C)=0.25 P(U/C)=0.05 P(U/C).P(C)=0.0125
P(U)= 0,04+0,07+0,0125=0,12
P(A/U)=0,04/0,12=0,33

Doğru Cevap: A

Soru 20

A six-sided fair dice has been thrown 3000 times and the occurrence of number 4 is 450. What is the empirical probability of obtaining a number 4 when you throw a six-sided fair dice?

0,15

0,20

0,25

0,30

0,35

Açıklama:

Empirical Probability of an Event = The number of times the event happens/Total number of observations
P (Number 4)= 450 / 3000 =0,15

Doğru Cevap: A

Soru 21

Consider the random experiment of tossing a fair coin until a tail (T) and a head (H) show up once. Describe the sample space of this random experiment ?

S = {H, T}

S = {HH, HT, TH, TT}

S = {HT, HT, HHT, HHHT}

S = {HT, TH, HHT, TTH, HHHT, ... }

S = {HT, TH, HHT, THH, HHHT, ... }

Açıklama:

S = {HT, TH, HHT, TTH, HHHT, ... }. pg. 133. Correct answer is D.

Doğru Cevap: D

Soru 22

A mixed basketball team (5 players) will be chosen from 8 male and 7 female players. This team should consist of at least 2 female and at least 2 male players, how many different teams are possible?

C(15, 5)

C(8, 3) + C(7, 2) + C(8, 2) * C(7, 3)

C(8, 3) * C(7, 2) + C(8, 2) * C(7, 3)

C(15, 3) * C(15, 2)

C(15, 3) + C(15, 2)

Açıklama:

C(8, 3) * C(7, 2) + C(8, 2) * C(7, 3) . pg. 137. Correct answer is C.

Doğru Cevap: C

Soru 23

A box contains 10 glasses : 4 red and 6 blue. Two glasses are selected randomly, without replacement, from this lot. What is the probability that the first selected glass is blue?

3 / 5

7 / 15

7 / 12

17 / 24

3 / 4

Açıklama:

S = {BR , BB , RB , RR} ; C(10, 2) = 10! / (2! * (10-2)!) = 10! / (2! * 8!) = 10 * 9 / 2 = 45 ; P(BR) + P(BB) = (6 / 10) * (4 / 9) + (6 / 10).* (5 / 9) = 6 * 4 / (10 * 9) + 6 * 5 / (10 * 9) = 4 / 15 + 1 / 3 = 9 / 15 = 3 / 5. pg. 137. Correct answer is A.

Doğru Cevap: A

Soru 24

Assume that 20 percent of Statistics course students in Anadolu University take Music course, 10 percent take Swimming course, and 4 percent take both Music and Swimming courses. What percentage of Statistics students neither take Music nor Swimming course?

Açıklama:

(S and M) = 20 % ; (S ansd Sw) : 10 % ; (S and M and Sw) = 4 % ; (S and (M or Sw)) = 26 % , (S and NOT(M and Sw) )= 100 - 26 = 74 % . pg. 138. Correct answer is B.

Doğru Cevap: B

Soru 25

The probability of an event A is 0.6, the probability of an event B is 0.3. The probability that neither A nor B occurs is 0.25. What is the probability that both A and B occurs?

0.35

0.30

0.25

0.20

0.15

Açıklama:

P(A or B) = 1 - 0.25 = 0.75 P(A and B) = P(A) + P(B) - 0.75 = 0..6 + 0.3 - 0.75 = 0.15 . pg. 138. Correct answer is E.

Doğru Cevap: E

Soru 26

A jar contains 4 blue, 5 yellow, and 6 pink balls. If 3 balls are selected randomly, without replacement, what is the probability that the 4th ball selected is blue, given that the first 3 balls are blue, pink, yellow, respectively ?

0.15

0.20

0.25

0.30

0.35

Açıklama:

3 / 12 = 1 / 4 = 0.25 . pg. 137. Correct answer is C.

Doğru Cevap:

Soru 27

The probability that Navigator A shows wrong way to an address is 0.09 and the probability that Navigator B shows wrong way to the same address is 0.07 and the probability that both Navigators show wrong way to the same address is 0.04. What is the probability that only one of the Navigators shows wrong way to the same address?

0.16

0.14

0.12

0.10

0.08

Açıklama:

P(A) - P(A and B) + P(B) - P(A and B) = 0.09 - 0.04 + 0.07 - 0.04 = 0.08 . pg. 138. Correct answer is E.

Doğru Cevap:

Soru 28

Events A, B, and C are all independent and that B and C are mutually exclusive events. P(A)=0.04, P(B)=0.03, and P(C)=0.02. What is the probability that events A and B will occur or A will not occur and C will occur?

0.0096

0.0128

0.0144

0.0204

0.0256

Açıklama:

P(A) * P(B) + P(not-A) * P(C) = 0.04 * 0.03 + 0.96 * 0.02 = 0.0012 + 0.0192 = 0.0204. pg. 138. Correct answer is D.

Doğru Cevap: D

Soru 29

Consider an ultrasound software for brain tumor diagnosis : a) the overall rate of the disease in the population being screened is 1 % ; b) the probability that a healthy person wrongly gets a positive result (false positive) is 0.05 ; c) the probability that an ill wrongly gets a negative result (false negative) is 0.002 ; d) other 2 situations are correctly diagnosing healthy persons (true negative) and correctly diagnosing ills (true positive). If test of a person A gives a positive result, what is the probability that person A actually have the disease?

0.0495

0.998 * 0.01 / 0.05948

0.06

0.099 * 0.02 / 0.064

0.095

Açıklama:

S = {people in the population being screened} , D = {have disease} , not-D = {do not have diesase} , pos = {positive result} , neg = {negative result} ; P (pos | not-D) = 0.05 , P (neg | D) = 0.002 , P (D) = 0.01 ; P (D | pos) = ? , P (D | pos) = P (pos | D) * P(D) / P(pos) (Bayes theorem) ; P (pos | D) = 1 - P ( neg | D) = 1 - 0.002 = 0.998 ; P (pos) = ? , P (pos) = P (pos | D) * P(D) + P (pos | not-D) * P(not-D) ; P (pos) = 0.998 * 0.01 + 0.05 * (1 - 0.01) = 0.00998 + 0.05 x 0.99 = 0.00998 + 0.0495 = 0.05948 ; P (D | pos) = 0.998 * 0.01 / 0.05948 ( = 0.168 = 16.8 % ) . pg. 141. Correct answer is B.

Doğru Cevap: B

Soru 30

60 % of students take Statistics 2 course after Statistics 1 course. 25 % of students takes Statistics 2 course without taking Statistics 1 course. What is the probability that a student who takes Statistics 2 course has not taken Statistics 1 course?

0.15

0.25

5 / 12

12 / 17

5 / 17

Açıklama:

25 / (25 + 60) = 25 / 85 = 5 / 17. pg. 141. Correct answer is E.

Doğru Cevap: E

Soru 31

What can be said true about the basic concepts of probability?

An event occurs if the random experiment results in one of the basic outcomes of that event.

A set of some of the possible outcomes of a random experiment is called the sample space.

Each possible outcome of a random experiment is called a sample space.

The set of all possible outcomes of a random experiment is called the elementary outcome.

A random experiment is any process that leads to a certain possible outcome.

Açıklama:

A random experiment is any process that leads to two or more possible outcomes, without knowing
exactly which outcome will occur. The set of all possible outcomes of a random experiment is called the sample space. Each possible outcome of a random experiment is called an elementary outcome. An event occurs if the random experiment results in one of the basic outcomes of that event. A is the correct answer.

Doğru Cevap: A

Soru 32

I. The complement of an event A is the set of all basic outcomes in S that do not belong to A.
II. The union of events A and B is the set of all elementary outcomes that belong to both sets.
III. The intersection of events A and B is the set of all elementary outcomes that belong to at least one of the sets A and B.
For A and B, which are any two events in a random experiment with sample space S, which of the statements are true?

Only I

Only II

I and II

I and III

II and III

Açıklama:

I. The complement of an event A is the set of all basic outcomes in S that do not belong to A. (True)
II. The union of events A and B is the set of all elementary outcomes that belong to both sets. (False, The intersection of events A and B is the set of all elementary outcomes that belong to both sets.)
III. The intersection of events A and B is the set of all elementary outcomes that belong to at least one of the sets A and B. (False, The union of events A and B is the set of all elementary outcomes that belong to at least one of the sets A and B.)
The answer is A.

Doğru Cevap: A

Soru 33

When is A and B, which are any two events in a random experiment with sample space S, can be said to be mutually exclusive?

A ∪ B = 0

A ∩ B = 0

A ∩ B = A

A ∩ B = A ∪ B

A ∪ B = A

Açıklama:

Sometimes it will be important to consider events that do not occur at the same time. That is the events whose intersection is the null event (empty set). If A and B are any two events then they are said to be mutually exclusive if A ∩ B = 0. The answer is B.

Doğru Cevap: B

Soru 34

If there were two dices thrown at the same time what would their total number of possible outcomes be?

Açıklama:

Each dice has a possible outcome of 6 in total {1,2,3,4,5,6}. If they are thrown at the same time the possible outcome would equal to 6*6 = 36. The answer is D.

Doğru Cevap: D

Soru 35

In how many different ways can the letters in P R O B A B I L I T Y be arranged?

110

11!

Açıklama:

When order is not important, the arrangements of r objects from n distinct objects is called a combination. The number of combinations of size r from a collection of n objects is denoted by C(n,r), and it is given by C(n,r)= P(n,r)/r! = n!/r!(n - r)! when 0≤r≤n.
For P R O B A B I L I T Y there are in total 11 letters but B and I is used twice so they should be counted only once.
r=9, n=11 therefore 11!/9!(11-9)! = (10*11)/2! = 55. The answer is B.

Doğru Cevap: B

Soru 36

I. For any event A⊆S, P(A)≥0.
II. For any event A, P(Ā) = 1 - P (A)
III. For any two events A and B, P(A,B) = P(A) + P(B)
Which of the probability axioms can be said to be true?

Only I

Only II

I and II

I and III

II and III

Açıklama:

I. For any event A⊆S, P(A)≥0. (True)
II. For any event A, P(Ā) = 1 - P (A) (True)
III. For any two events A and B, P(A,B) = P(A) + P(B) (False, For any two events A and B, P(A,B) = P(A) + P(B) - P(A+B) )
The answer is C.

Doğru Cevap: C

Soru 37

I. P(AnB) = P(A⏐B)P(B) this formula is called the multiplication rule.
II. A and B are statistically independent if and only if P(AnB) = P(A)P(B).
III. For A and B sets independence can also be denoted by P(A⏐B)

Only I

Only II

I and II

I and III

II and III

Açıklama:

I. P(AnB) = P(A⏐B)P(B) this formula is called the multiplication rule. (True)
II. A and B are statistically independent if and only if P(AnB) = P(A)P(B). (True)
III. For A and B sets independence can also be denoted by P(A⏐B). (False, P(A⏐B) denotes conditional probability for sets A and B.)
The answer is C.

Doğru Cevap: C

Soru 38

Which of the following terms define the set of all possible outcomes in a random experiment?

Sample space

Elementary outcome

Basic outcomes

Event

Subset of a sample

Açıklama:

The set of all possible outcomes of a random experiment is called the sample space.

Doğru Cevap: A

Soru 39

A random experiment is any process that leads to two or more possible outcomes, without knowing exactly which outcome will occur. Which one below is NOT an example of a random experiment?

Rolling a fair die

Customers arriving at a particular store during some time interval.

Airplanes taking off in a given time interval at some airport.

The students in the biggest classroom at school.

Some particular customer requests in a bank.

Açıklama:

Customers arriving at a particular store during some time interval.
Airplanes taking off in a given time interval at some airport.
Some particular customer requests in a bank.

Doğru Cevap: D

Soru 40

What does this equation tell us?

A and B are closely related two events

The events A and B intersects

A and B are mutually exclusive

A and B are complementary events

A and B are union of events

Açıklama:

If A and B are any two events then they are said to be mutually exclusive. The above equation shows this.

Doğru Cevap: C

Soru 41

Which of the statements below are correct?
I In classical probability, all the outcomes have the same chance of happening.
II In empirical probability, the experiments are repeated many times and the observed outcomes of the event we are interested in is counted.
III When it is not possible to observe the outcomes of events, the researcher applies the researcher assigns a suitable value as the probability of the event.
IV It is not appropriate to use personal judgement to assign the probability.

I and II

II and III

II, III and IV

I, II and III

II, III and IV

Açıklama:

In the classical probability approach to assign a probability to an event, the assumption is that all the outcomes have the same chance of happening.
The empirical probability uses the relative frequencies to assign the probabilities to the events. The empirical probability is based on experiments. In order to find the probability of a specific event, the experiments are repeated many times and the observed outcomes of the event we are interested in is counted.
Sometimes it may not possible to observe the outcomes of events; therefore, the researcher may assign a probability to an event. In subjective probability approach, the researcher assigns a suitable value as the probability of the event. Therefore, a personal judgement comes in to play to assign the probability. This approach is not favorable method to assign probability, but sometimes if there is no previous knowledge on the subject then the researcher may assign a subjective probability as a starting point. Once enough information about the probability of the event is collected then the researcher may revise this initial subjective probability.

Doğru Cevap: D

Soru 42

A engineer wants to choose the best company to work for. Before choosing the company, he classifies the companies according to their location and salary offered. There are 12 different locations and 10 different salaries. In how many different ways can companies be classified?

220

2200

120

1200

Açıklama:

Consider any two experiments of which the first experiment can result in n1 and the second can result in n2 possible outcomes. Then, considering both experiments together, there are n1n2 possible outcomes. This principle can also be generalized to a finite number of experiments.
According to the basic principle of counting, it follows that there are in total 10.12=120 different possible classifications.

Doğru Cevap: D

Soru 43

2 models will be selected to walk in a fashion show in France. There are 2 models from an Italian agency and 3 of them work for a Turkish agency. how many different possible outcomes are there?

Açıklama:

Let us denote the Italian models by X1 and X2, the Turkish models by Y1, Y2, and Y3. Then, there

Doğru Cevap: E

Soru 44

2 models will be randomly selected to walk in a fashion show in France. There are 2 models from an Italian agency and 3 of them work for a Turkish agency. If the event “one Turkish model Y and one Italian model X is chosen” is denoted by E, which one below is the listing the elements of E?

E= {X1X2, Y1Y2, Y1Y3, Y2Y3, X1Y1, X1Y2, X1Y3, X2Y1, X2Y2, X2Y3}

E = { X1Y1, X1Y2, X1Y3, X2Y1, X2Y2, X2Y3}

E = { X1Y1, X1Y2, X1Y3, X2Y1, X2Y2}

E = { X1X1, X1Y2, X1Y3, X2Y1, X2Y2, X2Y3}

E = { X1Y1, X1Y2, X1Y3}

Açıklama:

E = { X1Y1, X1Y2, X1Y3, X2Y1, X2Y2, X2Y3}

Doğru Cevap: B

Soru 45

2 models will be randomly selected to walk in a fashion show in France. There are 2 models from an Italian agency and 3 of them work for a Turkish agency. If the event “one Turkish model X and one Italian model Y is chosen” is denoted by E, what is the probability that event E occur?

0.4

0.5

0.6

0.7

0.07

Açıklama:

There are different possible outcomes and the sample space is
S = {X1X2, Y1Y2, Y1Y3, Y2Y3, X1Y1, X1Y2, X1Y3, X2Y1, X2Y2, X2Y3}
If the event “one brand X and one brand Y phone is chosen” is denoted by E, the elements of E are .
E = { X1Y1, X1Y2, X1Y3, X2Y1, X2Y2, X2Y3}
The probability that event E will occur.
P(E)=n(E):n(S) = 6:10 =0.6

Doğru Cevap:

Soru 46

When the occurrence or non-occurrence of an event A does not affect the occurrence of another event B, then we say that A and B are statistically ........ events?

irrelevant

codependent

independent

dependent

random

Açıklama:

When the occurrence or non-occurrence of an event A does not affect the occurrence of another event B, then we say that A and B are statistically independent events.

Doğru Cevap: C

Soru 47

Let E1, E2, ..., Ek be a collection of mutually exclusive and collectively exhaustive events. Then for any event A with P(A)≠0 and any i=1, 2..., k

What is the formula above?

Multiplication rule

Bayes’ Theorem

Random experiment

Elementary outcome

Independence probability

Açıklama:

An important application of conditional probability is given in the following result, which is known as Bayes’ Theorem.
Let E1, E2, ..., Ek be a collection of mutually exclusive and collectively exhaustive events. Then for any event A with P(A)≠0 and any i=1, 2..., k
P(Ei A)=P(Ei∩A)= P(AEi)P(Ei)
P (A) P (A E1 )P (E1 )+ P (A E2 )P (E2 )+!+ P (A Ek )P (Ek )

Doğru Cevap: B

Soru 48

For two sets A and B the probability of A is 0.15 and the probability of B is 0.25 if P(A|B) is equal to 0.30 what is the value of P(B|A)?

0.05

0.10

0.125

0.18

0.5

Açıklama:

According to the Bayes's Theorem P(B|A) is equal to [P(A|B)*P(B)]/P(A). The values are given as so P(A|B)=0.30, P(B)=0.25 and P(A)=0.15. P(B|A)=(0.30*0.25)/(0.15) = 0.5. The answer is E.

Doğru Cevap: E

Soru 49

For two sets A and B the probability of B is 0.45 and P(A|B) is equal to 0.20 what is the value of P(A∩B)?

2.25

0.90

0.44

0.10

0.09

Açıklama:

According to the multiplication rule P(A∩B) = P(A⏐B)*P(B). The values are given as P(B)=0.45 and P(A⏐B)=0.20. P(A∩B) = (0.45)*(0.20) = 0.09. The answer is E.

Doğru Cevap: E

Soru 50

What is the probability of two dices landing on numbers that when sum up is equal to 6?

1/6

1/12

5/6

5/36

2/6

Açıklama:

For two dices the total number of possible outcomes is equal to 6*6 = 36. For the numbers' sum to be equal to 6 the number has to be (1,5), (2,4), (3,3), (4,2), (5,1) which shows that the number of times the event happening is equal to 5. The probability of two dices landing on numbers that when sum up is equal to 6 is 5/36. The answer is D.

Doğru Cevap: D

Soru 51

Which is NOT TRUE about a random experiment?

It is any process that leads to two or more possible outcomes

We are able to know which outcome will occur

We are able to list or describe all of the possible outcomes

The set of all possible outcomes is important to understand

Its set of all possible outcomes is called the sample space

Açıklama:

A random experiment is any process that leads to two or more possible outcomes, without knowing exactly which outcome will occurIn a random experiment, Although we do not know which outcome will occur, we are able to list or describe all of the possible outcomes. The set of all possible outcomes is important to understand the random experiment. Therefore, it is called a sample space, which is restated below as a definition.

Doğru Cevap: B

Soru 52

Which is TRUE aboout the classical probability?

Uses the relative frequencies to assign the probabilities to the events

It is based on experiments

The assumption is that the outcomes have the same chance of happening

The researcher assigns a suitable value as the probability of the event.

A personal judgement comes in to play to assign the probability.

Açıklama:

Doğru Cevap: C

Soru 53

We have a box containing only red and blue balls. If we randomly pick up a ball from this box, it could only be either a red ball or a blue ball, the selected ball cannot be a ball of red and blue at the same time. The occurrence of one event dictates that none of the other events can occur at the same time.
How is this event called?

Multiplication rule

Statistically independent

Empirical probability

Subjective Probability

Mutually exclusive

Açıklama:

In probability, if the occurrence of one event dictates that none of the other events can occur at the same time, we call this event mutually exclusive events.

Doğru Cevap: E

Soru 54

In a study about employess, employees are classified according to the department they work in and their preference of training coursec. If there are 12 different departments and 7 different courses, in how many different ways can an employee be classified?

Açıklama:

In a study about college students, students are classified according to their department and preference of language course. If there are 10 different departments and 5 different language courses, in how many different ways can a student be classified? According to the basic principle of counting, it follows that there are in total 10.5=50 different possible classifications.
In a study about employess, employees are classified according to the department they work in and their preference of training courses. If there are 12 different departments and 7 different courses, according to the basic principle of counting, itf ollows that there are in total 12.7=84 different possible classifications.

Doğru Cevap: C

Soru 55

How many different ordered arrangements of the four letters W,X, Y and Z are there?

Açıklama:

When order is important, we call the arrangements of a finite number of distinct objects a permutation. For example, there are a total of 3!=6 different ordered arrangements of the three letters A,B, and C. If there are four letters (W,X,Y,Z) there are a total of 4!=24 different ordered arrangements.

Doğru Cevap: D

Soru 56

How many different ways can the letters in the word COMMON be arranged?

180

360

120

720

Açıklama:

This is a permutation with repeated items. If all the seven letters in the given word were different, the total number of arrangements would be 6!. Since all the arrangements of the two letters C1 and C2 , and all the arrangements of the two letters M1 and M2 should be counted only once, it follows that the answer is: 6!/2!2!=180

Doğru Cevap: B

Soru 57

We know that event B has occurred and we are interested in finding the probability of event A. In other words, we are interested in finding the probability of A knowing that event B has occurred. Which of the following formula denotes the probability? We know that event B has occurred and we are interested in finding the probability of event A. In other words, we are interested in finding the probability of A knowing that event B has occurred. Which of the following formula denotes the probability?

P(A∩B) = P(A⏐B)P(B)

P(A∩B) = P(A⏐B)P(A)

P(A∩B) = P(A⏐B)

P(A∩B) P(A)= P(A⏐B)

P(A∩B) = P(A⏐B)P(B)P(A)

Açıklama:

We know that event B has occurred and we are interested in finding the probability of event A. That is, we are interested in finding the probability of A knowing that event B has occurred.The formula for this is as follows: P(A∩B) = P(A⏐B)P(B)

Doğru Cevap: A

Soru 58

There are 36 cards in a box with three different colors-6 yellow, 12 blue, 18 red. What is the probability of obtaining two yellow cards when randomly drawing two cards?

1/42= 0.02381

5/216 = 0.02315

1/35 = 0.02857

4/35 = 0.11429

9/35 = 0.25714

Açıklama:

This is how we formulate the probability: P(A1∩ A2 ) = P(A2|A1 )P(A1 ) = P(A1 )P(A2|A1 )
This is how we calculate the probability of drawing two yellow cards: 6/36.5/35=1/42= 0.02381

Doğru Cevap: A

Soru 59

Which of the following is a feature of Bayes’ Theorem?
I.Using Bayes’ Theorem, we can find the probability that a defective item is produced by a particular machine.
II.It enables us to compute a particular conditional probability.
III. It is defined as the arrangements of a finite number of distinct objects.

I, III

III

I, II

Açıklama:

Bayes’ Theorem is an important application of conditional probability. This theorem enables us to compute a particular conditional probability. Using Bayes’ Theorem, we can find the probability that a defective item is produced by a particular machine,

Doğru Cevap: D

Soru 60

What is the formula P(A∩B) = P(A⏐B)P(B) called?

Classical Probability

Empirical Probability

Subjective Probability

Multiplication rule

Bayes’ Theorem

Açıklama:

Suppose that we know that event B has occurred and we are interested in finding the probability of event A. That is, we are interested in finding the probability of A knowing that event B has occurred. The formula specified is actually called the multiplication rule and has important applications.

Doğru Cevap: D

Soru 61

According to classical probability approach, what is the probability of obtaining a number less than 3 if we throw a six-sided fair die?

0.167

0.267

0.333

0.467

0.567

Açıklama:

Remember on a six-sided fair die, all the outcomes have the same chance to appear, in this example the question makes a restriction of observing values less than 3. There are only two outcomes to satisfy this restriction; those are the numbers 1, and 2; therefore, the probability we are looking for is
P(less than 3) = 2/6 = 1/3 = 0.333

Doğru Cevap: C

Ünite 7

Soru 1

Which of the following is a discrete random variable?

A time that is spent to complete a task

The weight of a newborn baby

The height of randomly selected students in a classroom

The number of arrivals at an emergency room between midnight and 6:00 a.m.

The duration of the next outgoing telephone call from a business office.

Açıklama:

Discrete random variables have only a countable number of separate values such as 0, 1, 2 , 3... etc. For example, the number of students in a class for certain day or the number of customers in a supermarket after 5:00 PM are cases for discrete random variables since these variables are finite and countable. Conversely, continuous random variable can take entire infinite values in a given interval. Because of this reason, continuous random variables are commonly measured instead of counted. For instance, waiting time for customers in a supermarket cashier line and travel time of a bus between two points are examples for continuous random variables. The correct answer is D.

Doğru Cevap: D

Soru 2

Which of the following is a continuous random variable?

The number of new cases of influenza in a particular city in a month

The number of accident-free days in a year at a building site

The amount of rain recorded at an airport in a week

The number of passengers in a bus on a highway at rush hour

The number of clerical errors on a medical chart

Açıklama:

Doğru Cevap: C

Soru 3

I. The air pressure on a tire on an automobile II.The number of students who actually register for classes III. The amount of liquid in a can of cola IV. The temperature of a cup of coffee Which of the variables above are examples of a continuous random variable?

I and II

I, II and III

I and III

I, II ve IV

I, III and IV

Açıklama:

Doğru Cevap: E

Soru 4

What is the set for the possible values of the random variable stated below?
"The number of coins that match when three coins are tossed at once."

{1,2}

{2,3}

{0,1}

{0,1,2}

{1, 2, 3}

Açıklama:

When three coins are tossed at once, some of the possibilities of head or tail can be listed as below:
H(Head) T(Tail)
HHH
TTT
HHT
HTT
When these possibilities are taken into consideration either two or three coins may have the same match. So {2,3}
is the correct answer.

Doğru Cevap: B

Soru 5

I. It carries a unique numerical value.
II. It is determined by random probability experiment and its associated outcome
III. In statistical notation, random variables represented by lower case such as x and y.
Which of the following statements above are TRUE for random variables?

I, II and III

I and II

I and III

Only I

Only II

Açıklama:

In statistical notation random variables represented by capital letters such as X,Y and so on. So number III is false. Number I and II are true. The correct answer is B.

Doğru Cevap: B

Soru 6

"__________is a widely employed discrete probability distribution in statistics where a set of independent observations constitutes exactly two disjoint outcomes of a trial."
Which option completes the definition given above?

Binomial Distribution

Cumulative Distribution

Standard Deviation

Poisson Distribution

Hypergeometric Distribution

Açıklama:

Binomial distribution is a widely employed discrete probability distribution in statistics where a set of independent observations constitutes exactly two disjoint outcomes of a trial. Therefore in binomial distribution, an outcome of a random experiment can be classified under two different categories. For example, when a die is tossed once we observe six different numbers, that is x = 1, 2, 3, 4, 5 and 6. If we classified these outcomes as even or odd numbers then we will get two different outcomes. Likewise we can separate the turnover of a company into two categories, such as above and below the target level. Also, we can separate the exam grades simply into two categories, satisfactory and poor. The correct answer is A.

Doğru Cevap: A

Soru 7

"For binomial distribution, its principal assumptions are n independent trials with two possible outcomes (success or failure) for each trial, and the success probability remains constant for each trial. On the other hand __________ distribution doesn’t involve with independence assumption for each trial and accordingly the sampling process is established on without replacement. Because of these features, it is broadly used in various real life applications especially for acceptance sampling in quality control."
Choose the correct option to complete blank in the paragraph given above.

Binomial

Poisson

Hypergeometric

Standard

Variance

Açıklama:

In binomial distribution, its principal assumptions are n independent trials with two possible outcomes (success or failure) for each trial, and the success probability remains constant for each trial. Therefore in binomial distribution the sampling process is carried out with replacement. On the other hand hypergeometric distribution doesn’t involve with independence assumption for each trial and accordingly the sampling process is established on without replacement. Because of these features, hypergeometric distribution is broadly used in various real life applications especially for acceptance sampling in quality control.The correct answer is C.

Doğru Cevap: C

Soru 8

Some of the illustrations of random variables that generally conform to model by means of Poisson distribution are presented below.Which illustration does NOT conform to this model?

The number of customers served by automated teller machine in a day.

The amount of time that customers spent using the automated teller machine in a day

The number of patients in the hospital on a given day

The number of diseased strawberry plants in ten acres field

The number of telephone calls received by a technical support center in a week.

Açıklama:

The Poisson distribution is widely used for discrete probability distribution which is used to model the number of outcomes occurring during a specified time interval or in a definite region.Option B is not a discerete random variable. It is a continuous random variable. The correct answer is B.

Doğru Cevap: B

Soru 9

What is a central tendency measure of the probability distribution?

Arithmetic mean

Standart deviation

Variance

Cumulative Distribution Function

Hypergeometric Distribution

Açıklama:

Arithmetic mean and variance are frequently used to summarize the features of probability distributions for a discrete random variable X. The arithmetic mean is a central tendency measure of the probability distribution and the variance is a measure of the dispersion or variability for a data set. The correct answer is A.

Doğru Cevap: A

Soru 10

I. The time a person spends on reading a day
II. The amount of water a person drinks a day
III. The weight gain of a person in a month
Which of the variables given above are examples of a continuous random variable?

Only I

Only II

I and II

II and III

I,II and III

Açıklama:

Doğru Cevap: E

Soru 11

I. The number of students in a class.
II. The time spent on doing an assignment.
III. The height of the students.
IV. The number of laptops in a school.
Which of the variables above is an example of a discrete random variable?

I and II.

I and IV.

All of them

None of them

II and III.

Açıklama:

Discrete random variables have only a countable number of separate values such as 0, 1, 2, 3... etc.

Doğru Cevap: B

Soru 12

Which of the following is a discrete random variable?

The sitting time interval of students in a library.

The weight of the books in a library.

The heat of a library.

The height of the students in a library.

The blood pressures of the students in a library.

Açıklama:

Discrete random variables have only a countable number of separate values such as 0, 1, 2 , 3... etc. In addition to above examples, the number of students in a class for certain day or the number of customers in a supermarket after 5:00 PM are cases for discrete random variables since these variables are finite and countable. Conversely, continuous random variable can take entire infinite values in a given interval. Because of this reason, continuous random variables are commonly measured instead of counted.

Doğru Cevap: E

Soru 13

X=x 1 2 3 4 P(X=x) 0,20 0,25 0,30 0,30 Which of the following is the mean (μ) for the probability distribution given above?

2,5

2,8

2,6

Açıklama:

Ux= E(X)=1.02+2.0,25+3.0,3+4.0,30=2,8

Doğru Cevap: D

Soru 14

A support centre receives 8 calls per hour and the number of the calls follows the Poisson distribution.
Which of the following is the probability exactly 6 calls in an hour?

0,01

0,02

0,03

0,04

0,05

Açıklama:

0,01

Doğru Cevap: A

Soru 15

I. The number of people in a mall. II. The speed of a car. III. The exam points of students in a class. Which of the variables above continuous random variable?

I and II

I, II, and III

II and III

III

Açıklama:

The number of people in a mall can be countable while the other variables are measured.

Doğru Cevap: D

Soru 16

Consider the probability distribution for the random variable X given below and determine the probability of P (1.5 < X ≤ 4) =?

0,4

0,5

0,35

0,45

0,70

Açıklama:

0,5

Doğru Cevap: B

Soru 17

Which of the following is not true about binomial distribution?

Independent observations constitute exactly two disjoint outcomes of a trial

An outcome of a random experiment can be classified into two different categories

A random experiment (trial) with only two possible outcomes is called a Bernoulli trial.

Binomial random variable represented as X ∼ Binomal (n, p)

Binominal distribution is used to model the number of outcomes occurring during a specified time interval or in a definite region

Açıklama:

The Poisson distribution is widely used for discrete probability distribution which is used to model the number of outcomes occurring during a specified time interval or in a definite region.

Doğru Cevap: E

Soru 18

X=x	1	2	3	4
P(X=x)	0,20	0,30	0,30	0,20

Which of the following is the mean of given data above?

2,5

3,5

Açıklama:

μ = E(X) = xP(X = x), x = 1,2,3,4, = 1⋅P(X =1)+ 2⋅P(X = 2)+ 3⋅P(X = 3)+ 4 ⋅P(X = 4)
μ = E(X) = 1⋅0.20+ 2⋅0.30+ 3⋅0.30+ 4 ⋅0.20
μ = E(X) = 0.20+ 0.60+ 0.90+ 0.80 = 2.5

Doğru Cevap: C

Soru 19

X=x	1	2	3	4
P(X=x)	0,20	0,30	0,30	0,20

Which of the following is the variance of given data above?

4.25

4.15

4.20

4.50

Açıklama:

E(X²) = 1²P(X =1)+ 2²⋅P(X = 2)+ 3²⋅P(X = 3)+ 4²⋅P(X = 4)
E(X²) = 1⋅0.20+ 4 ⋅0.30+ 9⋅0.30+16⋅0.40 = 10.50
σ ²=V(X) = E(X²)−[E(X)]²=10.50 − (2.5)²= 4.25

Doğru Cevap: B

Soru 20

X=x	1	2	3	4
P(X=x)	0,20	0,30	0,30	0,20

Which of the following is the standard deviation of the given data above?

1.9

1.95

2.015

2.0615

Açıklama:

σ= 2.0615

Doğru Cevap: E

Soru 21

Which of the following is a discrete random variable?

The rainfall in a area over years

Waiting time in a phone banking system

Length of trees in a certain forest

The water level in a certain river during a year

Number of students taking statistics course over years

Açıklama:

Things measured by time, volume, length, height etc are type of contionus variables, but things meauser by numbers are discrete. However, number of students taking a course can be measured only in integers so it's a discrete variable.

Doğru Cevap: E

Soru 22

What will be the probability P(2.5

0.1

0.2

0.3

0.4

0.6

Açıklama:

P(2.5

Doğru Cevap: D

Soru 23

What is the mean of variable X, given the probabilty distribution above?

3.2

3.4

3.5

3.6

Açıklama:

Mean=Sum(X.P(X=x))=(1*0.1)+(2*0.2)+(3*0.1)+(4*0.3)+(5*0.2)+(6*0.1)
=0.1+0.4+0.3+1.2+1+0.6=3.6

Doğru Cevap: E

Soru 24

Which one is broadly used in various real life applications especially for acceptance sampling in quality control?

Hypergeometric distribution

Binomial distribution

Poisson distribution

Arithmetic mean

Cumulative distribution

Açıklama:

This probability is called Hypergeometric distribution.The correct answer is A.

Doğru Cevap: A

Soru 25

What is mean score?

The most frequent score in a data set

The gap between the lowest and the highest score

The difference between a certain score and the average

The score obtained by dividing the total scores by the number of scores

Scores which are turned into zvalue

Açıklama:

You get the mean score if you divide the total scores by the number of scores.So the correct answer is D.

Doğru Cevap: D

Soru 26

What is mode?

The most frequent score in a data set

The gap between the lowest and the highest score

The difference between a certain score and the average

The score optained by dividing the total scores by the number of scores

Scores which are turned into zvalue

Açıklama:

Mode is the most frequently appearing score in a data set.So the correct answer is A.

Doğru Cevap: A

Soru 27

What is the variance of the probability distribution given above?

3.62

3.20

2.24

1.68

Açıklama:

We have to first compute the mean of the distribution in order to calculate the variance.
Mean=Sum(X.P(X=x))=(1*0.1)+(2*0.2)+(3*0.1)+(4*0.3)+(5*0.2)+(6*0.1)
=0.1+0.4+0.3+1.2+1+0.6=3.6
Variance=Sum[ P(X)*(X-Mean)²]=0.1*(1-3.6)²+0.2*(2-3.6)²+0.1*(3-3.6)²+0.3*(4-3.6)²+0.2*(5-3.6)²+0.1*(6-3.6)²=2.24

Doğru Cevap: D

Soru 28

Find out the mean score of the following data set: 10,20,40,60,100?

Açıklama:

The answer is E)46 as : The total scores are 230 and when we divide 230 by 5 we get 46

Doğru Cevap: E

Soru 29

Find out the mode in the following data set: 30, 30, 40,40 50,50,65,65,80,80,80

Açıklama:

The correct answer is E because it appears three times.

Doğru Cevap: E

Soru 30

What must be the value of k if the table above is a probability distribution table of variable X?

0.02

0.03

0.05

0.06

0.10

Açıklama:

The sum of all probabilities must be equal to 1, if it is a probability distribution table. Thus 3k+5k+8k+3k+k=20k=1 then k=1/20=0.05

Doğru Cevap: C

Soru 31

What is range?

The most frequent score in a data set

The gap between the lowest and the highest score

The difference between a certain score and the average

The score optained by dividing the total scores by the number of scores

Scores which are turned into zvalue

Açıklama:

Range is the difference between the lowest and the highest score in a data set that's why the correct answer is B.

Doğru Cevap: B

Soru 32

Find out the range in the following data set: 10,15,20,25,30,35,40,45,50,55,60

Açıklama:

The difference between 60(the highest) and 10(the lowest) is 50 so the correct answer is A

Doğru Cevap: A

Soru 33

Which one can be defined as a rule that assigns probabilities to the values of random variables?

Hypergeometric distribution

Binomial distribution

Poisson distribution

Probability distribution

Cumulative distribution

Açıklama:

We can define probability distribution as a rule that assigns probabilities to the values of random variables.So the correct answer is D.

Doğru Cevap: D

Soru 34

Which one is frequently used to summarize the features of probability distributions for a discrete random variable X?

Hypergeometric distribution

Binomial distribution

Arithmetic mean and variance

Probability distribution

Cumulative distribution

Açıklama:

Arithmetic mean and variance are frequently used to summarize the features of probability distributions for a discrete random variable X .So the correct answer is C

Doğru Cevap: C

Soru 35

Which one is a widely employed discrete probability distribution in statistics where a set of independent observations constitutes exactly two disjoint outcomes of a trial?

Hypergeometric distribution

Binomial distribution

Poisson distribution

Probability distribution

Cumulative distribution

Açıklama:

Doğru Cevap: B

Soru 36

What must be the value of "a" if the table above is the probability distribution of variable X whose mean is 2.8?

0.1

0.2

0.3

0.4

0.5

Açıklama:

If this is a probabilty distribution than the sum of all probabilities must be equal to 1. Thus 0.2+0.3+0.1+a+b=1 which means a+b=0.4, thus b=0.4-a. Also it is given that the mean of X is equal to 2.8.
Mean=Sum((X=x)P(X=x))
2.8=(1*0.2)+(2*0.3)+(3*0.1)+(4*a)+(5*b)
2.8=1.1+4a+5b
1.7=4a+5b=4a+5(0.4-a)
1.7=2-a
0.3=a

Doğru Cevap: C

Soru 37

There are 6 red and 4 blue balls in a box. One randomly chooses 4 balls from the box without replacing it. What is the probability of choosing at most 1 red ball out of these 4 balls?

0.12

0.18

0.24

0.36

0.48

Açıklama:

We have to compute zero red balls and 1 red ball.
P(x=0)=C(4,0)*0.6⁰*0.4⁴=0.4⁴
P(x=1)=C(4,1)*0.6¹*0.4³=2.4*0.4³
P(x=0)+P(x=1)=2.4*0.4³+0.4⁴=0.4³(2.4+0.4)=2.8*0.4³=0.1792=0.18

Doğru Cevap: B

Soru 38

If one wants to model the number of customers entering to a fast food restaurant per hour, which probability distribution should he/she use?

Poisson

Binomial

Hypergeometric

Continious

Bernoulli

Açıklama:

Poisson distribution is suitable for modelling the number of outcomes occurring during a specified time interval or in a definite region. Here, the time interval indicates any length, for instance an hour, a day, or a month.

Doğru Cevap: A

Soru 39

A call center receives, on average, 6 calls per minute and number of calls follows a Poisson distribution. What is the probability of receiving at most 2 calls at a given minute?

0.062

0.044

0.028

0.014

0.006

Açıklama:

We have to compute P(x=0)+P(x=1)+P(x=2) in order to find the probability of at most 2 calls at a given minute.
P(x=0)=(e^-6*6⁰)/0!=0.00248
P(x=1)=(e^-6*6¹)/1!=0.0149
P(x=2)=(e^-6*6²)/2!=0.0446
Thus P(x=0)+P(x=1)+P(x=2)=0.06198

Doğru Cevap: A

Soru 40

What is the probability of flipping a coin 3 times but getting no head at all?

1/4

1/8

1/16

1/32

1/64

Açıklama:

We have to find the probabilty of getting 3 tails but no head. Thus the probabilty of TTT=(1/2)*(1/2)*(1/2)=1/8

Doğru Cevap: B

Soru 41

____________________ random variables have only a countable number of separate values such as 0, 1, 2 , 3... etc.Which of the following fills in the blank above?

Continuous

Ratio

Interval

Discrete

Distinct

Açıklama:

Discrete random variables have only a countable number of separate values such as 0, 1, 2 , 3... etc.

Doğru Cevap: D

Soru 42

The _______________ of a discrete random variable is a list of odds associated with each of its possible values.Which of the following is appropriate for filling in the blank above?

ratio

probability distribution

continuous distribution

sum distribution

elevated ratio distribution

Açıklama:

The probability distribution of a discrete random variable is a list of odds associated with each of its possible values. It is also called the probability function or the probability mass function. Basically we can define probability function as a rule that assigns probabilities to the values of random variables.

Doğru Cevap: B

Soru 43

In a discrete probability distribution, The sum of the probabilities of each outcome of the random variable must equal to ___________ ?

0.1

0.4

0.7

0.9

Açıklama:

The sum of the probabilities of each outcome of the random variable must equal to 1.

Doğru Cevap: E

Soru 44

Let's assume that three perfect coins are flipped three times, what is the probability that all the results are head?

2/9

3/8

2/4

1/8

1/16

Açıklama:

The sample space is
S = {(TTT), (TTH), (THT), (HTT), (THH), (HTH), (HHT), (HHH)}
only once all the results are head, therefore
P(all head) = 1/ 8

Doğru Cevap: D

Soru 45

Study the following discrete probability distribution:

According to table what is the probability that the result is equal to 2 or less?

3/8

1/8

2/8

7/8

8/8

Açıklama:

the result is the total of the following probabilities:
1/8 + 3/8 + 3/8 = 7/8

Doğru Cevap: D

Soru 46

According to the following cumulative distribytion function graph, what is the probability of x < -1?

0.5

0.4

0.3

0.1

Açıklama:

It will be zero.

Doğru Cevap: E

Soru 47

What is the arithmetic mean of the following discrete probablity distribution?

0.50

0.75

1.25

1.75

2.25

Açıklama:

Mean = (0 * 1/8) + (1 * 3/8) + (2 * 3/8) + (3 * 1/8) =10/8=1.25

Doğru Cevap: C

Soru 48

In ______________ distribution, an outcome of a random experiment can be classified under two different categories.

Poisson

Normal

Binomial

Geometric

Açıklama:

in binomial distribution, an outcome of a random experiment can be classified under two different categories.

Doğru Cevap: D

Soru 49

In binomial distribution, "The probability of success, denoted by and the probability of failure, denoted by remains _____________ for all trials"?

different

changes trial to trial

constant

variable

fluctuates

Açıklama:

The probability of success, denoted by and the probability of failure, denoted by remains constant for all trials

Doğru Cevap: C

Soru 50

In Binomial distribtuion, Each trial has only_________ possible outcomes?

One

Two

Three

Four

Five

Açıklama:

Each trial has only two possible outcomes, such as head and tail, 0 and 1 or success and failure.

Doğru Cevap: B

Ünite 8

Soru 1

Which of the following can be categorized as a discrete random variable?

Water consumption in a company

The speed of a car in a certain area

Weighs of a people in a population

Electricity consumption of a house

The number of students in a class

Açıklama:

While all variables can take real numbers in other options, the number of the students cannot take a real number in option E. That is the number of students in a class can only take uncountable numbers.

Doğru Cevap: E

Soru 2

Which of the followings are not correct related to probability density function?

In order to calculate the area under a probability function between two points by utilizing probability density function, f (x).

The probability density function f (x), defines the physical characteristics of the random variable.

The probability density function determines the shape of the distribution for the continuous random variable X.

The area under the probability density function f (x) is always equivalent to 3.

Probability density function is represented by f (x).

Açıklama:

Because of the probability density function must satisfy the rule: ∫∞ f(x)dx=1,
the area under the probability density function f (x) is always equivalent to 1

Doğru Cevap: D

Soru 3

Which of the followings is not true about cumulative distribution function?

Cumulative distribution function defined as F(x)=P(X≤x)= ∫x f(t)dt,−∞

The cumulative distribution function probability supplies values by utilizing probability density function.

Cumulative distribution function f(x) of a continuous random variable X fulfils the following property; f(x)≥0 for all x.

Cumulative distribution function f(x) of a continuous random variable X fulfils the following property; if x1 ≤ x2 then F (x1) ≤ F (x2).

Cumulative distribution function f(x) of a continuous random variable X fulfils the following property; 0 ≤ F (x) ≤ 1.

Açıklama:

Not cumulative distribution function but probability density function fulfils the property of f(x)≥0 for all x.

Doğru Cevap: C

Soru 4

Which of the followings is not true about normal distribution?

Normal distribution is one of the most significant and extensively used continuous probability distribution.

Normal distribution provides basis for the statistical inference.

Normal distribution was developed by a mathematician Karl Friedrich Gauss.

Normal distribution is an asymmetric distribution where the random variable values are uniformly scattered around the mean.

Normal distribution can be called as “bell curve” or “Gaussian curve”.

Açıklama:

Normal distribution is a symmetric distribution where the random variable values are uniformly scattered around the mean.

Doğru Cevap: D

Soru 5

Which of the following properties is not valid related to probability density function f (x) for normal distribution?

If x1 ≤ x2 then F (x1) ≤ F (x2)

∫∞ f(x)dx=1

The normal distribution function curve is symmetric around the mean, μ.

The probability density function f (x) does not the touch and intersect x axis.

f (x) ≥ 0 for all x values.

Açıklama:

The property of "If x1 ≤ x2 then F (x1) ≤ F (x2)" is not valid related to probability density function f (x) for normal distribution because it is the property of Cumulative Density Function F(x) of a continuous random variable X.

Doğru Cevap: A

Soru 6

Probability density function for continuous random variable X is defined as follows;
f(x) = 0.02, for 0 ≤ x ≤ 50
Which of the following mean of the continuous random variable X?

150

Açıklama:

μ=E(X)= ∫xf(x)dx= ∫(0.02x)dx=25

Doğru Cevap: A

Soru 7

Probability density function f (x) of normal distribution has the following property;
f (x) ≥ 0 for all x values
Which of the statements explains the property above?

Probability density function of random variable x obtain the non-negative values

The area under the probability density function f (x) always equivalent to 1 in the definition interval of the random variable X.

Normal distribution curve has a similar shape on both sides of the mean x=μ.

The tails of the probability function goes to infinity and at no time crosses or touches the x axis.

P (X < μ ) = P (X > μ ) =0.5.

Açıklama:

Doğru Cevap: A

Soru 8

Probability density function for continuous random variable X is defined as follows;
f (x) = 0.05, for 0 ≤ x ≤ 30.
Which of the following is the standard deviation of this function?

14,08

14,06

14,05

15,07

14,03

Açıklama:

μ=E(X)= ∫xf(x)dx
σ2 =V(x)=E(X−μ)2 = ∫(x−μ)2 f(x)dx= ∫x2 f(x)dx−μ2
The standard deviation of random variable X is a square root of variance= 14.03

Doğru Cevap: E

Soru 9

An eye doctor’s physical examination time is exponentially distributed with a mean of 25 minutes
Which of the following is the probability that the physical exam duration takes less than 20 minutes?

0,33

0,44

0,55

0,66

0,77

Açıklama:

In this problem exponential random variable X represents the physical examination time. Also, the mean μ = 25 minutes of the physical examination time, therefore
λ=1= 1 =0.04μ 25
The probability density function is as follows,
f (x) = (0.04) e-0.04x, x ≥ 0
Therefore X ~ Exponential (λ = 0.04) and to find the probability that the physical exam duration takes less than 20 minutes P (X < 20).
P(X<20)= ∫20=(0.04)e−0.04xdx=−e−0.04x 20 =−(e−0.8 −1)=0.5500

Doğru Cevap:

Soru 10

Consider that continuous random variable X is uniformly distributed and takes values between a and 19 and the mean value of μ=12.
Which of the following the standard deviation for the random variable X?

4.001

4.041

4.104

4.101

4.404

Açıklama:

The variance (σ2) of the continuous uniform random variable X between a and b, X ~ U (a = 5, b = 19) can be calculated from these formula,(b−a)2 (19−5)2
σ2 =V(X)= 12 = 12 =16.33Standard deviation, σ = 4.041

Doğru Cevap: B

Soru 11

Which of the following is not a continuous random variable?

amount of rainfall in Eskişehir

number of university students in Eskişehir

flow rate of Porsuk river

length of streets of Eskişehir

flight height of owls over Anadolu University

Açıklama:

number of university students in Eskişehir. pg. 179. Correct answer is B.

Doğru Cevap: B

Soru 12

Consider the probability density function f(x)=0.04, for 15 ≤ x ≤ 40 and determine the probability of P (25 <= X <= 35) ?

0.1

0.2

0.4

0.5

0.8

Açıklama:

Int(25, 35)(f(x) dx) = 0.04 * x = 0.04 x (35 - 25) = 0.04 * 10 = 0.4. pg. 180. Correct answer is C.

Doğru Cevap: C

Soru 13

Consider the probability density function f(x)=0.025, for 20 ≤ x ≤ 60 and find the standard deviation of this function?

10^1/3

12.5

15^1/2

20 / 3^1/2

Açıklama:

m = E(X) = Int(-sonsuz , sonsuz)(....) = Int(20, 60)(x * f(x) dx) = Int(20, 60)(x * 0.025 * dx) = girdi(20, 60)((1/2) * x2 * 0.025) = (1/2) * 0.025 * (3600 - 400) = 0.25 * 160 = 40 ; V(X) = E(X - m)² = Int(-sonsuz , sonsuz)(....) = Int(20, 60)((x - m)² * f(x) dx) = Int(20, 60)((x - 40)² * 0.025 * dx) = girdi(20, 60)((1/3) * (x - 40)³ * 0.025) = (1/3) * 0.025 * (8000 - (-8000)) = (1/3) * 0.25 * 1600 = 400/3 ; sd = (400/3)^1/2 = 20 / 3^1/2 ; . pg. 184. Correct answer is E.

Doğru Cevap: E

Soru 14

Which one below is NOT one of the differences between continuous and discrete random variables?

Continuous random variables take on uncountable and infinite number of possible outcomes.

Probabilities in continuous random variables can be determined from the area under probability density function.

The range of continuous random variable X comprises all real numbers in an interval.

To describe such structures through continuous random variables density functions are utilized.

Only for continuous random variables, the mean is a measure of the midpoint or center of the probability distribution.

Açıklama:

Doğru Cevap: E

Soru 15

Which statements below are correct?
I Exponential distribution is a type of a discrete random variable.
II Discrete random variables have only a countable number of distinct values.
III A discrete random variable typically comprises of a counting concept.
IV Continuous random variables represent entire infinite values in an interval.
V Continuous random variables are commonly measured instead of counted.

I, II, III, IV

I, II, III, V

II, III, IV, V

I, III, IV, V

Only I

Açıklama:

As it can be recalled that discrete random variables have only a countable number of distinct values such as 0, 1, 2, 3... etc. In other words, a discrete random variable typically comprises of a counting concept. On the other hand, continuous random variables represent entire infinite values in an interval. For that reason, continuous random variables are commonly measured instead of counted. The speed of a plane, waiting time of customers at a bank’s call center, rainfall amount in a given day and inter arrival time between two customers which arrive to the post office are commonly cited examples for continuous random variables.

Doğru Cevap: C

Soru 16

Which one below is NOT an example of a continuous random variable?

The speed of a plane

The number of students who are present in the class

Rainfall amount in a given day

Travel duration from Ankara to Eskisehir on Sundays

Waiting time at the university cafeteria lane during the lunch hour

Açıklama:

A discrete random variable typically comprises of a counting concept. On the other hand, continuous random variables represent entire infinite values in an interval. For that reason, continuous random variables are commonly measured instead of counted. The speed of a plane, waiting time of customers at a bank’s call center, rainfall amount in a given day and inter arrival time between two customers which arrive to the post office are commonly cited examples for continuous random variables.

Doğru Cevap: B

Soru 17

Which option is NOT correct about the probability density function?

Probabilities in continuous random variables can be determined from the area under probability density function.

The probability density function f (x), defines the physical characteristics of the random variable.

Probability density function basically determines the shape of the distribution for the continuous random variable X.

The area under the probability density function f (x) is always greater than 1.

For a continuous random variable X, probability density function f (x)³0 for all x.

Açıklama:

The area under the probability density function f (x) is always equivalent to 1

Doğru Cevap: D

Soru 18

Probability density function for continuous random variable X is defined as follows;
f (x) = 0.02, for 0 ≤ x ≤ 50.
Which one below is the probability of P (X < 20)?

4.4

4.0

0.4

0.04

Açıklama:

Doğru Cevap: C

Soru 19

Consider that continuous random variable X is uniformly distributed and takes values between -5 and b and the mean value of μ = 10. Determine the value of b?

Açıklama:

f(x) = 1 / (b - a) ; E(X) = (a + b) / 2 = (-5 + b) / 2 = 10 ; b = 25. pg. 186. Correct answer is E.

Doğru Cevap: E

Soru 20

Consider that continuous random variable X is uniformly distributed and takes values between 10 and 50. Find the probability of P (15 < X < 25) ?

0.10

0.15

0.20

0.25

0.30

Açıklama:

f(x) = 1 / (b - a) = 1 / (50 - 10) = 1 / 40 ; P (15 < X < 25) = Int(15, 25)(f(x) dx) = Int(15, 25)((1/40) dx) = girdi(15, 25)((1/40) * x) = (1/40) * (25 - 15) = 1/4 = 0.25. pg. 186. Correct answer is D.

Doğru Cevap: D

Soru 21

z score of a standard normally distributed random variable Z for value=a is 0.1700. What is P(Z > a) ?

0.17

0.25

0.33

0.67

0.83

Açıklama:

P(Z > a) = 0.5 - 0.17 = 0.33. pg. 193. Correct answer is C.

Doğru Cevap: C

Soru 22

z score of a standard normally distributed random variable Z for value=-a is 0.195. What is P(Z < (-a)) ?

0.805

0.695

0.305

0.265

0.195

Açıklama:

P(Z < (-a)) = P(Z > a) = 0.5 - 0.195 = 0.305 . pg. 193. Correct answer is C.

Doğru Cevap: C

Soru 23

Random variable X has normal distribution, mean μ=25 and variance σ²=16. Determine the probability P (30 ≤ X ≤ 35) in terms of standard normal distribution?

P(0.25 ≤ z ≤ 0.75)

P(0.50 ≤ z ≤ 1)

P(0.25 ≤ z ≤ 1.5)

P(1 ≤ z ≤ 2)

P(1.25 ≤ z ≤ 2.5)

Açıklama:

z = (x - m) / sd ; sd = variance^1/2 ; (30 - 25) / 4 = 1.25 ; (35 - 25) / 4 = 2.5 ; P(1.25 <= z <= 2.5). pg. 200. Correct answer is E.

Doğru Cevap: E

Soru 24

Assume that waiting time to connect to internet at your home is normally distributed with a mean of 270 seconds and a standard deviation of 90 seconds. Find the probability that you can connect to internet in less than 210 seconds in terms of standard normal distribution?

0.5 - P(z = 1/3)

0.5 - P(z = 2/3)

0.5 + P(z = 1/3)

0.5 + P(z = 2/3)

1 - P(z = 1/3)

Açıklama:

z = (x - m) / sd ; (210 - 270) / 90 = -2/3 ; P(z <= (-2/3)) = P(z >= 2/3) = 0.5 - P(z = 2/3) . pg. 200. Correct answer is B.

Doğru Cevap: B

Soru 25

Suppose that random variable X has exponential distribution with λ=a. Find the probability of P (X ≥ b) ?

e^-a/b

e^-b/a

-e^-b/a

e^-ab

-e^-ab

Açıklama:

f (x) = λ e^-λx = a e^-ax , x ≥ 0 ; P( X >= x) = Int(b, sonsuz)(a * e^−at dt) = girdi(b, sonsuz)(-e^−ax) = 0 - (-e^-ab) = e^-ab . pg. 213. Correct answer is D.

Doğru Cevap: D

Soru 26

Which information below is correct?
I The mean of a continuous random variable X is a weighted
average through the possible values of the random variable and associated probabilities.
II The mean of the continuous random variable is denoted by E (x).
III The mean is also called as expected value and denoted by μ.
IV The variance is denoted by V(x) or σ2.

I, II

I,III

I, IV

II,III

II, IV

Açıklama:

For the calculation of the mean and the variance for the continuous random variables,only difference is integration substitute’s summation. The mean of the continuous random variable is denoted by μ, the mean is also called as expected value and denoted by E (x). The variance is denoted by V (x) or σ2 and it’s a measure of the scatter or variability for data set. the mean of a continuous random variable X is a weighted average through the possible values of the random variable and associated probabilities. Also, the variance of a continuous random variable X is all squared deviations are weighted with associated probability.

Doğru Cevap: C

Soru 27

Pdf for continuous random variable X is defined as follows;
f (x) = 0.02, for 0 ≤ x ≤ 50. What is the mean of the continuous random variable X ?

0.25

2.5

Açıklama:

The mean of the continuous random variable X;

Doğru Cevap: A

Soru 28

Pdf for continuous random variable X is defined as follows;
f (x) = 0.02, for 0 ≤ x ≤ 50. What is he variance of the continuous random variable X?

833.33

208.33

104.167

250.5

0.02

Açıklama:

The variance of the continuous random variable X;

Doğru Cevap: B

Soru 29

Pdf for continuous random variable X is defined as follows;
f (x) = 0.02, for 0 ≤ x ≤ 50. What is the standard deviation of the continuous random variable X?

208.33

833.33

104.167

14.4336

0. 1443

Açıklama:

The standard deviation of the continuous random variable X;

Doğru Cevap: D

Soru 30

Which one below is an example of exponential distribution?

The number of customers a call center representative talks

The ages of students in a class

Consumption amount in a household

The number of customers arrive to the bank

Time between two failures of a certain mechanical device

Açıklama:

Exponential distribution is another most significant and extensively used continuous probability distribution. Exponential random variable is frequently used to model the time interval between two events. Some illustrations of random variables that generally conform to model by means of exponential
distribution are presented below.
• Arrival time between two customers.
• Time between two messages.
• Time between telephone calls received by a customer service.
• Time between customers who are arriving to the checkout lane of the supermarket.
• Time between two failures of a certain mechanical device.

Doğru Cevap: E

Soru 31

I. f (x) ≥ 0 for all x.
II. the area under probability density function between points a and b is equal to 1
III. P(a ≤ X ≤ b) is equal to 1
Which of the statements should the probability density function satisfy?

Only I

Only II

I and II

I and III

II and III

Açıklama:

For a continuous random variable X, probability density function f (x) must satisfy the following
properties,
(i) f (x) ≥ 0 for all x,
(ii) the integral of f(x) between positive infinite and negative infinite is equal to 1
(iii) P(a ≤ X ≤ b) is equal to the area under probability density function between points a and b.
The answer is A.

Doğru Cevap: A

Soru 32

What is the area below the probability density function f (x) = 0.1, for 0 ≤ x ≤ 20 ?

Açıklama:

The probability density function f (x) = 0.1 is a straight line above the values 0 to 20, therefore, it is a rectangle with one side being 0.1 value and the other 20. The area is calculated by 20*(0.1) which is equal to 2. The answer is A.

Doğru Cevap: A

Soru 33

I. 0 ≤ F (x) ≤ 1
II. If x₁ ≤ x₂ then F (x₁) ≤ F (x₂)
III. If x₁ = x₂ then F (x₁) = F (x₂)
Which of the given statements are considered the properties of the cumulative probability function?

Only I

Only II

I and II

I and III

II and III

Açıklama:

I. 0 ≤ F (x) ≤ 1 (True)
II. If x₁ ≤ x₂ then F (x₁) ≤ F (x₂) (True)
III. If x₁ = x₂ then F (x₁) = F (x₂) (False, If x₁ ≤ x₂ then F (x₁) ≤ F (x₂))
The answer is C.

Doğru Cevap: C

Soru 34

What is the mean of f(x) = x for 0 ≤ x ≤ 4 equal to?

Açıklama:

The mean of an f(x) function is equal to the integral of that function's product to it's respective x on the given range. For the function f(x)= x for 0 ≤ x ≤ 3 the integral of x*f(x) on the range 0 to 4 is taken. This would yield to the integral of x²which is equal to x³/3. For x=0 this would equal to 0, for x=4 this would equal to 9. The mean is equal to 9 - 0 = 9. The answer is B.

Doğru Cevap: B

Soru 35

What is the mean of f(x) = x for 0 ≤ x ≤ 4 equal to?

Açıklama:

The variance is equal to V (X) = E (X²) - [E (X)]² for f(x) = x for 0 ≤ x ≤ 4 the f(x²)=x² which would indicate that x*x² is the function to take the integral in order to find the variance. The integral of x³ is x⁴/4 on 0 ≤ x ≤ 4 would yield to the difference of the value found from x = 4 and x = 0. For x=4 the value is 64 and for x=0 the value is 0. The difference between these values is 64. The answer is E.

Doğru Cevap: E

Soru 36

For X which is a continuous random variable that is uniformly distributed and takes values between 3 and 9. What is the probability density function of the random variable X ?

f(x)=0 for 3≤x≤9
f(x)=1/6 otherwise

f(x)=1/6 for 3≤x≤9
f(x)=0 otherwise

f(x)=1/3 for 3≤x≤9
f(x)=0 otherwise

f(x)=1/2 for 3≤x≤9
f(x)=0 otherwise

f(x)=1/6

Açıklama:

For X which is a continuous random variable that is uniformly distributed and takes values between 3 and 9, the minimum value of the random variable X is 3 (it’s the value of a) and the maximum value is 9 (it’s the value of b). Then X~U (a=3, b=9) and the probability density function of the continuous random variable X is defined as, f (x) = 1/(b − a) = 1/(9 − 3)= 1/6 for 3≤x≤9. The answer is B.

Doğru Cevap: B

Soru 37

For X which is a continuous random variable that is uniformly distributed and takes values between 3 and 9. What is the mean of the random variable X ?

Açıklama:

Doğru Cevap: E

Soru 38

For X which is a continuous random variable that is uniformly distributed and takes values between 3 and 9. What is the variance of the random variable X ?

Açıklama:

Doğru Cevap: D

Soru 39

I. The exponential random variable is frequently used to model the time interval between two events.
II. The exponential random variable is defined with a parameter λ.
III. The exponential random variable X defines the time interval between two independent events.
Which one of the given statements can be said to be true about exponential distribution?

Only I

Only II

I and II

I and III

II and III

Açıklama:

I. The exponential random variable is frequently used to model the time interval between two events. (True)
II. The exponential random variable is defined with a parameter λ. (True)
III. The exponential random variable X defines the time interval between two independent events. (False, the exponential random variable X defines the time interval between two consecutive events.)
The answer is C.

Doğru Cevap: C

Soru 40

I. The standard normal curve is symmetric around the mean μ = 0.
II. The standard normal distribution has a standard deviation σ = 1 of the distribution.
III. The area under the standard normal distribution function for P (z ≤ 4.25) is exactly 1.
Which of the given statements can be said to be true about the standard normal distribution function?

Only I

Only II

I and II

I and III

II and III

Açıklama:

The standard normal curve is symmetric around the mean μ = 0. The standard normal distribution has a standard deviation σ = 1 of the distribution. The area under the standard normal distribution function for P (z ≤ 4.25) is approximately 1.
The answer is C.

Doğru Cevap: C

Soru 41

What does a variable's taking on infinite number of possible outcomes in a given interval show?

that it is a continuous random variable

that it is a discrete random variable

that it typically comprises of a counting concept

that it cannot be determined from the area under probability density function

that it cannot keep uncountable measures

Açıklama:

A major difference between continuous and discrete random variables is the former takes on uncountable and infinite number of possible outcomes in a given interval. Hence the range of continuous random variable X comprises all real numbers in an interval. In addition to the above given illustrations, water consumption amount in a household, weights of people in a population, the speed of wind in a open certain area, waiting time in a supermarket, checkout lanes or load on a bridge are the few examples for continuous random variable for real world applications. From these examples it’s clear that random variable X can take unaccountably infinite values. To describe such physical structures through continuous random variables density functions are utilized. Therefore, in contrast to discrete random variables, probabilities in continuous random variables can be determined from the area under probability density function (pdf) which is represented by f (x).

Doğru Cevap: A

Soru 42

Which of the following is a measure of the midpoint or center of the probability distribution?

Mean

Median

Mode

Variance

Range

Açıklama:

Similar to the discrete random variable the mean is a measure of the midpoint or center of the probability distribution.

Doğru Cevap: A

Soru 43

Which of the following is a measure of the dispersion or variability for data set for continuous random variables?

Mean

Mode

Median

Variance

Frequency

Açıklama:

The variance is a measure of the dispersion or variability for data set for continuous
random variables.

Doğru Cevap: D

Soru 44

For which of the following, does the probability density function f (x) of the continuous random variable X take a constant value over the range of the random variable X is defined?

Uniform distribution

Normal distribution

Standard normal distribution

Exponential distribution

Constant distribution

Açıklama:

Continuous uniform distribution is the one of the easiest continuous random variable and the probability density function f (x) of the continuous random variable X takes a constant value over the range of the random variable X is defined.

Doğru Cevap: A

Soru 45

The normal distribution is one of the most significant and extensively used continuous probability distribution because ...
Which of the following correctly concludes the sentence above?

probability density function for normal random variable is easier to apply

real life applications are approximately normally distributed

normally distributed random variables are represented using two parameters

the mean of a normal random variable can have both positive and negative values including zero

standart deviation of the normal random variable can have both positive and negative values

Açıklama:

Normal distribution is one of the most significant and extensively used continuous probability
distribution. The major reason for this circumstance is majority of the continuous random variables
which are observed through real life applications (social, medical, physical, biological) are normally or
approximately normally distributed (bell-shaped) variables.

Doğru Cevap: B

Soru 46

Which of the following is not true for normal distribution?

It provides basis for the statistical inference

It is also called as "uniform distribution"

It is a symmetric distribution where random variable values are uniformly scattered around mean

Population mean has an effect on the shape of its function

Standard deviation has an effect on the shape of its function

Açıklama:

Normal distribution provides basis for the statistical inference. Normal distribution is a symmetric distribution where the random variable values are uniformly scattered around the mean. Population mean, μ and standard deviation σ parameters determine the shape of the normal distribution function. Uniform distribution is different than normal distribution.

Doğru Cevap: B

Soru 47

f (x) ≥ 0 for all x values
The normal distribution function curve is symmetric around the mean, μ
The probability density function f (x) does not the touch and intersect x axis

Which of the above is/are the properties that probability density function f (x) of normal distribution has?

Only I

I and II

I and III

II and III

I, II and III

Açıklama:

Probability density function f (x) of normal distribution has the following properties.
(i) f (x) ≥ 0 for all x values.
(ii) ∫ ∞ −∞ f (x)dx =1
(iii) The normal distribution function curve is symmetric around the mean, μ.
(iv) The probability density function f (x) does not the touch and intersect x axis.

Doğru Cevap: E

Soru 48

According to the properties that probability density function f(x) of normal distribution has, which of the following is not true?

Probability density function of random variable x obtain the non-negative
values at all times

Probability density function, f (x) decreases as the random variable value goes away from the mean, μ

The area under the probability density function f (x) always equivalent to 1 in the definition interval of the random variable X

Normal distribution curve has a similar shape on both sides of the mean x=μ

The left tail of the probability function touches the x axis at point 1

Açıklama:

First property assures that probability density function of random variable x obtain the non-negative values at all times. From the shape of the probability density function curve it’s obvious that pdf, f (x) decreases as the random variable value goes away from the mean, μ. Likewise, probability density function f (x) increases as the random variable value gets closer to the mean, μ. Second property identifies that the area under the probability density function f (x) always equivalent to 1 in the definition interval of the random variable X. Third property suggests that normal distribution curve has a similar shape on both sides of the mean x=μ. That property also proposes the fact that, P (X < μ ) = P (X > μ ) =0.5. Fourth property indicates that the tails of the probability function goes to infinity and at no time crosses or touches the x axis.

Doğru Cevap: E

Soru 49

Arithmetic mean
Standard deviation
Variance

Which of the above determine(s) the shape of the normal random variable?

Only III

I and II

I and III

II and III

I, II and III

Açıklama:

The shape of the normal random variable is determined by the mean μ and the standard deviation σ of the distribution.

Doğru Cevap: B

Soru 50

Frequently used to model the time interval between two events
It is essential to use consistent time units in the determination of probabilities
The mean and the standard deviation of the distribution are equal

Which of the above is/are true for exponential distribution?

Only I

I and II

I and III

II and III

I,II and III

Açıklama:

Exponential distribution is another most significant and extensively used continuous probability distribution. Exponential random variable is frequently used to model the time interval between two events. Exponential random variable is defined with a parameter λ and it’s represented as X~Exponential (λ). In that sense the exponential random variable X defines the time interval between two consecutive events of a Poisson process with a mean of µ = λ. Here λ parameter defines the number of events is a certain time period. Therefore, it’s essential to use consistent time units in the determination of probabilities, mean and variance with the exponential random variable X. The mean (µ) and variance (σ2) for exponential random variable X~Exponential (λ) with parameter λ can be calculated from the following formulas,
E(x) = µ = 1/λ
V (x) =σ2 = 1/λ2
Hence from above given formulas it’s clear that the mean and the standard deviation of the exponential distribution are equal.

Doğru Cevap: E

Statıstıcs I (ENG) - Tüm Sorular

Ünite 1

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler

Seçenekler