results of an experiment must be

In order to continue enjoying our site, we ask that you confirm your identity as a human. Thank you very much for your cooperation.

The Scientific Method Tutorial

The Scientific Method

Steps in the scientific method.

There is a great deal of variation in the specific techniques scientists use explore the natural world. However, the following steps characterize the majority of scientific investigations:

Step 1: Make observations Step 2: Propose a hypothesis to explain observations Step 3: Test the hypothesis with further observations or experiments Step 4: Analyze data Step 5: State conclusions about hypothesis based on data analysis

Each of these steps is explained briefly below, and in more detail later in this section.

Step 1: Make observations

A scientific inquiry typically starts with observations. Often, simple observations will trigger a question in the researcher's mind.

Example: A biologist frequently sees monarch caterpillars feeding on milkweed plants, but rarely sees them feeding on other types of plants. She wonders if it is because the caterpillars prefer milkweed over other food choices.

Step 2: Propose a hypothesis

The researcher develops a hypothesis (singular) or hypotheses (plural) to explain these observations. A hypothesis is a tentative explanation of a phenomenon or observation(s) that can be supported or falsified by further observations or experimentation.

Example: The researcher hypothesizes that monarch caterpillars prefer to feed on milkweed compared to other common plants. (Notice how the hypothesis is a statement, not a question as in step 1.)

Step 3: Test the hypothesis

The researcher makes further observations and/or may design an experiment to test the hypothesis. An experiment is a controlled situation created by a researcher to test the validity of a hypothesis. Whether further observations or an experiment is used to test the hypothesis will depend on the nature of the question and the practicality of manipulating the factors involved.

Example: The researcher sets up an experiment in the lab in which a number of monarch caterpillars are given a choice between milkweed and a number of other common plants to feed on.

Step 4: Analyze data

The researcher summarizes and analyzes the information, or data, generated by these further observations or experiments.

Example: In her experiment, milkweed was chosen by caterpillars 9 times out of 10 over all other plant selections.

Step 5: State conclusions

The researcher interprets the results of experiments or observations and forms conclusions about the meaning of these results. These conclusions are generally expressed as probability statements about their hypothesis.

Example: She concludes that when given a choice, 90 percent of monarch caterpillars prefer to feed on milkweed over other common plants.

Often, the results of one scientific study will raise questions that may be addressed in subsequent research. For example, the above study might lead the researcher to wonder why monarchs seem to prefer to feed on milkweed, and she may plan additional experiments to explore this question. For example, perhaps the milkweed has higher nutritional value than other available plants.

Return to top of page

The Scientific Method Flowchart

The steps in the scientific method are presented visually in the following flow chart. The question raised or the results obtained at each step directly determine how the next step will proceed. Following the flow of the arrows, pass the cursor over each blue box. An explanation and example of each step will appear. As you read the example given at each step, see if you can predict what the next step will be.

Activity: Apply the Scientific Method to Everyday Life Use the steps of the scientific method described above to solve a problem in real life. Suppose you come home one evening and flick the light switch only to find that the light doesn’t turn on. What is your hypothesis? How will you test that hypothesis? Based on the result of this test, what are your conclusions? Follow your instructor's directions for submitting your response.

The above flowchart illustrates the logical sequence of conclusions and decisions in a typical scientific study. There are some important points to note about this process:

1. The steps are clearly linked.

The steps in this process are clearly linked. The hypothesis, formed as a potential explanation for the initial observations, becomes the focus of the study. The hypothesis will determine what further observations are needed or what type of experiment should be done to test its validity. The conclusions of the experiment or further observations will either be in agreement with or will contradict the hypothesis. If the results are in agreement with the hypothesis, this does not prove that the hypothesis is true! In scientific terms, it "lends support" to the hypothesis, which will be tested again and again under a variety of circumstances before researchers accept it as a fairly reliable description of reality.

2. The same steps are not followed in all types of research.

The steps described above present a generalized method followed in a many scientific investigations. These steps are not carved in stone. The question the researcher wishes to answer will influence the steps in the method and how they will be carried out. For example, astronomers do not perform many experiments as defined here. They tend to rely on observations to test theories. Biologists and chemists have the ability to change conditions in a test tube and then observe whether the outcome supports or invalidates their starting hypothesis, while astronomers are not able to change the path of Jupiter around the Sun and observe the outcome!

3. Collected observations may lead to the development of theories.

When a large number of observations and/or experimental results have been compiled, and all are consistent with a generalized description of how some element of nature operates, this description is called a theory. Theories are much broader than hypotheses and are supported by a wide range of evidence. Theories are important scientific tools. They provide a context for interpretation of new observations and also suggest experiments to test their own validity. Theories are discussed in more detail in another section.

. .

The Scientific Method in Detail

In the sections that follow, each step in the scientific method is described in more detail.

Step 1: Observations

Observations in science.

An observation is some thing, event, or phenomenon that is noticed or observed. Observations are listed as the first step in the scientific method because they often provide a starting point, a source of questions a researcher may ask. For example, the observation that leaves change color in the fall may lead a researcher to ask why this is so, and to propose a hypothesis to explain this phenomena. In fact, observations also will provide the key to answering the research question.

In science, observations form the foundation of all hypotheses, experiments, and theories. In an experiment, the researcher carefully plans what observations will be made and how they will be recorded. To be accepted, scientific conclusions and theories must be supported by all available observations. If new observations are made which seem to contradict an established theory, that theory will be re-examined and may be revised to explain the new facts. Observations are the nuts and bolts of science that researchers use to piece together a better understanding of nature.

Observations in science are made in a way that can be precisely communicated to (and verified by) other researchers. In many types of studies (especially in chemistry, physics, and biology), quantitative observations are used. A quantitative observation is one that is expressed and recorded as a quantity, using some standard system of measurement. Quantities such as size, volume, weight, time, distance, or a host of others may be measured in scientific studies.

Some observations that researchers need to make may be difficult or impossible to quantify. Take the example of color. Not all individuals perceive color in exactly the same way. Even apart from limiting conditions such as colorblindness, the way two people see and describe the color of a particular flower, for example, will not be the same. Color, as perceived by the human eye, is an example of a qualitative observation.

Qualitative observations note qualities associated with subjects or samples that are not readily measured. Other examples of qualitative observations might be descriptions of mating behaviors, human facial expressions, or "yes/no" type of data, where some factor is present or absent. Though the qualities of an object may be more difficult to describe or measure than any quantities associated with it, every attempt is made to minimize the effects of the subjective perceptions of the researcher in the process. Some types of studies, such as those in the social and behavioral sciences (which deal with highly variable human subjects), may rely heavily on qualitative observations.

Question: Why are observations important to science?

Limits of Observations

Because all observations rely to some degree on the senses (eyes, ears, or steady hand) of the researcher, complete objectivity is impossible. Our human perceptions are limited by the physical abilities of our sense organs and are interpreted according to our understanding of how the world works, which can be influenced by culture, experience, or education. According to science education specialist, George F. Kneller, "Surprising as it may seem, there is no fact that is not colored by our preconceptions" ("A Method of Enquiry," from Science and Its Ways of Knowing [Upper Saddle River: Prentice-Hall Inc., 1997], 15).

Observations made by a scientist are also limited by the sensitivity of whatever equipment he is using. Research findings will be limited at times by the available technology. For example, Italian physicist and philosopher Galileo Galilei (1564–1642) was reportedly the first person to observe the heavens with a telescope. Imagine how it must have felt to him to see the heavens through this amazing new instrument! It opened a window to the stars and planets and allowed new observations undreamed of before.

In the centuries since Galileo, increasingly more powerful telescopes have been devised that dwarf the power of that first device. In the past decade, we have marveled at images from deep space , courtesy of the Hubble Space Telescope, a large telescope that orbits Earth. Because of its view from outside the distorting effects of the atmosphere, the Hubble can look 50 times farther into space than the best earth-bound telescopes, and resolve details a tenth of the size (Seeds, Michael A., Horizons: Exploring the Universe , 5 th ed. [Belmont: Wadsworth Publishing Company, 1998], 86-87).

Construction is underway on a new radio telescope that scientists say will be able to detect electromagnetic waves from the very edges of the universe! This joint U.S.-Mexican project may allow us to ask questions about the origins of the universe and the beginnings of time that we could never have hoped to answer before. Completion of the new telescope is expected by the end of 2001.

Although the amount of detail observed by Galileo and today's astronomers is vastly different, the stars and their relationships have not changed very much. Yet with each technological advance, the level of detail of observation has been increased, and with it, the power to answer more and more challenging questions with greater precision.

Question: What are some of the differences between a casual observation and a 'scientific observation'?

Step 2: The Hypothesis

A hypothesis is a statement created by the researcher as a potential explanation for an observation or phenomena. The hypothesis converts the researcher's original question into a statement that can be used to make predictions about what should be observed if the hypothesis is true. For example, given the hypothesis, "exposure to ultraviolet (UV) radiation increases the risk of skin cancer," one would predict higher rates of skin cancer among people with greater UV exposure. These predictions could be tested by comparing skin cancer rates among individuals with varying amounts of UV exposure. Note how the hypothesis itself determines what experiments or further observations should be made to test its validity. Results of tests are then compared to predictions from the hypothesis, and conclusions are stated in terms of whether or not the data supports the hypothesis. So the hypothesis serves a guide to the full process of scientific inquiry.

The Qualities of a Good Hypothesis

A hypothesis must be testable or provide predictions that are testable. It can potentially be shown to be false by further observations or experimentation.
A hypothesis should be specific. If it is too general it cannot be tested, or tests will have so many variables that the results will be complicated and difficult to interpret. A well-written hypothesis is so specific it actually determines how the experiment should be set up.
A hypothesis should not include any untested assumptions if they can be avoided. The hypothesis itself may be an assumption that is being tested, but it should be phrased in a way that does not include assumptions that are not tested in the experiment.
It is okay (and sometimes a good idea) to develop more than one hypothesis to explain a set of observations. Competing hypotheses can often be tested side-by-side in the same experiment.

Question: Why is the hypothesis important to the scientific method?

grow well in a lighted incubator maintained at 90 F. A culture of was accidentally left uncovered overnight on a laboratory bench where it was dark and temperatures fluctuated between 65 F and 68 F. When the technician returned in the morning, all the cells were dead. Which of the following statements is the hypothesis to explain why the cells died, based on this observation?		cells to die.

Step 3: Testing the Hypothesis

A hypothesis may be tested in one of two ways: by making additional observations of a natural situation, or by setting up an experiment. In either case, the hypothesis is used to make predictions, and the observations or experimental data collected are examined to determine if they are consistent or inconsistent with those predictions. Hypothesis testing, especially through experimentation, is at the core of the scientific process. It is how scientists gain a better understanding of how things work.

Testing a Hypothesis by Observation

Some hypotheses may be tested through simple observation. For example, a researcher may formulate the hypothesis that the sun always rises in the east. What might an alternative hypothesis be? If his hypothesis is correct, he would predict that the sun will rise in the east tomorrow. He can easily test such a prediction by rising before dawn and going out to observe the sunrise. If the sun rises in the west, he will have disproved the hypothesis. He will have shown that it does not hold true in every situation. However, if he observes on that morning that the sun does in fact rise in the east, he has not proven the hypothesis. He has made a single observation that is consistent with, or supports, the hypothesis. As a scientist, to confidently state that the sun will always rise in the east, he will want to make many observations, under a variety of circumstances. Note that in this instance no manipulation of circumstance is required to test the hypothesis (i.e., you aren't altering the sun in any way).

Testing a Hypothesis by Experimentation

An experiment is a controlled series of observations designed to test a specific hypothesis. In an experiment, the researcher manipulates factors related to the hypothesis in such a way that the effect of these factors on the observations (data) can be readily measured and compared. Most experiments are an attempt to define a cause-and-effect relationship between two factors or events—to explain why something happens. For example, with the hypothesis "roses planted in sunny areas bloom earlier than those grown in shady areas," the experiment would be testing a cause-and-effect relationship between sunlight and time of blooming.

A major advantage of setting up an experiment versus making observations of what is already available is that it allows the researcher to control all the factors or events related to the hypothesis, so that the true cause of an event can be more easily isolated. In all cases, the hypothesis itself will determine the way the experiment will be set up. For example, suppose my hypothesis is "the weight of an object is proportional to the amount of time it takes to fall a certain distance." How would you test this hypothesis?

The Qualities of a Good Experiment

The experiment must be conducted on a group of subjects that are narrowly defined and have certain aspects in common. This is the group to which any conclusions must later be confined. (Examples of possible subjects: female cancer patients over age 40, E. coli bacteria, red giant stars, the nicotine molecule and its derivatives.)
All subjects of the experiment should be (ideally) completely alike in all ways except for the factor or factors that are being tested. Factors that are compared in scientific experiments are called variables. A variable is some aspect of a subject or event that may differ over time or from one group of subjects to another. For example, if a biologist wanted to test the effect of nitrogen on grass growth, he would apply different amounts of nitrogen fertilizer to several plots of grass. The grass in each of the plots should be as alike as possible so that any difference in growth could be attributed to the effect of the nitrogen. For example, all the grass should be of the same species, planted at the same time and at the same density, receive the same amount of water and sunlight, and so on. The variable in this case would be the amount of nitrogen applied to the plants. The researcher would not compare differing amounts of nitrogen across different grass species to determine the effect of nitrogen on grass growth. What is the problem with using different species of plants to compare the effect of nitrogen on plant growth? There are different kinds of variables in an experiment. A factor that the experimenter controls, and changes intentionally to determine if it has an effect, is called an independent variable . A factor that is recorded as data in the experiment, and which is compared across different groups of subjects, is called a dependent variable . In many cases, the value of the dependent variable will be influenced by the value of an independent variable. The goal of the experiment is to determine a cause-and-effect relationship between independent and dependent variables—in this case, an effect of nitrogen on plant growth. In the nitrogen/grass experiment, (1) which factor was the independent variable? (2) Which factor was the dependent variable?
Nearly all types of experiments require a control group and an experimental group. The control group generally is not changed in any way, but remains in a "natural state," while the experimental group is modified in some way to examine the effect of the variable which of interest to the researcher. The control group provides a standard of comparison for the experimental groups. For example, in new drug trials, some patients are given a placebo while others are given doses of the drug being tested. The placebo serves as a control by showing the effect of no drug treatment on the patients. In research terminology, the experimental groups are often referred to as treatments , since each group is treated differently. In the experimental test of the effect of nitrogen on grass growth, what is the control group? In the example of the nitrogen experiment, what is the purpose of a control group?
In research studies a great deal of emphasis is placed on repetition. It is essential that an experiment or study include enough subjects or enough observations for the researcher to make valid conclusions. The two main reasons why repetition is important in scientific studies are (1) variation among subjects or samples and (2) measurement error.

Variation among Subjects

There is a great deal of variation in nature. In a group of experimental subjects, much of this variation may have little to do with the variables being studied, but could still affect the outcome of the experiment in unpredicted ways. For example, in an experiment designed to test the effects of alcohol dose levels on reflex time in 18- to 22-year-old males, there would be significant variation among individual responses to various doses of alcohol. Some of this variation might be due to differences in genetic make-up, to varying levels of previous alcohol use, or any number of factors unknown to the researcher.

Because what the researcher wants to discover is average dose level effects for this group, he must run the test on a number of different subjects. Suppose he performed the test on only 10 individuals. Do you think the average response calculated would be the same as the average response of all 18- to 22-year-old males? What if he tests 100 individuals, or 1,000? Do you think the average he comes up with would be the same in each case? Chances are it would not be. So which average would you predict would be most representative of all 18- to 22-year-old males?

A basic rule of statistics is, the more observations you make, the closer the average of those observations will be to the average for the whole population you are interested in. This is because factors that vary among a population tend to occur most commonly in the middle range, and least commonly at the two extremes. Take human height for example. Although you may find a man who is 7 feet tall, or one who is 4 feet tall, most men will fall somewhere between 5 and 6 feet in height. The more men we measure to determine average male height, the less effect those uncommon extreme (tall or short) individuals will tend to impact the average. Thus, one reason why repetition is so important in experiments is that it helps to assure that the conclusions made will be valid not only for the individuals tested, but also for the greater population those individuals represent.

"The use of a sample (or subset) of a population, an event, or some other aspect of nature for an experimental group that is not large enough to be representative of the whole" is called sampling error (Starr, Cecie, Biology: Concepts and Applications , 4 th ed. [Pacific Cove: Brooks/Cole, 2000], glossary). If too few samples or subjects are used in an experiment, the researcher may draw incorrect conclusions about the population those samples or subjects represent.

Use the jellybean activity below to see a simple demonstration of samping error.

Directions: There are 400 jellybeans in the jar. If you could not see the jar and you initially chose 1 green jellybean from the jar, you might assume the jar only contains green jelly beans. The jar actually contains both green and black jellybeans. Use the "pick 1, 5, or 10" buttons to create your samples. For example, use the "pick" buttons now to create samples of 2, 13, and 27 jellybeans. After you take each sample, try to predict the ratio of green to black jellybeans in the jar. How does your prediction of the ratio of green to black jellybeans change as your sample changes?

Measurement Error

The second reason why repetition is necessary in research studies has to do with measurement error. Measurement error may be the fault of the researcher, a slight difference in measuring techniques among one or more technicians, or the result of limitations or glitches in measuring equipment. Even the most careful researcher or the best state-of-the-art equipment will make some mistakes in measuring or recording data. Another way of looking at this is to say that, in any study, some measurements will be more accurate than others will. If the researcher is conscientious and the equipment is good, the majority of measurements will be highly accurate, some will be somewhat inaccurate, and a few may be considerably inaccurate. In this case, the same reasoning used above also applies here: the more measurements taken, the less effect a few inaccurate measurements will have on the overall average.

Step 4: Data Analysis

In any experiment, observations are made, and often, measurements are taken. Measurements and observations recorded in an experiment are referred to as data . The data collected must relate to the hypothesis being tested. Any differences between experimental and control groups must be expressed in some way (often quantitatively) so that the groups may be compared. Graphs and charts are often used to visualize the data and to identify patterns and relationships among the variables.

Statistics is the branch of mathematics that deals with interpretation of data. Data analysis refers to statistical methods of determining whether any differences between the control group and experimental groups are too great to be attributed to chance alone. Although a discussion of statistical methods is beyond the scope of this tutorial, the data analysis step is crucial because it provides a somewhat standardized means for interpreting data. The statistical methods of data analysis used, and the results of those analyses, are always included in the publication of scientific research. This convention limits the subjective aspects of data interpretation and allows scientists to scrutinize the working methods of their peers.

Why is data analysis an important step in the scientific method?

Step 5: Stating Conclusions

The conclusions made in a scientific experiment are particularly important. Often, the conclusion is the only part of a study that gets communicated to the general public. As such, it must be a statement of reality, based upon the results of the experiment. To assure that this is the case, the conclusions made in an experiment must (1) relate back to the hypothesis being tested, (2) be limited to the population under study, and (3) be stated as probabilities.

The hypothesis that is being tested will be compared to the data collected in the experiment. If the experimental results contradict the hypothesis, it is rejected and further testing of that hypothesis under those conditions is not necessary. However, if the hypothesis is not shown to be wrong, that does not conclusively prove that it is right! In scientific terms, the hypothesis is said to be "supported by the data." Further testing will be done to see if the hypothesis is supported under a number of trials and under different conditions.

If the hypothesis holds up to extensive testing then the temptation is to claim that it is correct. However, keep in mind that the number of experiments and observations made will only represent a subset of all the situations in which the hypothesis may potentially be tested. In other words, experimental data will only show part of the picture. There is always the possibility that a further experiment may show the hypothesis to be wrong in some situations. Also, note that the limits of current knowledge and available technologies may prevent a researcher from devising an experiment that would disprove a particular hypothesis.

The researcher must be sure to limit his or her conclusions to apply only to the subjects tested in the study. If a particular species of fish is shown to consume their young 90 percent of the time when raised in captivity, that doesn't necessarily mean that all fish will do so, or that this fish's behavior would be the same in its native habitat.

Finally, the conclusions of the experiment are generally stated as probabilities. A careful scientist would never say, "drug x kills cancer cells;" she would more likely say, "drug x was shown to destroy 85 percent of cancerous skin cells in rats in lab trials." Notice how very different these two statements are. There is a tendency in the media and in the general public to gravitate toward the first statement. This makes a terrific headline and is also easy to interpret; it is absolute. Remember though, in science conclusions must be confined to the population under study; broad generalizations should be avoided. The second statement is sound science. There is data to back it up. Later studies may reveal a more universal effect of the drug on cancerous cells, or they may not. Most researchers would be unwilling to stake their reputations on the first statement.

As a student, you should read and interpret popular press articles about research studies very carefully. From the text, can you determine how the experiment was set up and what variables were measured? Are the observations and data collected appropriate to the hypothesis being tested? Are the conclusions supported by the data? Are the conclusions worded in a scientific context (as probability statements) or are they generalized for dramatic effect? In any researched-based assignment, it is a good idea to refer to the original publication of a study (usually found in professional journals) and to interpret the facts for yourself.

Qualities of a Good Experiment

narrowly defined subjects
all subjects treated alike except for the factor or variable being studied
a control group is used for comparison
measurements related to the factors being studied are carefully recorded
enough samples or subjects are used so that conclusions are valid for the population of interest
conclusions made relate back to the hypothesis, are limited to the population being studied, and are stated in terms of probabilities

by Stephen S. Carey.

Have a language expert improve your writing

Run a free plagiarism check in 10 minutes, generate accurate citations for free.

Knowledge Base

Methodology

What Is a Controlled Experiment? | Definitions & Examples

What Is a Controlled Experiment? | Definitions & Examples

Published on April 19, 2021 by Pritha Bhandari . Revised on June 22, 2023.

In experiments , researchers manipulate independent variables to test their effects on dependent variables. In a controlled experiment , all variables other than the independent variable are controlled or held constant so they don’t influence the dependent variable.

Controlling variables can involve:

holding variables at a constant or restricted level (e.g., keeping room temperature fixed).
measuring variables to statistically control for them in your analyses.
balancing variables across your experiment through randomization (e.g., using a random order of tasks).

Why does control matter in experiments, methods of control, problems with controlled experiments, other interesting articles, frequently asked questions about controlled experiments.

Control in experiments is critical for internal validity , which allows you to establish a cause-and-effect relationship between variables. Strong validity also helps you avoid research biases , particularly ones related to issues with generalizability (like sampling bias and selection bias .)

Your independent variable is the color used in advertising.
Your dependent variable is the price that participants are willing to pay for a standard fast food meal.

Extraneous variables are factors that you’re not interested in studying, but that can still influence the dependent variable. For strong internal validity, you need to remove their effects from your experiment.

Design and description of the meal,
Study environment (e.g., temperature or lighting),
Participant’s frequency of buying fast food,
Participant’s familiarity with the specific fast food brand,
Participant’s socioeconomic status.

Here's why students love Scribbr's proofreading services

Discover proofreading & editing

You can control some variables by standardizing your data collection procedures. All participants should be tested in the same environment with identical materials. Only the independent variable (e.g., ad color) should be systematically changed between groups.

Other extraneous variables can be controlled through your sampling procedures . Ideally, you’ll select a sample that’s representative of your target population by using relevant inclusion and exclusion criteria (e.g., including participants from a specific income bracket, and not including participants with color blindness).

By measuring extraneous participant variables (e.g., age or gender) that may affect your experimental results, you can also include them in later analyses.

After gathering your participants, you’ll need to place them into groups to test different independent variable treatments. The types of groups and method of assigning participants to groups will help you implement control in your experiment.

Control groups

Controlled experiments require control groups . Control groups allow you to test a comparable treatment, no treatment, or a fake treatment (e.g., a placebo to control for a placebo effect ), and compare the outcome with your experimental treatment.

You can assess whether it’s your treatment specifically that caused the outcomes, or whether time or any other treatment might have resulted in the same effects.

To test the effect of colors in advertising, each participant is placed in one of two groups:

A control group that’s presented with red advertisements for a fast food meal.
An experimental group that’s presented with green advertisements for the same fast food meal.

Random assignment

To avoid systematic differences and selection bias between the participants in your control and treatment groups, you should use random assignment .

This helps ensure that any extraneous participant variables are evenly distributed, allowing for a valid comparison between groups .

Random assignment is a hallmark of a “true experiment”—it differentiates true experiments from quasi-experiments .

Masking (blinding)

Masking in experiments means hiding condition assignment from participants or researchers—or, in a double-blind study , from both. It’s often used in clinical studies that test new treatments or drugs and is critical for avoiding several types of research bias .

Sometimes, researchers may unintentionally encourage participants to behave in ways that support their hypotheses , leading to observer bias . In other cases, cues in the study environment may signal the goal of the experiment to participants and influence their responses. These are called demand characteristics . If participants behave a particular way due to awareness of being observed (called a Hawthorne effect ), your results could be invalidated.

Using masking means that participants don’t know whether they’re in the control group or the experimental group. This helps you control biases from participants or researchers that could influence your study results.

You use an online survey form to present the advertisements to participants, and you leave the room while each participant completes the survey on the computer so that you can’t tell which condition each participant was in.

Although controlled experiments are the strongest way to test causal relationships, they also involve some challenges.

Difficult to control all variables

Especially in research with human participants, it’s impossible to hold all extraneous variables constant, because every individual has different experiences that may influence their perception, attitudes, or behaviors.

But measuring or restricting extraneous variables allows you to limit their influence or statistically control for them in your study.

Risk of low external validity

Controlled experiments have disadvantages when it comes to external validity —the extent to which your results can be generalized to broad populations and settings.

The more controlled your experiment is, the less it resembles real world contexts. That makes it harder to apply your findings outside of a controlled setting.

There’s always a tradeoff between internal and external validity . It’s important to consider your research aims when deciding whether to prioritize control or generalizability in your experiment.

If you want to know more about statistics , methodology , or research bias , make sure to check out some of our other articles with explanations and examples.

Student’s t -distribution
Normal distribution
Null and Alternative Hypotheses
Chi square tests
Confidence interval
Quartiles & Quantiles
Cluster sampling
Stratified sampling
Data cleansing
Reproducibility vs Replicability
Peer review
Prospective cohort study

Research bias

Implicit bias
Cognitive bias
Placebo effect
Hawthorne effect
Hindsight bias
Affect heuristic
Social desirability bias

Prevent plagiarism. Run a free check.

In a controlled experiment , all extraneous variables are held constant so that they can’t influence the results. Controlled experiments require:

A control group that receives a standard treatment, a fake treatment, or no treatment.
Random assignment of participants to ensure the groups are equivalent.

Depending on your study topic, there are various other methods of controlling variables .

An experimental group, also known as a treatment group, receives the treatment whose effect researchers wish to study, whereas a control group does not. They should be identical in all other ways.

Experimental design means planning a set of procedures to investigate a relationship between variables . To design a controlled experiment, you need:

A testable hypothesis
At least one independent variable that can be precisely manipulated
At least one dependent variable that can be precisely measured

When designing the experiment, you decide:

How you will manipulate the variable(s)
How you will control for any potential confounding variables
How many subjects or samples will be included in the study
How subjects will be assigned to treatment levels

Experimental design is essential to the internal and external validity of your experiment.

Cite this Scribbr article

If you want to cite this source, you can copy and paste the citation or click the “Cite this Scribbr article” button to automatically add the citation to our free Citation Generator.

Bhandari, P. (2023, June 22). What Is a Controlled Experiment? | Definitions & Examples. Scribbr. Retrieved July 2, 2024, from https://www.scribbr.com/methodology/controlled-experiment/

Is this article helpful?

Pritha Bhandari

Other students also liked, extraneous variables | examples, types & controls, guide to experimental design | overview, steps, & examples, how to write a lab report, get unlimited documents corrected.

✔ Free APA citation check included ✔ Unlimited document corrections ✔ Specialized in correcting academic texts

Sciencing_Icons_Science SCIENCE

Sciencing_icons_biology biology, sciencing_icons_cells cells, sciencing_icons_molecular molecular, sciencing_icons_microorganisms microorganisms, sciencing_icons_genetics genetics, sciencing_icons_human body human body, sciencing_icons_ecology ecology, sciencing_icons_chemistry chemistry, sciencing_icons_atomic & molecular structure atomic & molecular structure, sciencing_icons_bonds bonds, sciencing_icons_reactions reactions, sciencing_icons_stoichiometry stoichiometry, sciencing_icons_solutions solutions, sciencing_icons_acids & bases acids & bases, sciencing_icons_thermodynamics thermodynamics, sciencing_icons_organic chemistry organic chemistry, sciencing_icons_physics physics, sciencing_icons_fundamentals-physics fundamentals, sciencing_icons_electronics electronics, sciencing_icons_waves waves, sciencing_icons_energy energy, sciencing_icons_fluid fluid, sciencing_icons_astronomy astronomy, sciencing_icons_geology geology, sciencing_icons_fundamentals-geology fundamentals, sciencing_icons_minerals & rocks minerals & rocks, sciencing_icons_earth scructure earth structure, sciencing_icons_fossils fossils, sciencing_icons_natural disasters natural disasters, sciencing_icons_nature nature, sciencing_icons_ecosystems ecosystems, sciencing_icons_environment environment, sciencing_icons_insects insects, sciencing_icons_plants & mushrooms plants & mushrooms, sciencing_icons_animals animals, sciencing_icons_math math, sciencing_icons_arithmetic arithmetic, sciencing_icons_addition & subtraction addition & subtraction, sciencing_icons_multiplication & division multiplication & division, sciencing_icons_decimals decimals, sciencing_icons_fractions fractions, sciencing_icons_conversions conversions, sciencing_icons_algebra algebra, sciencing_icons_working with units working with units, sciencing_icons_equations & expressions equations & expressions, sciencing_icons_ratios & proportions ratios & proportions, sciencing_icons_inequalities inequalities, sciencing_icons_exponents & logarithms exponents & logarithms, sciencing_icons_factorization factorization, sciencing_icons_functions functions, sciencing_icons_linear equations linear equations, sciencing_icons_graphs graphs, sciencing_icons_quadratics quadratics, sciencing_icons_polynomials polynomials, sciencing_icons_geometry geometry, sciencing_icons_fundamentals-geometry fundamentals, sciencing_icons_cartesian cartesian, sciencing_icons_circles circles, sciencing_icons_solids solids, sciencing_icons_trigonometry trigonometry, sciencing_icons_probability-statistics probability & statistics, sciencing_icons_mean-median-mode mean/median/mode, sciencing_icons_independent-dependent variables independent/dependent variables, sciencing_icons_deviation deviation, sciencing_icons_correlation correlation, sciencing_icons_sampling sampling, sciencing_icons_distributions distributions, sciencing_icons_probability probability, sciencing_icons_calculus calculus, sciencing_icons_differentiation-integration differentiation/integration, sciencing_icons_application application, sciencing_icons_projects projects, sciencing_icons_news news.

Share Tweet Email Print
Home ⋅
Science Fair Project Ideas for Kids, Middle & High School Students ⋅
Probability & Statistics

What Are Constants & Controls of a Science Project Experiment?

What Is a Standardized Variable in Biology?

The scientific method involves asking a question, doing research, forming a hypothesis and testing the hypothesis via an experiment, so that the results can be analyzed. Every successful science experiment must include specific types of variables. There must be an independent variable, which changes throughout the course of an experiment; a dependent variable, which is observed and measured; and a controlled variable, also known as the "constant" variable, which must remain consistent and unchanging throughout the experiment. Even though the controlled or constant variable in an experiment does not change, it is every bit as important to the success of a science experiment as the other variables.

TL;DR (Too Long; Didn't Read)

TL;DR: In a science experiment, the controlled or constant variable is a variable that does not change. For example, in an experiment to test the effect of different lights on plants, other factors that affect plant growth and health, such as soil quality and watering, would need to remain constant.

Example of an Independent Variable

Let's say that a scientist is performing an experiment to test the effect of different lighting on houseplants. In this case, the lighting itself would be the independent variable, because it is the variable that the scientist is actively changing, over the course of the experiment. Whether the scientist is using different bulbs or altering the amount of light given to the plants, the light is the variable being altered, and is therefore the independent variable.

Example of a Dependent Variable

Dependent variables are the traits that a scientist observes, in relation to the independent variable. In other words, the dependent variable changes depending on the alterations made to the independent variable. In the houseplant experiment, the dependent variables would be the properties of the plants themselves, which the scientist is observing in relation to the changing light. These properties might include the plants' color, height and general health.

Example of a Controlled Variable

A controlled or constant variable does not change throughout the course of an experiment. It is vitally important that every scientific experiment include a controlled variable; otherwise, the conclusions of an experiment are impossible to understand. For example, in the houseplant experiment, controlled variables might be things such as the the quality of soil and the amount of water given to the plants. If these factors were not constant, and certain plants received more water or better soil than others, then there would be no way for the scientist to be sure that the plants weren't changing based on those factors instead of the different kinds of light. A plant might be healthy and green because of the amount of light it received, or it could be because it was given more water than the other plants. In this case, it would be impossible to draw proper conclusions based on the experiment.

However, if all plants are given the same amount of water and the same quality of soil, then the scientist can be sure that any changes from one plant to another are due to changes made to the independent variable: the light. Even though the controlled variable did not change and was not the variable actually being tested, it allowed the scientist to observe the cause-and-effect relationship between plant health and different types of lighting. In other words, it allowed for a successful scientific experiment.

Difference between manipulative & responding variable, what is a responding variable in science projects, definitions of control, constant, independent and dependent..., what is a constant in a science fair project, what are dependent, independent & controlled variables, what type of plants are best for science projects, how to collect data from a science project, what are independent & dependent variables in science..., can a science experiment have two manipulated variables, what is a constant in the scientific method, what is an independent variable in quantitative research, what is the difference between a control & a controlled..., what is the meaning of variables in research, what is a scientist who studies plants called, science fair project for testing different soils with..., why should you only test for one variable at a time..., what are the independent variables for a moldy bread..., what is the role of carotenoids in photosynthesis.

ScienceBuddies: Variables in Your Science Fair Project
ThoughtCo: What Is An Experiment?

About the Author

Maria Cook is a freelance and fiction writer from Indianapolis, Indiana. She holds an MFA in Creative Writing from Butler University in Indianapolis. She has written about science as it relates to eco-friendly practices, conservation and the environment for Green Matters.

Find Your Next Great Science Fair Project! GO

What Is an Experiment? Definition and Design

The Basics of an Experiment

Chemical Laws
Periodic Table
Projects & Experiments
Scientific Method
Biochemistry
Physical Chemistry
Medical Chemistry
Chemistry In Everyday Life
Famous Chemists
Activities for Kids
Abbreviations & Acronyms
Weather & Climate
Ph.D., Biomedical Sciences, University of Tennessee at Knoxville
B.A., Physics and Mathematics, Hastings College

Science is concerned with experiments and experimentation, but do you know what exactly an experiment is? Here's a look at what an experiment is... and isn't!

Key Takeaways: Experiments

An experiment is a procedure designed to test a hypothesis as part of the scientific method.
The two key variables in any experiment are the independent and dependent variables. The independent variable is controlled or changed to test its effects on the dependent variable.
Three key types of experiments are controlled experiments, field experiments, and natural experiments.

What Is an Experiment? The Short Answer

In its simplest form, an experiment is simply the test of a hypothesis . A hypothesis, in turn, is a proposed relationship or explanation of phenomena.

Experiment Basics

The experiment is the foundation of the scientific method , which is a systematic means of exploring the world around you. Although some experiments take place in laboratories, you could perform an experiment anywhere, at any time.

Take a look at the steps of the scientific method:

Make observations.
Formulate a hypothesis.
Design and conduct an experiment to test the hypothesis.
Evaluate the results of the experiment.
Accept or reject the hypothesis.
If necessary, make and test a new hypothesis.

Types of Experiments

Natural Experiments : A natural experiment also is called a quasi-experiment. A natural experiment involves making a prediction or forming a hypothesis and then gathering data by observing a system. The variables are not controlled in a natural experiment.
Controlled Experiments : Lab experiments are controlled experiments , although you can perform a controlled experiment outside of a lab setting! In a controlled experiment, you compare an experimental group with a control group. Ideally, these two groups are identical except for one variable , the independent variable .
Field Experiments : A field experiment may be either a natural experiment or a controlled experiment. It takes place in a real-world setting, rather than under lab conditions. For example, an experiment involving an animal in its natural habitat would be a field experiment.

Variables in an Experiment

Simply put, a variable is anything you can change or control in an experiment. Common examples of variables include temperature, duration of the experiment, composition of a material, amount of light, etc. There are three kinds of variables in an experiment: controlled variables, independent variables and dependent variables .

Controlled variables , sometimes called constant variables are variables that are kept constant or unchanging. For example, if you are doing an experiment measuring the fizz released from different types of soda, you might control the size of the container so that all brands of soda would be in 12-oz cans. If you are performing an experiment on the effect of spraying plants with different chemicals, you would try to maintain the same pressure and maybe the same volume when spraying your plants.

The independent variable is the one factor that you are changing. It is one factor because usually in an experiment you try to change one thing at a time. This makes measurements and interpretation of the data much easier. If you are trying to determine whether heating water allows you to dissolve more sugar in the water then your independent variable is the temperature of the water. This is the variable you are purposely controlling.

The dependent variable is the variable you observe, to see whether it is affected by your independent variable. In the example where you are heating water to see if this affects the amount of sugar you can dissolve , the mass or volume of sugar (whichever you choose to measure) would be your dependent variable.

Validity, Accuracy and Reliability Explained with Examples

This is part of the NSW HSC science curriculum part of the Working Scientifically skills.

Part 1 – Validity

Part 2 – Accuracy

Part 3 – Reliability

Science experiments are an essential part of high school education, helping students understand key concepts and develop critical thinking skills. However, the value of an experiment lies in its validity, accuracy, and reliability. Let's break down these terms and explore how they can be improved and reduced, using simple experiments as examples.

Target Analogy to Understand Accuracy and Reliability

The target analogy is a classic way to understand the concepts of accuracy and reliability in scientific measurements and experiments.

Accuracy refers to how close a measurement is to the true or accepted value. In the analogy, it's how close the arrows come to hitting the bullseye (represents the true or accepted value).

Reliability refers to the consistency of a set of measurements. Reliable data can be reproduced under the same conditions. In the analogy, it's represented by how tightly the arrows are grouped together, regardless of whether they hit the bullseye. Therefore, we can have scientific results that are reliable but inaccurate.

Validity refers to how well an experiment investigates the aim or tests the underlying hypothesis. While validity is not represented in this target analogy, the validity of an experiment can sometimes be assessed by using the accuracy of results as a proxy. Experiments that produce accurate results are likely to be valid as invalid experiments usually do not yield accurate result.

Validity refers to how well an experiment measures what it is supposed to measure and investigates the aim.

Ask yourself the questions:

"Is my experimental method and design suitable?"
"Is my experiment testing or investigating what it's suppose to?"

For example, if you're investigating the effect of the volume of water (independent variable) on plant growth, your experiment would be valid if you measure growth factors like height or leaf size (these would be your dependent variables).

However, validity entails more than just what's being measured. When assessing validity, you should also examine how well the experimental methodology investigates the aim of the experiment.

Assessing Validity

An experiment’s procedure, the subsequent methods of analysis of the data, the data itself, and the conclusion you draw from the data, all have their own associated validities. It is important to understand this division because there are different factors to consider when assessing the validity of any single one of them. The validity of an experiment as a whole , depends on the individual validities of these components.

When assessing the validity of the procedure , consider the following:

Does the procedure control all necessary variables except for the dependent and independent variables? That is, have you isolated the effect of the independent variable on the dependent variable?
Does this effect you have isolated actually address the aim and/or hypothesis?
Does your method include enough repetitions for a reliable result? (Read more about reliability below)

When assessing the validity of the method of analysis of the data , consider the following:

Does the analysis extrapolate or interpolate the experimental data? Generally, interpolation is valid, but extrapolation is invalid. This because by extrapolating, you are ‘peering out into the darkness’ – just because your data showed a certain trend for a certain range it does not mean that this trend will hold for all.
Does the analysis use accepted laws and mathematical relationships? That is, do the equations used for analysis have scientific or mathematical base? For example, `F = ma` is an accepted law in physics, but if in the analysis you made up a relationship like `F = ma^2` that has no scientific or mathematical backing, the method of analysis is invalid.
Is the most appropriate method of analysis used? Consider the differences between using a table and a graph. In a graph, you can use the gradient to minimise the effects of systematic errors and can also reduce the effect of random errors. The visual nature of a graph also allows you to easily identify outliers and potentially exclude them from analysis. This is why graphical analysis is generally more valid than using values from tables.

When assessing the validity of your results , consider the following:

Is your primary data (data you collected from your own experiment) BOTH accurate and reliable? If not, it is invalid.
Are the secondary sources you may have used BOTH reliable and accurate?

When assessing the validity of your conclusion , consider the following:

Does your conclusion relate directly to the aim or the hypothesis?

How to Improve Validity

Ways of improving validity will differ across experiments. You must first identify what area(s) of the experiment’s validity is lacking (is it the procedure, analysis, results, or conclusion?). Then, you must come up with ways of overcoming the particular weakness.

Below are some examples of this.

Example – Validity in Chemistry Experiment

Let's say we want to measure the mass of carbon dioxide in a can of soft drink.

The following steps are followed:

Weigh an unopened can of soft drink on an electronic balance.
Open the can.
Place the can on a hot plate until it begins to boil.
When cool, re-weigh the can to determine the mass loss.

To ensure this experiment is valid, we must establish controlled variables:

type of soft drink used
temperature at which this experiment is conducted
period of time before soft drink is re-weighed

Despite these controlled variables, this experiment is invalid because it actually doesn't help us measure the mass of carbon dioxide in the soft drink. This is because by heating the soft drink until it boils, we are also losing water due to evaporation. As a result, the mass loss measured is not only due to the loss of carbon dioxide, but also water. A simple way to improve the validity of this experiment is to not heat it; by simply opening the can of soft drink, carbon dioxide in the can will escape without loss of water.

Example – Validity in Physics Experiment

Let's say we want to measure the value of gravitational acceleration `g` using a simple pendulum system, and the following equation:

$$T = 2\pi \sqrt{\frac{l}{g}}$$

`T` is the period of oscillation
`l` is the length of string attached to the mass
`g` is the acceleration due to gravity

Cut a piece of a string or dental floss so that it is 1.0 m long.
Attach a 500.0 g mass of high density to the end of the string.
Attach the other end of the string to the retort stand using a clamp.
Starting at an angle of less than 10º, allow the pendulum to swing and measure the pendulum’s period for 10 oscillations using a stopwatch.
Repeat the experiment with 1.2 m, 1.5 m and 1.8 m strings.

The controlled variables we must established in this experiment include:

mass used in the pendulum
location at which the experiment is conducted

The validity of this experiment depends on the starting angle of oscillation. The above equation (method of analysis) is only true for small angles (`\theta < 15^{\circ}`) such that `\sin \theta = \theta`. We also want to make sure the pendulum system has a small enough surface area to minimise the effect of air resistance on its oscillation.

In this instance, it would be invalid to use a pair of values (length and period) to calculate the value of gravitational acceleration. A more appropriate method of analysis would be to plot the length and period squared to obtain a linear relationship, then use the value of the gradient of the line of best fit to determine the value of `g`.

Accuracy refers to how close the experimental measurements are to the true value.

Accuracy depends on

the validity of the experiment
the degree of error:
systematic errors are those that are systemic in your experiment. That is, they effect every single one of your data points consistently, meaning that the cause of the error is always present. For example, it could be a badly calibrated temperature gauge that reports every reading 5 °C above the true value.
random errors are errors that occur inconsistently. For example, the temperature gauge readings might be affected by random fluctuations in room temperature. Some readings might be above the true value, some might then be below the true value.
sensitivity of equipment used.

Assessing Accuracy

The effect of errors and insensitive equipment can both be captured by calculating the percentage error:

$$\text{% error} = \frac{\text{|experimental value – true value|}}{\text{true value}} \times 100%$$

Generally, measurements are considered accurate when the percentage error is less than 5%. You should always take the context of the experimental into account when assessing accuracy.

While accuracy and validity have different definitions, the two are closely related. Accurate results often suggest that the underlying experiment is valid, as invalid experiments are unlikely to produce accurate results.

In a simple pendulum experiment, if your measurements of the pendulum's period are close to the calculated value, your experiment is accurate. A table showing sample experimental measurements vs accepted values from using the equation above is shown below.

All experimental values in the table above are within 5% of accepted (theoretical) values, they are therefore considered as accurate.

How to Improve Accuracy

Remove systematic errors : for example, if the experiment’s measuring instruments are poorly calibrated, then you should correctly calibrate it before doing the experiment again.
Reduce the influence of random errors : this can be done by having more repetitions in the experiment and reporting the average values. This is because if you have enough of these random errors – some above the true value and some below the true value – then averaging them will make them cancel each other out This brings your average value closer and closer to the true value.
Use More Sensitive Equipments: For example, use a recording to measure time by analysing motion of an object frame by frame, instead of using a stopwatch. The sensitivity of an equipment can be measured by the limit of reading . For example, stopwatches may only measure to the nearest millisecond – that is their limit of reading. But recordings can be analysed to the frame. And, depending on the frame rate of the camera, this could mean measuring to the nearest microsecond.
Obtain More Measurements and Over a Wider Range: In some cases, the relationship between two variables can be more accurately determined by testing over a wider range. For example, in the pendulum experiment, periods when strings of various lengths are used can be measured. In this instance, repeating the experiment does not relate to reliability because we have changed the value of the independent variable tested.

Reliability

Reliability involves the consistency of your results over multiple trials.

Assessing Reliability

The reliability of an experiment can be broken down into the reliability of the procedure and the reliability of the final results.

The reliability of the procedure refers to how consistently the steps of your experiment produce similar results. For example, if an experiment produces the same values every time it is repeated, then it is highly reliable. This can be assessed quantitatively by looking at the spread of measurements, using statistical tests such as greatest deviation from the mean, standard deviations, or z-scores.

Ask yourself: "Is my result reproducible?"

The reliability of results cannot be assessed if there is only one data point or measurement obtained in the experiment. There must be at least 3. When you're repeating the experiment to assess the reliability of its results, you must follow the same steps , use the same value for the independent variable. Results obtained from methods with different steps cannot be assessed for their reliability.

Obtaining only one measurement in an experiment is not enough because it could be affected by errors and have been produced due to pure chance. Repeating the experiment and obtaining the same or similar results will increase your confidence that the results are reproducible (therefore reliable).

In the soft drink experiment, reliability can be assessed by repeating the steps at least three times:

The mass loss measured in all three trials are fairly consistent, suggesting that the reliability of the underly method is high.

The reliability of the final results refers to how consistently your final data points (e.g. average value of repeated trials) point towards the same trend. That is, how close are they all to the trend line? This can be assessed quantitatively using the `R^2` value. `R^2` value ranges between 0 and 1, a value of 0 suggests there is no correlation between data points, and a value of 1 suggests a perfect correlation with no variance from trend line.

In the pendulum experiment, we can calculate the `R^2` value (done in Excel) by using the final average period values measured for each pendulum length.

Here, a `R^2` value of 0.9758 suggests the four average values are fairly close to the overall linear trend line (low variance from trend line). Thus, the results are fairly reliable.

How to Improve Reliability

A common misconception is that increasing the number of trials increases the reliability of the procedure . This is not true. The only way to increase the reliability of the procedure is to revise it. This could mean using instruments that are less susceptible to random errors, which cause measurements to be more variable.

Increasing the number of trials actually increases the reliability of the final results . This is because having more repetitions reduces the influence of random errors and brings the average values closer to the true values. Generally, the closer experimental values are to true values, the closer they are to the true trend. That is, accurate data points are generally reliable and all point towards the same trend.

Reliable but Inaccurate / Invalid

It is important to understand that results from an experiment can be reliable (consistent), but inaccurate (deviate greatly from theoretical values) and/or invalid. In this case, your procedure is reliable, but your final results likely are not.

Examples of Reliability

Using the soft drink example again, if the mass losses measured for three soft drinks (same brand and type of drink) are consistent, then it's reliable.

Using the pendulum example again, if you get similar period measurements every time you repeat the experiment, it’s reliable.

However, in both cases, if the underlying methods are invalid, the consistent results would be invalid and inaccurate (despite being reliable).

Do you have trouble understanding validity, accuracy or reliability in your science experiment or depth study?

Consider getting personalised help from our 1-on-1 mentoring program !

RETURN TO WORKING SCIENTIFICALLY

choosing a selection results in a full page refresh
press the space key then arrow keys to make a selection

If you're seeing this message, it means we're having trouble loading external resources on our website.

If you're behind a web filter, please make sure that the domains *.kastatic.org and *.kasandbox.org are unblocked.

To log in and use all the features of Khan Academy, please enable JavaScript in your browser.

Biology archive

Course: biology archive > unit 1, the scientific method.

Controlled experiments
The scientific method and experimental design

Introduction

Make an observation.
Ask a question.
Form a hypothesis , or testable explanation.
Make a prediction based on the hypothesis.
Test the prediction.
Iterate: use the results to make new hypotheses or predictions.

Scientific method example: Failure to toast

1. make an observation., 2. ask a question., 3. propose a hypothesis., 4. make predictions., 5. test the predictions..

If the toaster does toast, then the hypothesis is supported—likely correct.
If the toaster doesn't toast, then the hypothesis is not supported—likely wrong.

Logical possibility

Practical possibility, building a body of evidence, 6. iterate..

If the hypothesis was supported, we might do additional tests to confirm it, or revise it to be more specific. For instance, we might investigate why the outlet is broken.
If the hypothesis was not supported, we would come up with a new hypothesis. For instance, the next hypothesis might be that there's a broken wire in the toaster.

Want to join the conversation?

Upvote Button navigates to signup page
Downvote Button navigates to signup page
Flag Button navigates to signup page

Chapter 6: Experimental Research

6.1 experiment basics, learning objectives.

Explain what an experiment is and recognize examples of studies that are experiments and studies that are not experiments.
Explain what internal validity is and why experiments are considered to be high in internal validity.
Explain what external validity is and evaluate studies in terms of their external validity.
Distinguish between the manipulation of the independent variable and control of extraneous variables and explain the importance of each.
Recognize examples of confounding variables and explain how they affect the internal validity of a study.

What Is an Experiment?

As we saw earlier in the book, an experiment is a type of study designed specifically to answer the question of whether there is a causal relationship between two variables. Do changes in an independent variable cause changes in a dependent variable? Experiments have two fundamental features. The first is that the researchers manipulate, or systematically vary, the level of the independent variable. The different levels of the independent variable are called conditions. For example, in Darley and Latané’s experiment, the independent variable was the number of witnesses that participants believed to be present. The researchers manipulated this independent variable by telling participants that there were either one, two, or five other students involved in the discussion, thereby creating three conditions. The second fundamental feature of an experiment is that the researcher controls, or minimizes the variability in, variables other than the independent and dependent variable. These other variables are called extraneous variables. Darley and Latané tested all their participants in the same room, exposed them to the same emergency situation, and so on. They also randomly assigned their participants to conditions so that the three groups would be similar to each other to begin with. Notice that although the words manipulation and control have similar meanings in everyday language, researchers make a clear distinction between them. They manipulate the independent variable by systematically changing its levels and control other variables by holding them constant.

Internal and External Validity

Internal validity.

Recall that the fact that two variables are statistically related does not necessarily mean that one causes the other. “Correlation does not imply causation.” For example, if it were the case that people who exercise regularly are happier than people who do not exercise regularly, this would not necessarily mean that exercising increases people’s happiness. It could mean instead that greater happiness causes people to exercise (the directionality problem) or that something like better physical health causes people to exercise and be happier (the third-variable problem).

The purpose of an experiment, however, is to show that two variables are statistically related and to do so in a way that supports the conclusion that the independent variable caused any observed differences in the dependent variable. The basic logic is this: If the researcher creates two or more highly similar conditions and then manipulates the independent variable to produce just one difference between them, then any later difference between the conditions must have been caused by the independent variable. For example, because the only difference between Darley and Latané’s conditions was the number of students that participants believed to be involved in the discussion, this must have been responsible for differences in helping between the conditions.

An empirical study is said to be high in internal validity if the way it was conducted supports the conclusion that the independent variable caused any observed differences in the dependent variable. Thus experiments are high in internal validity because the way they are conducted—with the manipulation of the independent variable and the control of extraneous variables—provides strong support for causal conclusions.

External Validity

At the same time, the way that experiments are conducted sometimes leads to a different kind of criticism. Specifically, the need to manipulate the independent variable and control extraneous variables means that experiments are often conducted under conditions that seem artificial or unlike “real life” (Stanovich, 2010). In many psychology experiments, the participants are all college undergraduates and come to a classroom or laboratory to fill out a series of paper-and-pencil questionnaires or to perform a carefully designed computerized task. Consider, for example, an experiment in which researcher Barbara Fredrickson and her colleagues had college students come to a laboratory on campus and complete a math test while wearing a swimsuit (Fredrickson, Roberts, Noll, Quinn, & Twenge, 1998). At first, this might seem silly. When will college students ever have to complete math tests in their swimsuits outside of this experiment?

The issue we are confronting is that of external validity. An empirical study is high in external validity if the way it was conducted supports generalizing the results to people and situations beyond those actually studied. As a general rule, studies are higher in external validity when the participants and the situation studied are similar to those that the researchers want to generalize to. Imagine, for example, that a group of researchers is interested in how shoppers in large grocery stores are affected by whether breakfast cereal is packaged in yellow or purple boxes. Their study would be high in external validity if they studied the decisions of ordinary people doing their weekly shopping in a real grocery store. If the shoppers bought much more cereal in purple boxes, the researchers would be fairly confident that this would be true for other shoppers in other stores. Their study would be relatively low in external validity, however, if they studied a sample of college students in a laboratory at a selective college who merely judged the appeal of various colors presented on a computer screen. If the students judged purple to be more appealing than yellow, the researchers would not be very confident that this is relevant to grocery shoppers’ cereal-buying decisions.

We should be careful, however, not to draw the blanket conclusion that experiments are low in external validity. One reason is that experiments need not seem artificial. Consider that Darley and Latané’s experiment provided a reasonably good simulation of a real emergency situation. Or consider field experiments that are conducted entirely outside the laboratory. In one such experiment, Robert Cialdini and his colleagues studied whether hotel guests choose to reuse their towels for a second day as opposed to having them washed as a way of conserving water and energy (Cialdini, 2005). These researchers manipulated the message on a card left in a large sample of hotel rooms. One version of the message emphasized showing respect for the environment, another emphasized that the hotel would donate a portion of their savings to an environmental cause, and a third emphasized that most hotel guests choose to reuse their towels. The result was that guests who received the message that most hotel guests choose to reuse their towels reused their own towels substantially more often than guests receiving either of the other two messages. Given the way they conducted their study, it seems very likely that their result would hold true for other guests in other hotels.

A second reason not to draw the blanket conclusion that experiments are low in external validity is that they are often conducted to learn about psychological processes that are likely to operate in a variety of people and situations. Let us return to the experiment by Fredrickson and colleagues. They found that the women in their study, but not the men, performed worse on the math test when they were wearing swimsuits. They argued that this was due to women’s greater tendency to objectify themselves—to think about themselves from the perspective of an outside observer—which diverts their attention away from other tasks. They argued, furthermore, that this process of self-objectification and its effect on attention is likely to operate in a variety of women and situations—even if none of them ever finds herself taking a math test in her swimsuit.

Manipulation of the Independent Variable

Again, to manipulate an independent variable means to change its level systematically so that different groups of participants are exposed to different levels of that variable, or the same group of participants is exposed to different levels at different times. For example, to see whether expressive writing affects people’s health, a researcher might instruct some participants to write about traumatic experiences and others to write about neutral experiences. The different levels of the independent variable are referred to as conditions , and researchers often give the conditions short descriptive names to make it easy to talk and write about them. In this case, the conditions might be called the “traumatic condition” and the “neutral condition.”

Notice that the manipulation of an independent variable must involve the active intervention of the researcher. Comparing groups of people who differ on the independent variable before the study begins is not the same as manipulating that variable. For example, a researcher who compares the health of people who already keep a journal with the health of people who do not keep a journal has not manipulated this variable and therefore not conducted an experiment. This is important because groups that already differ in one way at the beginning of a study are likely to differ in other ways too. For example, people who choose to keep journals might also be more conscientious, more introverted, or less stressed than people who do not. Therefore, any observed difference between the two groups in terms of their health might have been caused by whether or not they keep a journal, or it might have been caused by any of the other differences between people who do and do not keep journals. Thus the active manipulation of the independent variable is crucial for eliminating the third-variable problem.

Of course, there are many situations in which the independent variable cannot be manipulated for practical or ethical reasons and therefore an experiment is not possible. For example, whether or not people have a significant early illness experience cannot be manipulated, making it impossible to do an experiment on the effect of early illness experiences on the development of hypochondriasis. This does not mean it is impossible to study the relationship between early illness experiences and hypochondriasis—only that it must be done using nonexperimental approaches. We will discuss this in detail later in the book.

In many experiments, the independent variable is a construct that can only be manipulated indirectly. For example, a researcher might try to manipulate participants’ stress levels indirectly by telling some of them that they have five minutes to prepare a short speech that they will then have to give to an audience of other participants. In such situations, researchers often include a manipulation check in their procedure. A manipulation check is a separate measure of the construct the researcher is trying to manipulate. For example, researchers trying to manipulate participants’ stress levels might give them a paper-and-pencil stress questionnaire or take their blood pressure—perhaps right after the manipulation or at the end of the procedure—to verify that they successfully manipulated this variable.

Control of Extraneous Variables

An extraneous variable is anything that varies in the context of a study other than the independent and dependent variables. In an experiment on the effect of expressive writing on health, for example, extraneous variables would include participant variables (individual differences) such as their writing ability, their diet, and their shoe size. They would also include situation or task variables such as the time of day when participants write, whether they write by hand or on a computer, and the weather. Extraneous variables pose a problem because many of them are likely to have some effect on the dependent variable. For example, participants’ health will be affected by many things other than whether or not they engage in expressive writing. This can make it difficult to separate the effect of the independent variable from the effects of the extraneous variables, which is why it is important to control extraneous variables by holding them constant.

Extraneous Variables as “Noise”

Extraneous variables make it difficult to detect the effect of the independent variable in two ways. One is by adding variability or “noise” to the data. Imagine a simple experiment on the effect of mood (happy vs. sad) on the number of happy childhood events people are able to recall. Participants are put into a negative or positive mood (by showing them a happy or sad video clip) and then asked to recall as many happy childhood events as they can. The two leftmost columns of Table 6.1 “Hypothetical Noiseless Data and Realistic Noisy Data” show what the data might look like if there were no extraneous variables and the number of happy childhood events participants recalled was affected only by their moods. Every participant in the happy mood condition recalled exactly four happy childhood events, and every participant in the sad mood condition recalled exactly three. The effect of mood here is quite obvious. In reality, however, the data would probably look more like those in the two rightmost columns of Table 6.1 “Hypothetical Noiseless Data and Realistic Noisy Data” . Even in the happy mood condition, some participants would recall fewer happy memories because they have fewer to draw on, use less effective strategies, or are less motivated. And even in the sad mood condition, some participants would recall more happy childhood memories because they have more happy memories to draw on, they use more effective recall strategies, or they are more motivated. Although the mean difference between the two groups is the same as in the idealized data, this difference is much less obvious in the context of the greater variability in the data. Thus one reason researchers try to control extraneous variables is so their data look more like the idealized data in Table 6.1 “Hypothetical Noiseless Data and Realistic Noisy Data” , which makes the effect of the independent variable is easier to detect (although real data never look quite that good).

Table 6.1 Hypothetical Noiseless Data and Realistic Noisy Data

Idealized “noiseless” data		Realistic “noisy” data

4	3	3	1
4	3	6	3
4	3	2	4
4	3	4	0
4	3	5	5
4	3	2	7
4	3	3	2
4	3	1	5
4	3	6	1
4	3	8	2
= 4	= 3	= 4	= 3

One way to control extraneous variables is to hold them constant. This can mean holding situation or task variables constant by testing all participants in the same location, giving them identical instructions, treating them in the same way, and so on. It can also mean holding participant variables constant. For example, many studies of language limit participants to right-handed people, who generally have their language areas isolated in their left cerebral hemispheres. Left-handed people are more likely to have their language areas isolated in their right cerebral hemispheres or distributed across both hemispheres, which can change the way they process language and thereby add noise to the data.

In principle, researchers can control extraneous variables by limiting participants to one very specific category of person, such as 20-year-old, straight, female, right-handed, sophomore psychology majors. The obvious downside to this approach is that it would lower the external validity of the study—in particular, the extent to which the results can be generalized beyond the people actually studied. For example, it might be unclear whether results obtained with a sample of younger straight women would apply to older gay men. In many situations, the advantages of a diverse sample outweigh the reduction in noise achieved by a homogeneous one.

Extraneous Variables as Confounding Variables

The second way that extraneous variables can make it difficult to detect the effect of the independent variable is by becoming confounding variables. A confounding variable is an extraneous variable that differs on average across levels of the independent variable. For example, in almost all experiments, participants’ intelligence quotients (IQs) will be an extraneous variable. But as long as there are participants with lower and higher IQs at each level of the independent variable so that the average IQ is roughly equal, then this variation is probably acceptable (and may even be desirable). What would be bad, however, would be for participants at one level of the independent variable to have substantially lower IQs on average and participants at another level to have substantially higher IQs on average. In this case, IQ would be a confounding variable.

To confound means to confuse, and this is exactly what confounding variables do. Because they differ across conditions—just like the independent variable—they provide an alternative explanation for any observed difference in the dependent variable. Figure 6.1 “Hypothetical Results From a Study on the Effect of Mood on Memory” shows the results of a hypothetical study, in which participants in a positive mood condition scored higher on a memory task than participants in a negative mood condition. But if IQ is a confounding variable—with participants in the positive mood condition having higher IQs on average than participants in the negative mood condition—then it is unclear whether it was the positive moods or the higher IQs that caused participants in the first condition to score higher. One way to avoid confounding variables is by holding extraneous variables constant. For example, one could prevent IQ from becoming a confounding variable by limiting participants only to those with IQs of exactly 100. But this approach is not always desirable for reasons we have already discussed. A second and much more general approach—random assignment to conditions—will be discussed in detail shortly.

Figure 6.1 Hypothetical Results From a Study on the Effect of Mood on Memory

Because IQ also differs across conditions, it is a confounding variable.

Key Takeaways

An experiment is a type of empirical study that features the manipulation of an independent variable, the measurement of a dependent variable, and control of extraneous variables.
Studies are high in internal validity to the extent that the way they are conducted supports the conclusion that the independent variable caused any observed differences in the dependent variable. Experiments are generally high in internal validity because of the manipulation of the independent variable and control of extraneous variables.
Studies are high in external validity to the extent that the result can be generalized to people and situations beyond those actually studied. Although experiments can seem “artificial”—and low in external validity—it is important to consider whether the psychological processes under study are likely to operate in other people and situations.
Practice: List five variables that can be manipulated by the researcher in an experiment. List five variables that cannot be manipulated by the researcher in an experiment.

Practice: For each of the following topics, decide whether that topic could be studied using an experimental research design and explain why or why not.

Effect of parietal lobe damage on people’s ability to do basic arithmetic.
Effect of being clinically depressed on the number of close friendships people have.
Effect of group training on the social skills of teenagers with Asperger’s syndrome.
Effect of paying people to take an IQ test on their performance on that test.

Cialdini, R. (2005, April). Don’t throw in the towel: Use social influence research. APS Observer . Retrieved from http://www.psychologicalscience.org/observer/getArticle.cfm?id=1762 .

Fredrickson, B. L., Roberts, T.-A., Noll, S. M., Quinn, D. M., & Twenge, J. M. (1998). The swimsuit becomes you: Sex differences in self-objectification, restrained eating, and math performance. Journal of Personality and Social Psychology, 75 , 269–284.

Stanovich, K. E. (2010). How to think straight about psychology (9th ed.). Boston, MA: Allyn & Bacon.

Research Methods in Psychology. Provided by : University of Minnesota Libraries Publishing. Located at : http://open.lib.umn.edu/psychologyresearchmethods . License : CC BY-NC-SA: Attribution-NonCommercial-ShareAlike

NTRS - NASA Technical Reports Server

Available downloads, related records.

COMMENTS

Controlled experiments (article)
In general, a controlled experiment must always have a control group as a baseline. However, there may be several experimental groups, each with a slightly different treatment applied to it. ... However, experiments with more than one independent variable have to follow specific design guidelines, and the results must be analyzed using a ...
Guide to Experimental Design
Table of contents. Step 1: Define your variables. Step 2: Write your hypothesis. Step 3: Design your experimental treatments. Step 4: Assign your subjects to treatment groups. Step 5: Measure your dependent variable. Other interesting articles. Frequently asked questions about experiments.
4.14: Experiments and Hypotheses
First, scientific experiments must have an experimental group. This is the group that receives the experimental treatment necessary to address the hypothesis. ... Moreover, if the doctor knows which group a patient is in, this can also influence the results of the experiment. Without saying so directly, the doctor may show—through body ...
How To Write A Lab Report
Introduction. Your lab report introduction should set the scene for your experiment. One way to write your introduction is with a funnel (an inverted triangle) structure: Start with the broad, general research topic. Narrow your topic down your specific study focus. End with a clear research question.
Experiments and Hypotheses
Forming a Hypothesis. When conducting scientific experiments, researchers develop hypotheses to guide experimental design. A hypothesis is a suggested explanation that is both testable and falsifiable. You must be able to test your hypothesis, and it must be possible to prove your hypothesis true or false.
Biology and the scientific method review
A simple experiment should have only one independent variable. All other factors that could have an effect on the outcome of the experiment must be controlled or held constant. In addition, one group in the experiment should be a control group, a designated group used as a comparative reference point. This group will not have a manipulated ...
The scientific method and experimental design
A. The facts collected from an experiment are written in the form of a hypothesis. A hypothesis is the correct answer to a scientific question. B. A hypothesis is the correct answer to a scientific question. A hypothesis is a possible, testable explanation for a scientific question. C.
1.2: Science Experiments
An experiment is a special type of scientific investigation that is performed under controlled conditions, usually in a laboratory. An experiment generally tests how one variable is affected by another. The sample size in an experiment directly affects the interpretation of the results.
Experimental Design in Science
Rule #2: The results of an experiment must be measurable and objective. Scientists use standard units to measure different properties like length, time, volume, mass, and speed.
1.6: Scientific Experiments
This page titled 1.6: Scientific Experiments is shared under a CK-12 license and was authored, remixed, and/or curated by Suzanne Wakim & Mandeep Grewal via source content that was edited to the style and standards of the LibreTexts platform. An experiment is a special type of scientific investigation that is performed under controlled ...
Experiment
An experiment must also control the possible confounding factors—any factors that would mar the accuracy or repeatability of the experiment or the ability to interpret the results. Confounding is commonly eliminated through scientific controls and/or, in randomized experiments , through random assignment .
The Scientific Method Tutorial
The experiment must be conducted on a group of subjects that are narrowly defined and have certain aspects in common. ... it must be a statement of reality, based upon the results of the experiment. To assure that this is the case, the conclusions made in an experiment must (1) relate back to the hypothesis being tested, (2) be limited to the ...
What Is a Controlled Experiment?
Controlled experiments have disadvantages when it comes to external validity—the extent to which your results can be generalized to broad populations and settings. The more controlled your experiment is, the less it resembles real world contexts. That makes it harder to apply your findings outside of a controlled setting.
What Are Constants & Controls of a Science Project Experiment?
What Are Constants & Controls of a Science Project Experiment? The scientific method involves asking a question, doing research, forming a hypothesis and testing the hypothesis via an experiment, so that the results can be analyzed. Every successful science experiment must include specific types of variables.
The Basics of an Experiment
An experiment is a procedure designed to test a hypothesis as part of the scientific method. The two key variables in any experiment are the independent and dependent variables. The independent variable is controlled or changed to test its effects on the dependent variable. Three key types of experiments are controlled experiments, field ...
Using A Fair Test in A Science Experiment or Project
Conducting a fair test is one of the most important ingredients of doing good, scientifically valuable experiments. To insure that your experiment is a fair test, you must change only one factor at a time while keeping all other conditions the same . Scientists call the changing factors in an experiment variables .
Steps of the Scientific Method
The six steps of the scientific method include: 1) asking a question about something you observe, 2) doing background research to learn what is already known about the topic, 3) constructing a hypothesis, 4) experimenting to test the hypothesis, 5) analyzing the data from the experiment and drawing conclusions, and 6) communicating the results ...
Validity, Accuracy and Reliability: A Comprehensive Guide
The reliability of results cannot be assessed if there is only one data point or measurement obtained in the experiment. There must be at least 3. When you're repeating the experiment to assess the reliability of its results, you must follow the same steps, use the same value for the independent variable. Results obtained from methods with ...
The scientific method (article)
The scientific method. At the core of biology and other sciences lies a problem-solving approach called the scientific method. The scientific method has five basic steps, plus one feedback step: Make an observation. Ask a question. Form a hypothesis, or testable explanation. Make a prediction based on the hypothesis.
6.1 Experiment Basics
Experiments have two fundamental features. The first is that the researchers manipulate, or systematically vary, the level of the independent variable. The different levels of the independent variable are called conditions. For example, in Darley and Latané's experiment, the independent variable was the number of witnesses that participants ...
Why Should Scientific Results Be Reproducible?
Reproducing experiments is one of the cornerstones of the scientific process. Here's why it's so important. Since 2005, when Stanford University professor John Ioannidis published his paper "Why ...
Why Precision, Accuracy, and Validity Are Vital in Research
When a scientist performs an experiment, they must be able to reproduce the same results every time. If they manage to reproduce the same or minimally different results in a series of tests, the results are precise. A lack of precision can lead to imperceptible random errors that yield widespread results.
Intro To BIO Lab Midterm Flashcards
Which rule does NOT need to be followed during an experiment for results to be valid? The experiment must be preformed in a Lab. ... (1 kg = 2.20462 lbs) 11lbs. The measurable components of an experiment are called. dependent. T/F If you spill a base on the floor, you should use baking soda to neutralize the bas before clean up.
NTRS
The reliance on CFD simulations to develop, design, and optimize scramjet systems (or components) has become commonplace. This reliance inevitably hinges on the ability of the computational analyst to quantify the level of confidence in their computational results. Unfortunately, nearly all the measured data available for this assessment comes from antiquated experimental datasets, or from ...