Internal Assessment

Home
Contact Information
Course Objectives
Course Content
Summer Assignment
Syllabus
Psychology Websites
Assignment Activities
PsychSim 4.0
Internal Assessment
Exams


INTERNAL ASSESSMENT GUIDELINES

Your Guide to Writing the IA in Psychology

(with thanks to the Crane website)

The following paragraphs indicate the information that candidates should include under each heading of the report. See below for ideas on topics for the IA.

Title Page

The title page should give clear indication of the experimental method and the specific topic of study. The hypothesis should determine how the title is constructed. The student's candidate number, the date, the instructor's name, the level of the course (HL or SL) and the final word count should all be clearly indicated on the title page.

Table of Contents

A table of contents must be included. All pages of the study should be numbered, including the appendices.

Abstract

The abstract contains a summary of important information about the study. It should summarize the aims, methods, and results of the study. It should also clearly state the conclusion drawn. It must not exceed 200 words.

Introduction

The introduction provides the background and justification for the research study. At SL, this sections includes the study that is being replicated and the aim of the study at hand. At HL, the introduction is longer and includes a more thorough review of literature related to the study. Candidates should use this section to justify the prediction that they are making in their research hypothesis. This section should move from broad concepts to more specific studies that are directly related to the current study. Be sure to define all important terminology in this section as relevant to your study.

This section should end in a clearly defined research question/aim (SL) or operationalized research and null hypothesis (HL).

The introduction should follow the order below:

  • A general introduction to the psychological subject area under investigation. Include an indication of the perspective that is being discussed.
  • A summary of the key theories and research studies. This must include proper reference, for example, Zajonc (1965). Candidates at SL must cite one reference, and at least three are recommended for HL. SL, you are simply to summarize the aim, procedure, and findings of the study that you are replicating.
  • A rationale and justification for the study.
  • The aim (HL and SL) and hypotheses (HL only).

Aim (HL and SL)

The aim of the study is a statement about what is being investigated and what is expected. It is less precise than an operationalized research hypothesis, for example: The aim of this study is to investigate how the use of category headings affects the number of words that binlingual students at the International School in Prague can recall.

Hypotheses (HL only)

Research Hypothesis: The research hypothesis must be a clear, concise prediction of what is expected to be demonstrated in the experiment. This must be operationalized: that is, it must be evident how the variables will be quantified, and may be either one- or two-tailed.

Null Hypothesis: The null hypothesis states that no significant difference is expected to be found between the groups on the measure of the dependent variable, and that any difference found is due to random variables. Candidates should make it clear that they understand that it is not the opposite of the research hypothesis.

Method

This section must be subdivided into four parts: design, participants, materials, procedures.

Design

Candidates must state the experimental method used, give details of the type of design (for example, independent samples, repeated measures), and explain and justify why this method and design were chosen. They must identify and explain any controls that were used, and address ethical considerations. Independent and dependent variables must be clearly identified.

Participants

A sample of 15 - 20 participants is recommended. The characteristics of the general population being sampled should be identified, in terms of, for example, the number of participants, age, gender distribution, even if some variables are not under investigation. Selection and allocation procedures must be identified and justified. If sampling is not done randomly, this must be clearly justified.

Materials

This section may be a list of materials developed for use in the experiment. Basic materials such as paper, pencils, chairs, tables - need not be listed. Any materials that were specifically developed for the experiment should be listed and referenced to a sample copy included in the appendices. The full text of, for example, standardized instructions, informed consent, and debriefing notes should be included in the appendices.

Procedures

Candidates must carefully and accurately describe how they carried out the experiment. This should be done in chronological order beginning with how materials were developed. Enough detail should be provided so that another researcher could replicate the experiment. It must include consent forms and debriefing. The sampling technique does not need to be re-described.

Results

This section includes numerical and graphical reporting of the data collected by the candidate. The results must be stated in narrative form and in graphical form. The data should be reported in a away that reflects the claims made in the aims and hypothesis.

The use of descriptive statistics is required at both levels. Candidates should use the descriptive statistics that best suit their study. Ideally, candidates will measure both the central tendency and dispersion as appropriate. Raw data should not be included here, but allocated to an appendix.

Graphs and tables may be drawn on the computer. One graph is usually sufficient. Computers can create many different graphs, but candidates should be advised against producing irrelevant graphs. Candidates must not include graphs that show each individual participant's score.

At higher level, inferential statistical analysis of the results is included in this section. Candidates must justify the use of the inferential statistical test chosen. Any calculations should be allocated to an appendix.

Discussion

The purpose of this section is to discuss the following:

  • the results of the current study and its relation to the studies cited in the introduction.
  • the strengths and limitations of the methodology used.
  • any relevant modifications and areas of further investigation.
  • an informed conclusion.

This section allows candidates to interpret their own results in the lights of previous research. They must relate their findings to theories or studies referred to in the introduction. No new studies or citations should be introduced.

Candidates should analyze and evaluate their own methodology. They should discuss any flaws or limitations that may have affected the outcome of the experiment. The strongest reports will identify possible confounding variables that may have influenced the study and not rely on a simplistic evaluation such as "the experimental study should have used a larger sample." Modifications that would remedy any limitations should also be included.

During the course of the experiment, candidates may come across unusual results or related topics that may be interesting to investigate experimentally. Candidates should make special note of any such thoughts that arose during the study.

Finally, a brief conclusion should be presented which summarizes the results of the experiment.

References

In this section, candidates must include a complete set of references to the works cited in the study. An approved reference format must be used. Remember not to number your sources and be sure to alphabetize them.

Appendices

In this section, candidates must include blank copies of any supplementary information. This section provides all the materials necessary to allow the experiment to be replicated. Tables of raw data must be included. Do not include all the participants' filled in materials - one blank copy is sufficient. Lastly, be sure to label each appendix appropriately.

 

Ideas for Internal Assessment

with thanks to Ian Campbell

Important Ethical Issues:

  • You may not do any conformity studies
  • All animal experiments are forbidden.
  • No stereotyping practical to do with race, religion, gender or social class is allowed.
  • Quasi experiments are out - e.g. where the IV is age/race/gender/class/ability
  • Informed consent must always be obtained, and debriefing must be carried out.

SL candidates, please note that your study is supposed to be based on a previous experiment carried out by psychologists (i.e. a replication). HL candidates must design an experiment and not simply replicate one. All plans for experiments should be approved by the teacher.

Here is a list of sample IA topics.

A. Memory

  1. Memory and the serial position curve: Cunitz and Glanzer's study. Participants learn a long list of words and after a delay they have to recall as many as possible. The hypothesis is that people tend to remember the first and last words in a list due to the primacy and recency effects.
  2. Memory and acoustic learning: There is a theory that when we have to learn something like a telephone number that we store the number in the form of the mental sound that it makes i.e. acoustically. Participants have to learn lists of letters and then write them down after a delay. According to the acoustic coding theory participants will have more difficulty recalling letters which sound similarcompared to a list of words which sound quite different.
  3. Improving memory: Imagery vs rehearsal: participants recall more words from a (20) word list when they use an imagery method (forming a vivid mental image and linking each item to the last in a dynamic fashion) than if they use either rehearsal (repeat each item until you hear the next) or no particular method (no prior instruction). Bower (1967); Paivio (1971).
  4. The Role of Ambiguity: Give participants an ambiguous passage (which could mean anything) which they later have to recall. Some participants are given a title to the passage which makes it sensible while other participants are not. Those with the title should process the passage more meaningfully and therefore recall the passage more successfully.
  5. Memory and levels of processing: Craik and Lockhart hypothesise that the deeper and more meaningfully we process information the better subsequent recall will be. Participants are asked to process words either at a basic structural level like 'is the word in capitals?' or at a level requiring the comprehension of meaning e.g. 'is it something you can eat'? Participants would be expected to recall those words processed more deeply more successfully. Craik and Tulving (1975).
  6. Organization and memory: If word lists are organised in some meaningful way, will participants recall better than from a jumbled list? 
  7. Memory interference: This could be nicely applied to school revision. Participants have to learn for example a list of words and then recall them. Memory is interfered with by learning another list of words; some participants learn this interfering list before the main list and some learn it after the main list to see which has the greater effect.
  8. Eye-witness reports: Loftus and Palmer (1974); Loftus and Zanni (1975). Participants asked how fast cars were going when they ‘smashed’ into each other, after viewing a car accident, report greater speeds than do participants asked the speed when they ‘hit’ each other. The former group are more likely to report seeing broken glass (when none is there) a week later.
  9. Does the time of day influence a person's ability to recall words?
  10. Does background noise impair memory?

B. Perception, Thinking and Performance

  1. Perceptual Sets: (a) This one is based on the idea that people have a very fixed association between colour and taste and attempts to find out how strong this is. It involves giving participants coloured drinks but in some conditions the colour doesn't match up with the flavour (e.g. red coloured mint). By measuring how long it takes for participants to judge the flavour will tell us how strong the associations are. b. This is based on the hypothesis that peoples' perception of colour depends on what they associate that colour with. For example, people associate tomatoes with being red so might perceive a stronger red colour than say a red hat which doesn't have the same associations. This practical involves showing participants pictures of fruit but with some of the colours mixed up. For example, participants are shown a picture of a red tomato and then a red banana and later have to judge the colours of each on a colour chart. c. Solving lists of anagrams is easier if all the words belong to a category (e.g. animals) than if they are random words. d. It has been suggested that people perceive the size of coins depending on how they perceive its value. Specifically children and people from low income families are thought to overestimate the size of coins. This can be designed easily enough by showing subjects different coins and asking them to later to judge their size using a comparison chart of different sized circles. e.  People are first predisposed to think about a particular set of objects e.g. fruit or letters of the alphabet, by showing them pictures. Then an ambiguous picture is flashed to them which could be a banana or a letter 'C' for example. The hypothesis is that they will label the object according to the set of objects they saw previously. g. The participant is presented briefly with a list of words about a topic e.g. letter, post, stamp etc. which they have to write down - and then one of them is misspelled e.g. mael, and the hypothesis is that because a strong mental concept of the topic has been set up that they will write down the word as 'mail'.
  2. Stroop effect: Participants take a lot longer to name the colour of ink that words are written in when the words themselves are contradictory colour words e.g. ‘red’ written in yellow ink – Dyer (1973).
  3. Word and letter recognition: Visual search: Time taken to find X’s hidden in a four column list of similar shaped letters (Y, Z etc.) is longer than for lists with letters such as S, R, P etc. – Neisser’s (1964) feature analysis model of pattern recognition. Alternatively: Participants will take longer to find 0 among letters if it is called tzero’ than when it is called letter ‘oh’ and vice versa – Jonides & Gleitman (1972).
  4. Estimation of time: Effects of mental activity on the estimation of time: Ornstein hypothesised that the more mental activity we do in a fixed period of time the longer the subjective estimate of that time interval. This could be tested by asking participants to listen to a passage of prose for say 100 seconds and then to estimate when a further 100 seconds of the prose had elapsed. In one group the prose could be read slowly to reduce the amount of mental processing and in the second group the prose would be read out faster.
  5. Is performance is impeded by noise? Participants have a task to performwith or without noise.
  6. Illusions: This is based on the Muller-Lyer illusion: By re-designing the illusion it is possible to test the hypothesis of constancy scaling put forward by Richard Gregory that the reason the illusion works is because it has hidden depth cues. The practical involves showing participants different forms of the illusion to find out when it is most powerful.
  7. Concrete vs abstract reasoning (problem solving) and concept formation: When it comes to logical problem solving people are typically very poor and tend just to look for examples which confirm their theory rather than looking for possible exceptions which might disprove it. This practical is based on giving participants seemingly simple tests of logic to see what errors they make. However, the hypothesis being tested is that if the logic problems are purely abstract (e.g. if x is always double y and y is never an even number) then many more mistakes will be made compared to more concrete problems (e.g., if worms are always animals and animals are sometimes birds) even if the basic logical pathway is the same. A similar alternative is as follows: The Wason (1969) selection task – participants are shown four cards displaying B E 3 and 8. They are given the rule ‘every consonant has an even number on the other side’. Participants have to decide which two cards to turn over in order to check the validity of the rule. They tend to demonstrate false logical reasoning and assume that the B and 8 cards must be turned over whereas B and 3 are correct. Griggs and Cox (1982) used more realistic material with ‘beer’ or ‘Pepsi’ on one side and an age on the other. To check the rule ‘only 18 year olds and above may drink’, and given cards showing ‘beer’, ‘Pepsi’, ‘16’ and ‘20’, participants do much better than on the abstract task. Similarly, participants might be asked to check the rule ‘all first class envelopes are sealed’ given sealed unsealed, first and second class stamped envelopes. Frequencies of correct response can be compared with those for the abstract Wason task using chi–squared.
  8. Heuristics: Tversky and Kahneman’s (1973) ‘availability’ hypothesis. If people recall more items from one set than from another they assume (heuristically) that there actually were more in the former set. Demonstrate this by giving participants a set of names to remember containing 19 very famous males and 20 not so famous females. Since participants tend to recall more male names they tend to judge that more males were in the list.
  9. Anchoring Bias - Tversky and Kahneman. Someone's estimates of something will be greatly influenced by the way the question is structured. For example, people asked to estimate 1x2x3x4x5x6x7x8x9 give lower estimates than those estimating 9x8x7x6x5x4x3x2x1 because the earliest numbers affect perception of the answer. Similarly, if subjects are asked to guess the length of the river Vltava, their estimates will be influenced by a preceding question "is 2000 km (or 20,000km in the other condition) an over or underestimate of the length of the river Vltava". (Kahnemann and Tversky (1973) and Northcraft and Neale (1987)
  10. First impressions and Primacy effects: ask participants to score a person’s maths test and then estimate the overall score on that person on Maths ability: Group A have correct answers first, then incorrect. Group B have incorrect answers first, then correct. Group A will estimate a higher score than Group B.

C. Social Psychology

  1. Person perception: Versions of Asch’s (1946) ‘warm’, ‘cold’ central traits paradigm can be implemented in many topical ways. Candidates give one description of a person to one group of participants and an identical version to another group varying only one characteristic, for instance ‘agrees with nuclear testing’ for one group and ‘disagrees....’ for the other. They then ask participants to assess the fictitious person on, say, liking or trustworthiness on a 10 point scale and look for differences between groups. Asch’s ‘primacy’ effect can also be tested using a list of descriptors – e.g. ‘orderly entertaining humble cool calculating moody’ in that order for one group and in the opposite order for another. Those hearing the positive traits first might rate the person more favourably on a ten point scale – Anderson and Barrios (1961). Luria and Rubin (1974) – participants given the same picture of a baby but one group told it is male the other female. Record differences in descriptions. It is best to give a checklist to participants containing ‘typical’ masculine and feminine traits – fine featured, strong, robust, sensitive, cute, and delicate.
  2. Person perception and first impressions: A two paragraph description or story of someone is written with one paragraph being more positive than the other. Paragraph order can be reversed and subjects get one of the two possible orders. Subjects then have to rate the person on a questionnaire. The hypothesis is that due to first impressions the subjects who hear the story with the first paragraph being the positive one will rate the person more positively. E.g. Story of Jim with the first half suggesting he is friendly and the second half suggestion he is unfriendly. Different subjects receive different orders of the story.
  3. Social facilitation: The idea is that people tend to perform better when in groups than when on their own. Subjects can be given tasks (e.g. word searches) either in groups or on their own to test this theory.
  4. Snyder and Uranowitz (1978) presented two versions of a woman's life to two different groups. One group was later told that she took up a lesbian life-style (avoid this specific example); for the other group, she married. From the statement in the description 'although she never had a steady boyfriend she did go out on dates', the former group were more likely to recall the first part of this statement and the second group did the reverse.
  5. The halo effect: The effects of physical attractiveness: The halo effect states that attractive people are perceived as having more positive attributes.
  6. Social inference : Do people over-estimate the number of beads in a jar if they see a list of other peoples "over-estimates?" i.e. do they base their estimates on other peoples' views?
  7. Defensive attribution: The more serious an accident appears, the more people wish to assign responsibility to the driver.

D. Drives and Motivation

  1. Incentives and performance: It would seem logical that incentives should improve our performance. This could be tested by asking subjects to perform simple tasks like anagrams and measuring their speed of performance under different conditions. e.g. with or without an incentive like a Mars bar or alternatively by creating a fear of failing - to see whether positive or negative incentives are the most effective (e.g. telling the subjects that the results  will be put on display and that the average number of completed anagrams in 5 minutes by 8 year olds is 15!) This could be developed to find whether the fear of failure impedes performance most on more complex cognitive tasks like anagrams rather than simple memory recall type tasks. Alternatively you could test the hypothesis that a group of subjects will perform better on a task if they have previously (based on an earlier test) been told they have scored highly, than a group who have been told (falsely) they have performed badly.
 

 

 


Home | Contact Information | Course Objectives | Course Content | Summer Assignment | Syllabus | Psychology Websites | Assignment Activities | PsychSim 4.0 | Internal Assessment | Exams

 
Last updated: 04/25/08.