Publication Date
  In 2025: 0
  Since 2024: 0
  Since 2021 (last 5 years): 2
  Since 2016 (last 10 years): 4
  Since 2006 (last 20 years): 8
Descriptor
  Probability: 14
  Scores: 14
  Test Reliability: 14
  Multiple Choice Tests: 5
  Statistical Analysis: 5
  Test Construction: 4
  Test Validity: 4
  Classification: 3
  College Students: 3
  Correlation: 3
  Criterion Referenced Tests: 3
  …
Publication Type
  Reports - Research: 11
  Journal Articles: 8
  Collected Works - Proceedings: 1
  Reports - Evaluative: 1
  Speeches/Meeting Papers: 1
Education Level
  Higher Education: 5
  Postsecondary Education: 4
  Elementary Secondary Education: 1
  Secondary Education: 1
Location
  Asia: 1
  Australia: 1
  Brazil: 1
  Colorado: 1
  Connecticut: 1
  Denmark: 1
  Egypt: 1
  Estonia: 1
  Florida: 1
  Germany: 1
  Greece: 1
  …
Assessments and Surveys
  SAT (College Admission Test): 1
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022
The reliability of a test score is usually underestimated, and the deflation may be profound: 0.40–0.60 units of reliability, or 46–71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…
Descriptors: Test Reliability, Scores, Test Items, Correlation
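The deflation mechanism is easy to reproduce in miniature. Below is a minimal sketch (not the article's 1,440-dataset simulation; the unequal-loadings setup and all parameter values are assumptions) showing coefficient alpha underestimating the true reliability of a sum score when items violate essential tau-equivalence, one family of measurement-modelling errors the article discusses:

```python
import numpy as np

rng = np.random.default_rng(0)
n, k = 500, 10
theta = rng.normal(size=n)                           # latent ability
lam = np.linspace(0.3, 0.9, k)                       # unequal loadings: tau-equivalence violated
x = theta[:, None] * lam + rng.normal(size=(n, k))   # item scores, unit error variance

# True reliability of the sum score under this generating model
true_rel = lam.sum() ** 2 / (lam.sum() ** 2 + k)

# Cronbach's alpha from the sample covariance matrix
c = np.cov(x, rowvar=False)
alpha = (k / (k - 1)) * (1 - np.trace(c) / c.sum())

print(f"true reliability: {true_rel:.3f}, alpha estimate: {alpha:.3f}")
```

The gap in this toy case is modest; the article's point is that several such sources compound.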
Anderson, Darcie L.; Hooks, Tisha – Journal of College Student Retention: Research, Theory & Practice, 2022
With limited budgets and increasing enrollment demands, colleges need fast, free, and practical solutions that support academic success and retention. The Academic Reality Check (ARC) tool quickly predicts traditional freshmen's awareness of their own academic expectations in college, supporting the financial investment being made by all…
Descriptors: College Freshmen, Expectation, Predictor Variables, Academic Achievement
Kalinowski, Steven T.; Willoughby, Shannon – Journal of Research in Science Teaching, 2019
We present a multiple-choice test, the Montana State University Formal Reasoning Test (FORT), to assess college students' scientific reasoning ability. The test defines scientific reasoning to be equivalent to formal operational reasoning. It contains 20 questions divided evenly among five types of problems: control of variables, hypothesis…
Descriptors: Science Tests, Test Construction, Science Instruction, Introductory Courses
Wells, Kevin Eugene; Morgan, Grant; Worrell, Frank C.; Sumnall, Harry; McKay, Michael Thomas – International Journal of Behavioral Development, 2018
The goal of the present study is to examine the stability of time attitudes profiles across a one-year period as well as the association between time attitudes profiles and several variables. These variables include attitudes towards alcohol, context of alcohol use, consumption of a full drink, and subjective life expectancy. We assessed the…
Descriptors: Time, Attitude Measures, Drinking, Context Effect
Sadaghiani, Homeyra R.; Pollock, Steven J. – Physical Review Special Topics - Physics Education Research, 2015
As part of an ongoing investigation of students' learning in first semester upper-division quantum mechanics, we needed a high-quality conceptual assessment instrument for comparing outcomes of different curricular approaches. The process of developing such a tool started with converting a preliminary version of a 14-item open-ended quantum…
Descriptors: Science Instruction, Quantum Mechanics, Mechanics (Physics), Multiple Choice Tests
Stewart, Jeffrey; White, David A. – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2011
Multiple-choice tests such as the Vocabulary Levels Test (VLT) are often viewed as a preferable estimator of vocabulary knowledge when compared to yes/no checklists, because self-reporting tests introduce the possibility of students overreporting or underreporting scores. However, multiple-choice tests have their own unique disadvantages. It has…
Descriptors: Guessing (Tests), Scoring Formulas, Multiple Choice Tests, Test Reliability
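For orientation, the standard correction-for-guessing scoring formula (textbook classical test theory, not a formula quoted from this article) penalizes wrong answers on k-option items:

\[
X_c = R - \frac{W}{k - 1},
\]

where $R$ is the number right and $W$ the number wrong; with $k = 4$, $R = 30$, and $W = 12$, the corrected score is $X_c = 30 - 12/3 = 26$.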
Muller, Jorg M. – Educational and Psychological Measurement, 2006
A new test index, the probability of obtaining two randomly selected test scores as statistically different (PDTS), is defined. After a conceptual definition of the test index, two simulation studies are presented. The first analyzes the influence of the distribution of test scores, test reliability, and sample size on PDTS within classical…
Descriptors: Test Reliability, Probability, Scores, Item Response Theory
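Under textbook classical test theory assumptions, a back-of-the-envelope version of such an index has a closed form (an illustration, which may differ from Muller's exact definition). If two observed scores are independent draws from $N(\mu, \sigma_X^2)$ and a difference counts as significant when it exceeds $z_{\alpha/2}\sqrt{2}\,\mathrm{SEM}$ with $\mathrm{SEM} = \sigma_X\sqrt{1-\rho_{XX'}}$, then

\[
\mathrm{PDTS} = P\left(|X_1 - X_2| > z_{\alpha/2}\sqrt{2}\,\mathrm{SEM}\right) = 2\left[1 - \Phi\left(z_{\alpha/2}\sqrt{1-\rho_{XX'}}\right)\right],
\]

so with $\rho_{XX'} = .90$ and $\alpha = .05$, $\mathrm{PDTS} \approx 2[1 - \Phi(0.62)] \approx .54$.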
Wilcox, Rand R. – Educational and Psychological Measurement, 1979
The classical estimate of a binomial probability function is to estimate its mean in the usual manner and substitute the result into the appropriate expression. Two alternative estimation procedures are described and examined. Emphasis is given to the single-administration estimate of mastery test reliability. (Author/CTM)
Descriptors: Cutting Scores, Mastery Tests, Probability, Scores
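A minimal sketch of the classical plug-in estimate the abstract describes (illustrative numbers; Wilcox's alternative estimators are not shown):

```python
from math import comb

def pass_probability(p: float, n: int, cut: int) -> float:
    """P(score >= cut) on an n-item test under the binomial error model,
    given true proportion-correct p."""
    return sum(comb(n, x) * p**x * (1 - p)**(n - x) for x in range(cut, n + 1))

# Classical approach: estimate p by the observed proportion correct,
# then substitute it into the binomial expression.
observed_correct, n_items, cut_score = 17, 20, 15
p_hat = observed_correct / n_items
print(f"plug-in estimate of P(pass): {pass_probability(p_hat, n_items, cut_score):.3f}")
```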
Hansen, Richard – Journal of Educational Measurement, 1971
The relationship between certain personality variables and the degree to which examinees display certainty in their responses was investigated. (Author)
Descriptors: Guessing (Tests), Individual Characteristics, Multiple Choice Tests, Personality Assessment
Dimitrov, Dimiter M. – 1996
A Monte Carlo approach is proposed, using the Statistical Analysis System (SAS) programming language, for estimating reliability coefficients in generalizability theory studies. Test scores are generated by a probabilistic model that considers the probability for a person with a given ability score to answer an item with a given difficulty…
Descriptors: Classification, Criterion Referenced Tests, Cutting Scores, Estimation (Mathematics)
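The paper implements its simulation in SAS; the generating step can be paraphrased in Python (a sketch with assumed Rasch-type response probabilities and made-up dimensions, summarizing each replication with coefficient alpha rather than a generalizability coefficient):

```python
import numpy as np

rng = np.random.default_rng(1)
n_persons, n_items, n_reps = 1000, 30, 100

ability = rng.normal(0, 1, n_persons)      # person ability scores
difficulty = rng.normal(0, 1, n_items)     # item difficulties

# Rasch-type probability that each person answers each item correctly
p = 1 / (1 + np.exp(-(ability[:, None] - difficulty[None, :])))

alphas = []
for _ in range(n_reps):
    # Draw dichotomous responses and estimate reliability for this replication
    x = (rng.random((n_persons, n_items)) < p).astype(float)
    c = np.cov(x, rowvar=False)
    alphas.append((n_items / (n_items - 1)) * (1 - np.trace(c) / c.sum()))

print(f"Monte Carlo mean reliability over {n_reps} replications: {np.mean(alphas):.3f}")
```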
Livingston, Samuel A. – 1976
A distinction is made between reliability of measurement and reliability of classification; the "criterion-referenced reliability coefficient" describes the former. Application of this coefficient to the probability distribution of possible scores for a single student yields a meaningful way to describe the reliability of a single score. (Author)
Descriptors: Classification, Criterion Referenced Tests, Error of Measurement, Measurement
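The coefficient in question is presumably Livingston's earlier (1972) criterion-referenced reliability coefficient (cited here as background, an assumption about which formula this paper applies):

\[
k^2(X, T_X) = \frac{\rho_{XX'}\,\sigma_X^2 + (\mu_X - C)^2}{\sigma_X^2 + (\mu_X - C)^2},
\]

where $\rho_{XX'}$ is conventional reliability and $C$ the cutting score; it reduces to $\rho_{XX'}$ when $C = \mu_X$ and approaches 1 as the cut moves away from the mean.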
Levine, Michael V.; Rubin, Donald B. – 1976
Appropriateness indexes (statistical formulas) for detecting suspiciously high or low scores on aptitude tests were presented, based on a simulation of the Scholastic Aptitude Test (SAT) with 3,000 simulated scores: 2,800 normal and 200 suspicious. The traditional index, marginal probability, uses a model for the normal examinee's test-taking…
Descriptors: Academic Ability, Aptitude Tests, College Entrance Examinations, High Schools
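The core of a likelihood-based appropriateness index fits in a few lines (a Rasch-based toy, not the SAT simulation or the exact indexes studied; all values are made up):

```python
import numpy as np

def pattern_log_likelihood(responses, ability, difficulty):
    """Log-likelihood of a 0/1 response pattern under a Rasch model;
    unusually low values flag a pattern as suspicious."""
    p = 1 / (1 + np.exp(-(ability - difficulty)))
    return float(np.sum(responses * np.log(p) + (1 - responses) * np.log(1 - p)))

difficulty = np.linspace(-2, 2, 20)
consistent = (difficulty < 0.5).astype(float)  # correct on the easier items
aberrant = (difficulty > 0.5).astype(float)    # correct only on the hardest items
print(pattern_log_likelihood(consistent, 0.5, difficulty))  # higher (plausible)
print(pattern_log_likelihood(aberrant, 0.5, difficulty))    # much lower (suspicious)
```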
Millman, Jason – 1974
This chapter should not only acquaint the reader with the present state of the art on Criterion-Referenced (CR) measurement but also suggest possible directions for further inquiry. The goal of the first part of this chapter is to deal with the definitional dilemma of CR measurement by proceeding from the more traditional view of CR measurement to…
Descriptors: Analysis of Variance, Bayesian Statistics, Behavioral Objectives, Comparative Analysis
International Association for Development of the Information Society, 2012
The IADIS CELDA 2012 conference set out to address the main issues of evolving learning processes and of supporting pedagogies and applications in the digital age. Advances in both cognitive psychology and computing have affected the educational arena. The convergence of these two disciplines is increasing at a…
Descriptors: Academic Achievement, Academic Persistence, Academic Support Services, Access to Computers