ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	2

Descriptor

Statistical Distributions	14
Test Format	14
Scores	6
Equated Scores	4
Item Banks	4
Test Construction	4
Achievement Tests	3
Item Response Theory	3
Adaptive Testing	2
Elementary School Students	2
Elementary Secondary Education	2
Equations (Mathematics)	2
Error of Measurement	2
Estimation (Mathematics)	2
Foreign Countries	2
Grade 4	2
Sampling	2
Scoring	2
Test Content	2
Test Items	2
Test Reliability	2
Test Results	2
Testing Problems	2
Testing Programs	2
Academic Achievement	1
More ▼

Source

Applied Psychological…	3
Applied Measurement in…	1
International Journal of…	1
Journal of Educational and…	1
Journal of Marital and Family…	1
Psychometrika	1

Publication Type

Reports - Evaluative	10
Journal Articles	8
Reports - Research	4
Speeches/Meeting Papers	4

Education Level

Elementary Education	1
Grade 2	1
Grade 4	1

Audience

Researchers

Location

Canada	1
Germany	1

Laws, Policies, & Programs

Assessments and Surveys

Family Adaptability Cohesion…	1
Law School Admission Test	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Assessment of Person Fit for Mixed-Format Tests

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015

Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the ?2 statistic, both of which have been used for tests…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics

Is the Male Advantage in Mental-Rotation Performance Task Independent? On the Usability of Chronometric Tests and Paper-and-Pencil Tests in Children

Peer reviewed

Direct link

Quaiser-Pohl, Claudia; Neuburger, Sarah; Heil, Martin; Jansen, Petra; Schmelter, Andrea – International Journal of Testing, 2014

This article presents a reanalysis of the data of 862 second and fourth graders collected in two previous studies, focusing on the influence of method (psychometric vs. chronometric) and stimulus type on the gender difference in mental-rotation accuracy. The children had to solve mental-rotation tasks with animal pictures, letters, or cube…

Descriptors: Foreign Countries, Gender Differences, Accuracy, Age Differences

Testing for Differences in Test Score Distributions Using Loglinear Models.

Peer reviewed

Hanson, Bradley A. – Applied Measurement in Education, 1996

Determining whether score distributions differ on two or more test forms administered to samples of examinees from a single population is explored using three statistical tests using loglinear models. Examples are presented of applying tests of distribution differences to decide if equating is needed for alternative forms of a test. (SLD)

Descriptors: Equated Scores, Scoring, Statistical Distributions, Test Format

An Investigation of the Sampling Distributions of Equating Coefficients.

Peer reviewed

Baker, Frank B. – Applied Psychological Measurement, 1996

Using the characteristic curve method for dichotomously scored test items, the sampling distributions of equating coefficients were examined. Simulations indicate that for the equating conditions studied, the sampling distributions of the equating coefficients appear to have acceptable characteristics, suggesting confidence in the values obtained…

Descriptors: Equated Scores, Item Response Theory, Sampling, Statistical Distributions

Standard Errors of Levine Linear Equating.

Peer reviewed

Hanson, Bradley A.; And Others – Applied Psychological Measurement, 1993

The delta method was used to derive standard errors (SES) of the Levine observed score and Levine true score linear test equating methods using data from two test forms. SES derived without the normality assumption and bootstrap SES were very close. The situation with skewed score distributions is also discussed. (SLD)

Descriptors: Equated Scores, Equations (Mathematics), Error of Measurement, Sampling

A General Approach to Algorithmic Design of Fixed-Form Tests, Adaptive Tests, and Testlets.

Peer reviewed

Berger, Martijn P. F. – Applied Psychological Measurement, 1994

This paper focuses on similarities of optimal design of fixed-form tests, adaptive tests, and testlets within the framework of the general theory of optimal designs. A sequential design procedure is proposed that uses these similarities to obtain consistent estimates for the trait level distribution. (SLD)

Descriptors: Achievement Tests, Adaptive Testing, Algorithms, Estimation (Mathematics)

Observed-Score Equating as a Test Assembly Problem.

Peer reviewed

van der Linden, Wim J.; Luecht, Richard M. – Psychometrika, 1998

Derives a set of linear conditions of item-response functions that guarantees identical observed-score distributions on two test forms. The conditions can be added as constraints to a linear programming model for test assembly. An example illustrates the use of the model for an item pool from the Law School Admissions Test (LSAT). (SLD)

Descriptors: Equated Scores, Item Banks, Item Response Theory, Linear Programming

Validation of CATSIB To Investigate DIF of CAT Data.

Download full text

Nandakumar, Ratna; Roussos, Louis – 1997

This paper investigates the performance of CATSIB (a modified version of the SIBTEST computer program) to assess differential item functioning (DIF) in the context of computerized adaptive testing (CAT). One of the distinguishing features of CATSIB is its theoretically built-in regression correction to control for the Type I error rates when the…

Descriptors: Adaptive Testing, Computer Assisted Testing, Item Bias, Power (Statistics)

A Test of the Curvilinear Hypothesis with FACES II and III.

Peer reviewed

Pratt, David M.; Hansen, James C. – Journal of Marital and Family Therapy, 1987

Olson's Circumplex Model hypothesizes that cohesiveness and adaptability dimensions measured on Family Adaptability and Cohesion Evaluation Scales (FACES) have a curvilinear relationship with family functioning. Study testing curvilinear hypothesis indicated that FACES II and III did not adequately operationalize curvilinear hypothesis. Findings…

Descriptors: Adjustment (to Environment), Evaluation Methods, Family Characteristics, Family Counseling

A Missing Data Approach to Estimating Distributions of Scores for Optional Test Sections.

Allen, Nancy L.; And Others – 1992

Many testing programs include a section of optional questions in addition to mandatory parts of a test. These optional parts of a test are not often truly parallel to one another, and groups of examinees selecting each optional test section are not equivalent to one another. This paper provides a general method based on missing-data methods for…

Descriptors: Comparative Testing, Estimation (Mathematics), Graphs, Scaling

Trends in SAT Content and Statistical Characteristics and Their Relationship to SAT Predictive Validity.

Download full text

Marco, Gary L.; And Others – 1990

Data from the College Board Validity Study Service show that the average multiple correlation of the Scholastic Aptitude Test (SAT) with college grades peaked in 1974 and then tended to decline. Data from other sources also estimate a small average decline from 1974 to 1985. This study documented changes in the SAT and related these changes to…

Descriptors: Change, College Entrance Examinations, Correlation, Educational Trends

Construction of Parallel Test Forms Using Optimal Test Designs.

Download full text

Dirir, Mohamed A. – 1995

The effectiveness of an optimal item selection method in designing parallel test forms was studied during the development of two forms that were parallel to an existing form for each of three language arts tests for fourth graders used in the Connecticut Mastery Test. Two listening comprehension forms, two reading comprehension forms, and two…

Descriptors: Elementary School Students, Grade 4, Intermediate Grades, Item Banks

Distributional Projections: A Practical Application of the Rasch Model.

Download full text

Phillips, Gary W.; Huynh, Huynh – 1985

A procedure which may be used to project the frequency distribution of one test onto that of another test is described and illustrated. The procedure is useful when a test developer wishes to construct an alternate form with preferred distributional characteristics. For example, the test developer may wish to construct a new test form with a…

Descriptors: Achievement Tests, Elementary Secondary Education, Item Analysis, Item Banks

Provincial Report: Achievement Tests, September 1986. Student Evaluation.

Alberta Dept. of Education, Edmonton. Student Evaluation and Data Processing Branch. – 1986

This document reports the provincial results of the June 1986 student achievement tests in Alberta in grade 3 mathematics, grade 6 science, and grade 9 English language arts. The achievement tests are specific to the program of studies prescribed by the Minister of Education. The document starts with general information about the testing program…

Descriptors: Achievement Tests, Elementary Secondary Education, Foreign Countries, Grade 3

Hanson, Bradley A.	2
Allen, Nancy L.	1
Baker, Frank B.	1
Berger, Martijn P. F.	1
Dirir, Mohamed A.	1
Hansen, James C.	1
Heil, Martin	1
Huynh, Huynh	1
Jansen, Petra	1
Luecht, Richard M.	1
Marco, Gary L.	1
Nandakumar, Ratna	1
Neuburger, Sarah	1
Phillips, Gary W.	1
Pratt, David M.	1
Quaiser-Pohl, Claudia	1
Roussos, Louis	1
Schmelter, Andrea	1
Sinharay, Sandip	1
van der Linden, Wim J.	1
More ▼