Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 2 |
Descriptor
Statistical Distributions | 14 |
Test Format | 14 |
Scores | 6 |
Equated Scores | 4 |
Item Banks | 4 |
Test Construction | 4 |
Achievement Tests | 3 |
Item Response Theory | 3 |
Adaptive Testing | 2 |
Elementary School Students | 2 |
Elementary Secondary Education | 2 |
More ▼ |
Source
Applied Psychological… | 3 |
Applied Measurement in… | 1 |
International Journal of… | 1 |
Journal of Educational and… | 1 |
Journal of Marital and Family… | 1 |
Psychometrika | 1 |
Author
Hanson, Bradley A. | 2 |
Allen, Nancy L. | 1 |
Baker, Frank B. | 1 |
Berger, Martijn P. F. | 1 |
Dirir, Mohamed A. | 1 |
Hansen, James C. | 1 |
Heil, Martin | 1 |
Huynh, Huynh | 1 |
Jansen, Petra | 1 |
Luecht, Richard M. | 1 |
Marco, Gary L. | 1 |
More ▼ |
Publication Type
Reports - Evaluative | 10 |
Journal Articles | 8 |
Reports - Research | 4 |
Speeches/Meeting Papers | 4 |
Education Level
Elementary Education | 1 |
Grade 2 | 1 |
Grade 4 | 1 |
Audience
Researchers | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Family Adaptability Cohesion… | 1 |
Law School Admission Test | 1 |
SAT (College Admission Test) | 1 |
What Works Clearinghouse Rating
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015
Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the ?2 statistic, both of which have been used for tests…
Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics
Quaiser-Pohl, Claudia; Neuburger, Sarah; Heil, Martin; Jansen, Petra; Schmelter, Andrea – International Journal of Testing, 2014
This article presents a reanalysis of the data of 862 second and fourth graders collected in two previous studies, focusing on the influence of method (psychometric vs. chronometric) and stimulus type on the gender difference in mental-rotation accuracy. The children had to solve mental-rotation tasks with animal pictures, letters, or cube…
Descriptors: Foreign Countries, Gender Differences, Accuracy, Age Differences

Hanson, Bradley A. – Applied Measurement in Education, 1996
Determining whether score distributions differ on two or more test forms administered to samples of examinees from a single population is explored using three statistical tests using loglinear models. Examples are presented of applying tests of distribution differences to decide if equating is needed for alternative forms of a test. (SLD)
Descriptors: Equated Scores, Scoring, Statistical Distributions, Test Format

Baker, Frank B. – Applied Psychological Measurement, 1996
Using the characteristic curve method for dichotomously scored test items, the sampling distributions of equating coefficients were examined. Simulations indicate that for the equating conditions studied, the sampling distributions of the equating coefficients appear to have acceptable characteristics, suggesting confidence in the values obtained…
Descriptors: Equated Scores, Item Response Theory, Sampling, Statistical Distributions

Hanson, Bradley A.; And Others – Applied Psychological Measurement, 1993
The delta method was used to derive standard errors (SES) of the Levine observed score and Levine true score linear test equating methods using data from two test forms. SES derived without the normality assumption and bootstrap SES were very close. The situation with skewed score distributions is also discussed. (SLD)
Descriptors: Equated Scores, Equations (Mathematics), Error of Measurement, Sampling

Berger, Martijn P. F. – Applied Psychological Measurement, 1994
This paper focuses on similarities of optimal design of fixed-form tests, adaptive tests, and testlets within the framework of the general theory of optimal designs. A sequential design procedure is proposed that uses these similarities to obtain consistent estimates for the trait level distribution. (SLD)
Descriptors: Achievement Tests, Adaptive Testing, Algorithms, Estimation (Mathematics)

van der Linden, Wim J.; Luecht, Richard M. – Psychometrika, 1998
Derives a set of linear conditions of item-response functions that guarantees identical observed-score distributions on two test forms. The conditions can be added as constraints to a linear programming model for test assembly. An example illustrates the use of the model for an item pool from the Law School Admissions Test (LSAT). (SLD)
Descriptors: Equated Scores, Item Banks, Item Response Theory, Linear Programming
Nandakumar, Ratna; Roussos, Louis – 1997
This paper investigates the performance of CATSIB (a modified version of the SIBTEST computer program) to assess differential item functioning (DIF) in the context of computerized adaptive testing (CAT). One of the distinguishing features of CATSIB is its theoretically built-in regression correction to control for the Type I error rates when the…
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Bias, Power (Statistics)

Pratt, David M.; Hansen, James C. – Journal of Marital and Family Therapy, 1987
Olson's Circumplex Model hypothesizes that cohesiveness and adaptability dimensions measured on Family Adaptability and Cohesion Evaluation Scales (FACES) have a curvilinear relationship with family functioning. Study testing curvilinear hypothesis indicated that FACES II and III did not adequately operationalize curvilinear hypothesis. Findings…
Descriptors: Adjustment (to Environment), Evaluation Methods, Family Characteristics, Family Counseling
Allen, Nancy L.; And Others – 1992
Many testing programs include a section of optional questions in addition to mandatory parts of a test. These optional parts of a test are not often truly parallel to one another, and groups of examinees selecting each optional test section are not equivalent to one another. This paper provides a general method based on missing-data methods for…
Descriptors: Comparative Testing, Estimation (Mathematics), Graphs, Scaling
Marco, Gary L.; And Others – 1990
Data from the College Board Validity Study Service show that the average multiple correlation of the Scholastic Aptitude Test (SAT) with college grades peaked in 1974 and then tended to decline. Data from other sources also estimate a small average decline from 1974 to 1985. This study documented changes in the SAT and related these changes to…
Descriptors: Change, College Entrance Examinations, Correlation, Educational Trends
Dirir, Mohamed A. – 1995
The effectiveness of an optimal item selection method in designing parallel test forms was studied during the development of two forms that were parallel to an existing form for each of three language arts tests for fourth graders used in the Connecticut Mastery Test. Two listening comprehension forms, two reading comprehension forms, and two…
Descriptors: Elementary School Students, Grade 4, Intermediate Grades, Item Banks
Phillips, Gary W.; Huynh, Huynh – 1985
A procedure which may be used to project the frequency distribution of one test onto that of another test is described and illustrated. The procedure is useful when a test developer wishes to construct an alternate form with preferred distributional characteristics. For example, the test developer may wish to construct a new test form with a…
Descriptors: Achievement Tests, Elementary Secondary Education, Item Analysis, Item Banks
Alberta Dept. of Education, Edmonton. Student Evaluation and Data Processing Branch. – 1986
This document reports the provincial results of the June 1986 student achievement tests in Alberta in grade 3 mathematics, grade 6 science, and grade 9 English language arts. The achievement tests are specific to the program of studies prescribed by the Minister of Education. The document starts with general information about the testing program…
Descriptors: Achievement Tests, Elementary Secondary Education, Foreign Countries, Grade 3