ERIC - Search Results

Showing 1 to 15 of 26 results

A Comparison of Automated Scale Short Form Selection Strategies

Peer reviewed

Raborn, Anthony W.; Leite, Walter L.; Marcoulides, Katerina M. – International Educational Data Mining Society, 2019

Short forms of psychometric scales have been commonly used in educational and psychological research to reduce the burden of test administration. However, it is challenging to select items for a short form that preserve the validity and reliability of the scores of the original scale. This paper presents and evaluates multiple automated methods…

Descriptors: Psychometrics, Measures (Individuals), Mathematics, Heuristics

Hybrid Computerized Adaptive Testing: From Group Sequential Design to Fully Sequential Design

Peer reviewed

Wang, Shiyu; Lin, Haiyan; Chang, Hua-Hua; Douglas, Jeff – Journal of Educational Measurement, 2016

Computerized adaptive testing (CAT) and multistage testing (MST) have become two of the most popular modes in large-scale computer-based sequential testing. Though most designs of CAT and MST exhibit strength and weakness in recent large-scale implementations, there is no simple answer to the question of which design is better because different…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Format, Sequential Approach

Comparing the Performance of Five Multidimensional CAT Selection Procedures with Different Stopping Rules

Peer reviewed

Yao, Lihua – Applied Psychological Measurement, 2013

Through simulated data, five multidimensional computerized adaptive testing (MCAT) selection procedures with varying test lengths are examined and compared using different stopping rules. Fixed item exposure rates are used for all the items, and the Priority Index (PI) method is used for the content constraints. Two stopping rules, standard error…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection

Assumptions of Multiple Regression: Correcting Two Misconceptions

Peer reviewed

Williams, Matt N.; Gomez Grajales, Carlos Alberto; Kurkiewicz, Dason – Practical Assessment, Research & Evaluation, 2013

In 2002, an article entitled "Four assumptions of multiple regression that researchers should always test" by Osborne and Waters was published in "PARE." This article has gone on to be viewed more than 275,000 times (as of August 2013), and it is one of the first results displayed in a Google search for "regression…

Descriptors: Multiple Regression Analysis, Misconceptions, Reader Response, Predictor Variables

Computer-Based Assessment in Safety-Critical Industries: The Case of Shipping

Peer reviewed

Gekara, Victor Oyaro; Bloor, Michael; Sampson, Helen – Journal of Vocational Education and Training, 2011

Vocational education and training (VET) concerns the cultivation and development of specific skills and competencies, in addition to broad underpinning knowledge relating to paid employment. VET assessment is, therefore, designed to determine the extent to which a trainee has effectively acquired the knowledge, skills, and competencies required by…

Descriptors: Marine Education, Occupational Safety and Health, Computer Assisted Testing, Vocational Education

Asymptotically Distribution-Free (ADF) Interval Estimation of Coefficient Alpha

Peer reviewed

Maydeu-Olivares, Alberto; Coffman, Donna L.; Hartmann, Wolfgang M. – Psychological Methods, 2007

The point estimate of sample coefficient alpha may provide a misleading impression of the reliability of the test score. Because sample coefficient alpha is consistently biased downward, it is more likely to yield a misleading impression of poor reliability. The magnitude of the bias is greatest precisely when the variability of sample alpha is…

Descriptors: Intervals, Scores, Sample Size, Simulation

Measurement Invariance versus Selection Invariance: Is Fair Selection Possible?

Peer reviewed

Borsboom, Denny; Romeijn, Jan-Willem; Wicherts, Jelte M. – Psychological Methods, 2008

This article shows that measurement invariance (defined in terms of an invariant measurement model in different groups) is generally inconsistent with selection invariance (defined in terms of equal sensitivity and specificity across groups). In particular, when a unidimensional measurement instrument is used and group differences are present in…

Descriptors: Test Items, Minority Groups, Measurement, Scores

Coefficient Alpha as an Estimate of Test Reliability under Violation of Two Assumptions.

Peer reviewed

Zimmerman, Donald W.; And Others – Educational and Psychological Measurement, 1993

Coefficient alpha was examined through computer simulation as an estimate of test reliability under violation of two assumptions. Coefficient alpha underestimated reliability under violation of the assumption of essential tau-equivalence of subtest scores and overestimated it under violation of the assumption of uncorrelated subtest error scores.…

Descriptors: Computer Simulation, Estimation (Mathematics), Mathematical Models, Robustness (Statistics)

Performance of SIBTEST When the Percentage of DIF Items Is Large

Peer reviewed

Gierl, Mark J.; Gotzmann, Andrea; Boughton, Keith A. – Applied Measurement in Education, 2004

Differential item functioning (DIF) analyses are used to identify items that operate differently between two groups, after controlling for ability. The Simultaneous Item Bias Test (SIBTEST) is a popular DIF detection method that matches examinees on a true score estimate of ability. However, in some testing situations, like test translation and…

Descriptors: True Scores, Simulation, Test Bias, Student Evaluation

A Comparative Analysis of Simulated and Direct Oral Proficiency Interviews.

Stansfield, Charles W. – 1990

The simulated oral proficiency interview (SOPI) is a semi-direct speaking test that models the format of the oral proficiency interview (OPI). The OPI is a method of assessing general speaking proficiency in a second language. The SOPI is a tape-recorded test consisting of six parts: simple personal background questions posed in a simulated…

Descriptors: Comparative Analysis, Interviews, Language Proficiency, Language Tests

The Reliability of Linearly Equated Tests.

Peer reviewed

Segall, Daniel O. – Psychometrika, 1994

An asymptotic expression for the reliability of a linearly equated test is developed using normal theory. Reliability is expressed as the product of test reliability before equating and an adjustment term that is a function of the sample sizes used to estimate the linear equating transformation. The approach is illustrated. (SLD)

Descriptors: Equated Scores, Error of Measurement, Estimation (Mathematics), Sample Size

Obtaining Some Degree of Correspondence Between Unequatable Scores: A Comparison of Item Response Theory and Equipercentile Equating Methods.

Yen, Wendy M. – 1982

Test scores that are not perfectly reliable cannot be strictly equated unless they are strictly parallel. This fact implies that tau equivalence can be lost if an equipercentile equating is applied to observed scores that are not strictly parallel. Thirty-six simulated data sets are produced to simulate equating tests with different difficulties…

Descriptors: Difficulty Level, Equated Scores, Latent Trait Theory, Methods

Influence of Test and Person Characteristics on Nonparametric Appropriateness Measurement.

Peer reviewed

Meijer, Rob R.; And Others – Applied Psychological Measurement, 1994

The power of the nonparametric person-fit statistic, U3, is investigated through simulations as a function of item characteristics, test characteristics, person characteristics, and the group to which examinees belong. Results suggest conditions under which relatively short tests can be used for person-fit analysis. (SLD)

Descriptors: Difficulty Level, Group Membership, Item Response Theory, Nonparametric Statistics

Professor "X": How Experts Rated His Student Ratings.

Peer reviewed

Renner, Richard R.; Greenwood, Gordon E. – Assessment and Evaluation in Higher Education, 1985

Fictitious student evaluations of a faculty member's teaching performance are presented to the reader in an exercise in interpreting such information. Evaluator comments reveal a widespread divergence of views. (MSE)

Descriptors: College Faculty, Evaluation Criteria, Evaluation Methods, Higher Education

Exact Distributions of Intraclass Correlation and Cronbach's Alpha with Gaussian Data and General Covariance

Peer reviewed

Kistner, Emily O.; Muller, Keith E. – Psychometrika, 2004

Intraclass correlation and Cronbach's alpha are widely used to describe reliability of tests and measurements. Even with Gaussian data, exact distributions are known only for compound symmetric covariance (equal variances and equal correlations). Recently, large sample Gaussian approximations were derived for the distribution functions. New exact…

Descriptors: Correlation, Test Reliability, Test Results, Probability
