ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	9

Descriptor

Statistical Distributions	19
Test Construction	19
Foreign Countries	6
Item Response Theory	6
Achievement Tests	5
Sampling	4
Scores	4
Test Reliability	4
Validity	4
Correlation	3
Higher Education	3
Mathematics Tests	3
Reliability	3
Scoring	3
Simulation	3
Test Items	3
Testing Problems	3
Computer Assisted Testing	2
Elementary Secondary Education	2
Equated Scores	2
Evaluation Methods	2
Goodness of Fit	2
International Assessment	2
Item Analysis	2
Item Banks	2
More ▼

Source

Educational and Psychological…	2
Journal of Educational…	2
Applied Psychological…	1
College Board	1
ETS Research Report Series	1
Education	1
International Education…	1
Journal of Educational and…	1
Journal on Efficiency and…	1
Measurement:…	1
Online Submission	1
Research in Mathematics…	1
More ▼

Publication Type

Reports - Research	19
Journal Articles	12
Speeches/Meeting Papers	5
Numerical/Quantitative Data	2
Reports - Evaluative	1
Tests/Questionnaires	1

Education Level

Higher Education	3
Postsecondary Education	3
Secondary Education	3
Elementary Secondary Education	1
High Schools	1

Audience

Researchers

Location

Jordan	2
India	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

California Achievement Tests	1
Comprehensive Tests of Basic…	1
NEO Personality Inventory	1
Praxis Series	1
Program for International…	1
SAT (College Admission Test)	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 19 results Save | Export

Neutrosophic Estimators for Estimating the Population Mean in Survey Sampling

Peer reviewed

Direct link

Vinay Kumar Yadav; Shakti Prasad – Measurement: Interdisciplinary Research and Perspectives, 2024

In sample survey analysis, accurate population mean estimation is an important task, but traditional approaches frequently ignore the intricacies of real-world data, leading to biassed results. In order to handle uncertainties, indeterminacies, and ambiguity, this work presents an innovative approach based on neutrosophic statistics. We proposed…

Descriptors: Sampling, Statistical Bias, Predictor Variables, Predictive Measurement

Investigating Constructed-Response Scoring over Time: The Effects of Study Design on Trend Rescore Statistics. Research Report. ETS RR-22-15

Peer reviewed
PDF on ERIC

Download full text

Donoghue, John R.; McClellan, Catherine A.; Hess, Melinda R. – ETS Research Report Series, 2022

When constructed-response items are administered for a second time, it is necessary to evaluate whether the current Time B administration's raters have drifted from the scoring of the original administration at Time A. To study this, Time A papers are sampled and rescored by Time B scorers. Commonly the scores are compared using the proportion of…

Descriptors: Item Response Theory, Test Construction, Scoring, Testing

Performance of Coefficient Alpha and Its Alternatives: Effects of Different Types of Non-Normality

Peer reviewed

Direct link

Xiao, Leifeng; Hau, Kit-Tai – Educational and Psychological Measurement, 2023

We examined the performance of coefficient alpha and its potential competitors (ordinal alpha, omega total, Revelle's omega total [omega RT], omega hierarchical [omega h], greatest lower bound [GLB], and coefficient "H") with continuous and discrete data having different types of non-normality. Results showed the estimation bias was…

Descriptors: Statistical Bias, Statistical Analysis, Likert Scales, Statistical Distributions

Chance-Constrained Automated Test Assembly

Peer reviewed

Direct link

Giada Spaccapanico Proietti; Mariagiulia Matteucci; Stefania Mignani; Bernard P. Veldkamp – Journal of Educational and Behavioral Statistics, 2024

Classical automated test assembly (ATA) methods assume fixed and known coefficients for the constraints and the objective function. This hypothesis is not true for the estimates of item response theory parameters, which are crucial elements in test assembly classical models. To account for uncertainty in ATA, we propose a chance-constrained…

Descriptors: Automation, Computer Assisted Testing, Ambiguity (Context), Item Response Theory

Bayesian Diagnostics for Test Design and Analysis

Peer reviewed
PDF on ERIC

Download full text

Silva, R. M.; Guan, Y.; Swartz, T. B. – Journal on Efficiency and Responsibility in Education and Science, 2017

This paper attempts to bridge the gap between classical test theory and item response theory. It is demonstrated that the familiar and popular statistics used in classical test theory can be translated into a Bayesian framework where all of the advantages of the Bayesian paradigm can be realized. In particular, prior opinion can be introduced and…

Descriptors: Item Response Theory, Bayesian Statistics, Test Construction, Markov Processes

Developing a Numerical Ability Test for Students of Education in Jordan: An Application of Item Response Theory

Peer reviewed
PDF on ERIC

Download full text

Abed, Eman Rasmi; Al-Absi, Mohammad Mustafa; Abu shindi, Yousef Abdelqader – International Education Studies, 2016

The purpose of the present study is developing a test to measure the numerical ability for students of education. The sample of the study consisted of (504) students from 8 universities in Jordan. The final draft of the test contains 45 items distributed among 5 dimensions. The results revealed that acceptable psychometric properties of the test;…

Descriptors: Foreign Countries, Item Response Theory, Numeracy, Reliability

Some Implications of Choice of Tiering Model in GCSE Mathematics for Inferences about What Students Know and Can Do

Peer reviewed

Direct link

Bramley, Tom – Research in Mathematics Education, 2017

This study compared models of assessment structure for achieving differentiation across the range of examinee attainment in the General Certificate of Secondary Education (GCSE) examination taken by 16-year-olds in England. The focus was on the "adjacent levels" model, where papers are targeted at three specific non-overlapping ranges of…

Descriptors: Foreign Countries, Mathematics Education, Student Certification, Student Evaluation

An Investigation of Jordanian EFL Teachers' Procedures of Achievement Test Construction

Peer reviewed

Direct link

Al-Shara'H, Nayel Darweesh – Education, 2013

The study aimed at investigating Jordanian EFL teachers' self-reported frequencies of using the procedures of preparing, correcting, analyzing, interpreting an achievement test, and discussing its results with students. To achieve this, a 31-item questionnaire was used. The questionnaire was administered to 118 basic stage EFL teachers after…

Descriptors: Foreign Countries, English (Second Language), Second Language Instruction, Test Construction

Comparing Simulated and Theoretical Sampling Distributions of the U3 Person-Fit Statistic.

Peer reviewed

Emons, Wilco H. M.; Meijer, Rob R.; Sijtsma, Klaas – Applied Psychological Measurement, 2002

Studied whether the theoretical sampling distribution of the U3 person-fit statistic is in agreement with the simulated sampling distribution under different item response theory models and varying item and test characteristics. Simulation results suggest that the use of standard normal deviates for the standardized version of the U3 statistic may…

Descriptors: Item Response Theory, Sampling, Simulation, Statistical Distributions

Inter-item Correlation Frequency Distribution Analysis: A Method for Evaluating Scale Dimensionality.

Peer reviewed

Piedmont, Ralph L.; Hyland, Michael E. – Educational and Psychological Measurement, 1993

The use of mean inter-item correlation as a technique for examining homogeneity is proposed as a descriptive tool that can orient researchers to salient aspects of their scales. A study of 341 undergraduates who completed the NEO Personality Inventory illustrates the technique. (SLD)

Descriptors: Correlation, Evaluation Methods, Higher Education, Personality Measures

Investigating the Effects of Increased SAT Reasoning Test™ Length and Time on Performance of Regular SAT® Examinees. Research Report No. 2006-9

Download full text

Wang, Xiang Bo – College Board, 2007

This research examines the effect of increased testing time by comparing the four performance indices of randomly equivalent examinee subpopulations on sections of similar content and difficulty administered at different times on three SAT administrations. A variety of analyses were used in this study and found no evidence that the current SAT…

Descriptors: College Entrance Examinations, Thinking Skills, High School Students, Test Length

Combining Data on Criticality and Frequency in Developing Test Plans for Licensure and Certification Examinations.

Peer reviewed

Kane, Michael T.; And Others – Journal of Educational Measurement, 1989

This paper develops a multiplicative model as a means of combining ratings of criticality and frequency of various activities involved in job analyses. The model incorporates adjustments to ensure that effective weights of criticality and frequency are appropriate. An example of the model's use is presented. (TJH)

Descriptors: Critical Incidents Method, Higher Education, Job Analysis, Licensing Examinations (Professions)

A Comparison of Equal Percentile and Partial Credit Equatings for Performance-Based Assessments Composed of Free-Response Items.

Peer reviewed

Huynh, Huynh; Ferrara, Steven – Journal of Educational Measurement, 1994

Equal percentile (EP) and partial credit (PC) equatings for raw scores from performance-based assessments with free-response items are compared through the use of data from the Maryland School Performance Assessment Program. Results suggest that EP and PC methods do not give equivalent results when distributions are markedly skewed. (SLD)

Descriptors: Comparative Analysis, Equated Scores, Mathematics Tests, Performance Based Assessment

The Effect of Poor Fitting Items on the Distributions of Extended Caution Indices.

Tomsic, Margie L.; And Others – 1987

Extended caution indices (ECI) specify the degree of confidence that can be placed in an individual's test score by analyzing patterns of item response. Among the most promising of such indices are the standardized ECIs. Contrary to the literature, several instances were found, in a previous study, of nonnormal distributions of ECIs with samples…

Descriptors: Achievement Tests, Elementary Education, Goodness of Fit, Latent Trait Theory

An Investigation of Two Procedures for Smoothing Test Norms.

Download full text

Jones, Patricia B.; Sabers, Darrell L. – 1984

Several techniques have been developed for creating continuous smooth distributions of test norms. This paper describes two studies that explore the behavior of cubic splines in order to determine their appropriateness for use in test norming. The first study uses data from the Curriculum Referenced Tests of Mastery (CRTM) and employs two…

Descriptors: Equated Scores, Goodness of Fit, Measurement Techniques, Norm Referenced Tests

Previous Page | Next Page »

Pages: 1 | 2

Huynh, Huynh	2
Abed, Eman Rasmi	1
Abu shindi, Yousef Abdelqader	1
Al-Absi, Mohammad Mustafa	1
Al-Shara'H, Nayel Darweesh	1
Bernard P. Veldkamp	1
Bramley, Tom	1
Case, Susan M.	1
Donoghue, John R.	1
Emons, Wilco H. M.	1
Ferrara, Steven	1
Giada Spaccapanico Proietti	1
Guan, Y.	1
Hariharan, Swaminathan	1
Hau, Kit-Tai	1
Hess, Melinda R.	1
Hyland, Michael E.	1
Jones, Patricia B.	1
Kane, Michael T.	1
Mariagiulia Matteucci	1
McClellan, Catherine A.	1
Meijer, Rob R.	1
Phillips, Gary W.	1
Piedmont, Ralph L.	1
More ▼