ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	3

Descriptor

Difficulty Level	10
Measurement Techniques	10
Test Theory	10
Test Items	9
Latent Trait Theory	4
Mathematical Models	4
Testing Problems	4
Psychometrics	3
Scores	3
Test Format	3
Test Validity	3
Adaptive Testing	2
Criterion Referenced Tests	2
Evaluation Criteria	2
Evaluation Methods	2
Foreign Countries	2
Higher Education	2
Item Analysis	2
Item Banks	2
Language Tests	2
Multiple Choice Tests	2
Sample Size	2
Scaling	2
Scoring	2
Standardized Tests	2
More ▼

Source

College Board	1
International Journal of…	1
International Journal of…	1

Author

Chakrabartty, Satyendra Nath	1
Demirtas Tolaman, Tugba	1
Engelhard, George, Jr.	1
Gur Erdogan, Duygu	1
Hambleton, Ronald K.	1
Kaya Uyanik, Gulden	1
Kiely, Gerard L.	1
Rogers, H. Jane	1
Seong, Tae-Je	1
Subkoviak, Michael J.	1
Theunissen, Phiel J. J. M.	1
Thomas, Gregory P.	1
Wainer, Howard	1
Wind, Stefanie A.	1
Zwick, Rebecca	1
van Weeren, J., Ed.	1
More ▼

Publication Type

Reports - Research	7
Speeches/Meeting Papers	4
Journal Articles	2
Collected Works - Proceedings	1
Opinion Papers	1
Reports - Evaluative	1

Education Level

Elementary Education	1
Grade 6	1
High Schools	1
Intermediate Grades	1
Middle Schools	1
Secondary Education	1

Audience

Researchers

Location

Netherlands	1
Sweden	1
Turkey	1
United Kingdom (England)	1
United Kingdom (Northern…	1
United Kingdom (Wales)	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 10 results Save | Export

Assessment of Item and Test Parameters: Cosine Similarity Approach

Peer reviewed
PDF on ERIC

Download full text

Chakrabartty, Satyendra Nath – International Journal of Psychology and Educational Studies, 2021

The paper proposes new measures of difficulty and discriminating values of binary items and test consisting of such items and find their relationships including estimation of test error variance and thereby the test reliability, as per definition using cosine similarities. The measures use entire data. Difficulty value of test and item is defined…

Descriptors: Test Items, Difficulty Level, Scores, Test Reliability

Examination of Common Exams Held by Measurement and Assessment Centers: Many Facet Rasch Analysis

Peer reviewed
PDF on ERIC

Download full text

Kaya Uyanik, Gulden; Demirtas Tolaman, Tugba; Gur Erdogan, Duygu – International Journal of Assessment Tools in Education, 2021

This paper aims to examine and assess the questions included in the "Turkish Common Exam" for sixth graders held in the first semester of 2018 which is one of the common exams carried out by The Measurement and Evaluation Centers, in terms of question structure, quality and taxonomic value. To this end, the test questions were examined…

Descriptors: Foreign Countries, Grade 6, Standardized Tests, Test Items

Rating Quality Studies Using Rasch Measurement Theory. Research Report 2013-3

Download full text

Engelhard, George, Jr.; Wind, Stefanie A. – College Board, 2013

The major purpose of this study is to examine the quality of ratings assigned to CR (constructed-response) questions in large-scale assessments from the perspective of Rasch Measurement Theory. Rasch Measurement Theory provides a framework for the examination of rating scale category structure that can yield useful information for interpreting the…

Descriptors: Measurement Techniques, Rating Scales, Test Theory, Scores

A Comparative Study of Recently Proposed Item Bias Detection Methods.

Download full text

Seong, Tae-Je; Subkoviak, Michael J. – 1987

The purpose of this research was to reinvestigate the accuracy of three item bias detection procedures: (1) Linn and Harnisch's pseudo-IRT(Z) method; (2) Camilli's chi-square technique; and (3) Angoff's revised transformed item difficulty method. These methods are applied when the minority group sample size is too small to obtain stable estimates…

Descriptors: Blacks, Difficulty Level, Higher Education, Item Analysis

Some Properties of the Pearson Correlation Matrix of Guttman-Scalable Items.

Download full text

Zwick, Rebecca – 1986

Although perfectly scalable items rarely occur in practice, Guttman's concept of a scale has proved to be valuable to the development of measurement theory. If the score distribution is uniform and there is an equal number of items at each difficulty level, both the elements and the eigenvalues of the Pearson correlation matrix of dichotomous…

Descriptors: Correlation, Difficulty Level, Item Analysis, Latent Trait Theory

Introduction to Rasch Measurement: Some Implications for Languages.

Theunissen, Phiel J. J. M. – 1983

Any systematic approach to the assessment of students' ability implies the use of a model. The more explicit the model is, the more its users know about what they are doing and what the consequences are. The Rasch model is a strong model where measurement is a bonus of the model itself. It is based on four ideas: (1) separation of observable…

Descriptors: Ability Grouping, Difficulty Level, Evaluation Criteria, Item Sampling

Information Needs within a Multi-District Environment.

Thomas, Gregory P. – 1986

This paper argues that no single measurement strategy serves all purposes and that applying methods and techniques which allow a variety of data elements to be retrieved and juxtaposed may be an investment in the future. Item response theory, Rasch model, and latent trait theory are all approaches to a single conceptual topic. An abbreviated look…

Descriptors: Achievement Tests, Adaptive Testing, Criterion Referenced Tests, Data Collection

CATs, Testlets, and Test Construction: A Rationale for Putting Test Developers Back into CAT.

Wainer, Howard; Kiely, Gerard L. – 1986

Recent experience with the Computerized Adaptive Test (CAT) has raised a number of concerns about its practical applications. The concerns are principally involved with the concept of having the computer construct the test from a precalibrated item pool, and substituting statistical characteristics for the test developer's skills. Problems with…

Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Construct Validity

Practice and Problems in Language Testing 5. Non-Classical Test Theory; Final Examinations in Secondary Schools. Papers Presented at the International Language Testing Symposium (5th, Arnhem, Netherlands, March 25-26, 1982).

van Weeren, J., Ed. – 1983

Presented in this symposium reader are nine papers, four of which deal with the theory and impact of the Rasch model on language testing and five of which discuss final examinations in secondary schools in both general and specific terms. The papers are: "Introduction to Rasch Measurement: Some Implications for Language Testing" (J. J.…

Descriptors: Adolescents, Comparative Analysis, Comparative Education, Difficulty Level

Evaluation of the Plot Method for Identifying Potentially Biased Test Items.

Download full text

Hambleton, Ronald K.; Rogers, H. Jane – 1986

This report was designed to respond to two major methodological shortcomings in the item bias literature: (1) misfitting test models; and (2) the use of significance tests. Specifically, the goals of the research were to describe a newly developed method known as the "plot method" for identifying potentially biased test items and to…

Descriptors: Criterion Referenced Tests, Culture Fair Tests, Difficulty Level, Estimation (Mathematics)