Showing 5,296 to 5,310 of 9,547 results
Peer reviewed
Emons, Wilco H. M.; Sijtsma, Klaas; Meijer, Rob R. – Psychological Methods, 2007
Short tests containing at most 15 items are used in clinical and health psychology, medicine, and psychiatry for making decisions about patients. Because short tests have large measurement error, the authors ask whether they are reliable enough for classifying patients into a treatment and a nontreatment group. For a given certainty level,…
Descriptors: Psychiatry, Patients, Error of Measurement, Test Length
Peer reviewed
Reid, Christine A.; Kolakowsky-Hayner, Stephanie A.; Lewis, Allen N.; Armstrong, Amy J. – Rehabilitation Counseling Bulletin, 2007
Item response theory (IRT) methodology is introduced as a tool for improving assessment instruments used with people who have disabilities. Need for this approach in rehabilitation is emphasized; differences between IRT and classical test theory are clarified. Concepts essential to understanding IRT are defined, necessary data assumptions are…
Descriptors: Psychometrics, Methods, Item Response Theory, Aptitude Tests
Peer reviewed
Ketterlin-Geller, Leanne R.; Yovanoff, Paul; Tindal, Gerald – Exceptional Children, 2007
Accommodations influence the measurement of student proficiency. However, with discrepant research findings, it is difficult to evaluate the effects of these practices on the measurement of performance of students with special needs. In this article, we present results from an experimental study investigating the effects of item characteristics…
Descriptors: Student Characteristics, Special Needs Students, Mathematics Tests, Testing Accommodations
Peer reviewed
van der Linden, Wim J.; Breithaupt, Krista; Chuah, Siang Chee; Zhang, Yanwei – Journal of Educational Measurement, 2007
A potential undesirable effect of multistage testing is differential speededness, which happens if some of the test takers run out of time because they receive subtests with items that are more time intensive than others. This article shows how a probabilistic response-time model can be used for estimating differences in time intensities and speed…
Descriptors: Adaptive Testing, Evaluation Methods, Test Items, Reaction Time
Draaijer, S.; Hartog, R. J. M. – E-Journal of Instructional Science and Technology, 2007
A set of design patterns for digital item types has been developed in response to challenges identified in various projects by teachers in higher education. The goal of the projects in question was to design and develop formative and summative tests, and to develop interactive learning material in the form of quizzes. The subject domains involved…
Descriptors: Higher Education, Instructional Design, Test Format, Biological Sciences
Peer reviewed
Zabaleta, Francisco – CALICO Journal, 2007
Placing students of a foreign language within a basic language program constitutes an ongoing problem, particularly for large university departments when they have many incoming freshmen and transfer students. This article outlines the author's experience designing and piloting a language placement test for a university level Spanish program. The…
Descriptors: Test Items, Student Placement, Spanish, Transfer Students
Peer reviewed
Gvozdenko, Eugene; Chambers, Dianne – Australasian Journal of Educational Technology, 2007
This paper investigates how monitoring the time spent on a question in a test of basic mathematics skills can provide insights into learning processes, the quality of test takers' knowledge, and cognitive demands and performance of test items that otherwise would remain undiscovered if the usual test outcome of accuracy only format…
Descriptors: Reaction Time, Computer Assisted Testing, Mathematics Tests, Test Items
Peer reviewed
Mariano, Louis T.; Junker, Brian W. – Journal of Educational and Behavioral Statistics, 2007
When constructed response test items are scored by more than one rater, the repeated ratings allow for the consideration of individual rater bias and variability in estimating student proficiency. Several hierarchical models based on item response theory have been introduced to model such effects. In this article, the authors demonstrate how these…
Descriptors: Test Items, Item Response Theory, Rating Scales, Scoring
Peer reviewed
Elosua, Paula; Lopez-Jauregui, Alicia – International Journal of Testing, 2007
This report shows a classification of differential item functioning (DIF) sources that have an effect on the adaptation of tests. This classification is based on linguistic and cultural criteria. Four general DIF sources are distinguished: cultural relevance, translation problems, morphosyntactic differences, and semantic differences. The…
Descriptors: Semantics, Cultural Relevance, Classification, Test Bias
Peer reviewed
Liao, Chi-Wen; Livingston, Samuel A. – ETS Research Report Series, 2008
Randomly equivalent forms (REF) of tests in listening and reading for nonnative speakers of English were created by stratified random assignment of items to forms, stratifying on item content and predicted difficulty. The study included 50 replications of the procedure for each test. Each replication generated 2 REFs. The equivalence of those 2…
Descriptors: Equated Scores, Item Analysis, Test Items, Difficulty Level
Peer reviewed
Chaudhary, Shreesh – TESL-EJ, 2008
Courses in Spoken English (SE) are yet to be acceptable in Indian universities because conducting session-end tests in SE is assumed to be logistically difficult and academically problematic. This article argues that it need not necessarily be so; session-end tests can be conducted just as in other courses. With voice recording, preferably a…
Descriptors: Educational Technology, Computer Networks, French, English (Second Language)
Peer reviewed
MacSwan, Jeff; Mahoney, Kate – Journal of Educational Research & Policy Studies, 2008
Construct validity concerns for the IPT I Oral Grades K-6 Spanish Second Edition (IPT-S) as a measure of native oral language proficiency are examined. The examination included describing a subset of items that contributes most to overall score and native-language proficiency designation. Correlations between this subset of items and the overall…
Descriptors: Language Research, Oral Language, Language Tests, Construct Validity
Peer reviewed
Frey, Andreas; Hartig, Johannes; Rupp, Andre A. – Educational Measurement: Issues and Practice, 2009
In most large-scale assessments of student achievement, several broad content domains are tested. Because more items are needed to cover the content domains than can be presented in the limited testing time to each individual student, multiple test forms or booklets are utilized to distribute the items to the students. The construction of an…
Descriptors: Measures (Individuals), Test Construction, Theory Practice Relationship, Design
Smith, Richard M.; And Others – 1995
In the mid to late 1970s, considerable research was conducted on the properties of Rasch fit mean squares, resulting in transformations to convert the mean squares into approximate t-statistics. In the late 1980s and the early 1990s, the trend seems to have reversed, with numerous researchers using the untransformed fit mean squares as a means of…
Descriptors: Evaluation Methods, Goodness of Fit, Item Response Theory, Sample Size
Stocking, Martha L.; And Others – 1991
A previously developed method of automatically selecting items for inclusion in a test subject to constraints on item content and statistical properties is applied to real data. Two tests are first assembled by experts in test construction who normally assemble such tests on a routine basis. Using the same pool of items and constraints articulated…
Descriptors: Algorithms, Automation, Coding, Computer Assisted Testing