WBSSC SLST Education IX & X: Measurement and Evaluation in Education

MCQ on Measurement and Evaluation in Education

1. Which of the following represents the quantitative description of a student’s performance?
নিম্নলিখিত কোনটি একজন শিক্ষার্থীর পারফরম্যান্সের পরিমাণগত বিবরণ উপস্থাপন করে?

A) Evaluation / মূল্যায়ন B) Measurement / পরিমাপ C) Assessment / অ্যাসেসমেন্ট D) Test / পরীক্ষা

Correct Answer: B) Measurement / পরিমাপ

Explanation: Measurement is the process of assigning numerals to objects or events according to rules. It is quantitative and objective, focusing on “how much” a student has learned.
ব্যাখ্যা: পরিমাপ হল নিয়ম অনুসারে বস্তু বা ঘটনাকে সংখ্যাসূচক মান নির্ধারণ করার প্রক্রিয়া। এটি পরিমাণগত এবং বস্তুনিষ্ঠ, একজন শিক্ষার্থী “কতটা” শিখেছে তার উপর আলোকপাত করে।

2. Evaluation is a process that is:
মূল্যায়ন একটি প্রক্রিয়া যা হল:

A) Quantitative / পরিমাণগত B) Qualitative / গুণগত C) Both Quantitative and Qualitative / পরিমাণগত এবং গুণগত উভয়ই D) Subjective only / শুধুমাত্র ব্যক্তিগত

Correct Answer: C) Both Quantitative and Qualitative / পরিমাণগত এবং গুণগত উভয়ই

Explanation: Evaluation is a broader term than measurement. It includes both quantitative data (from tests and measurements) and qualitative data (from observations, interviews) to make a value judgment about the worth of something.
ব্যাখ্যা: মূল্যায়ন পরিমাপের চেয়ে একটি ব্যাপক পরিভাষা। এটি কোনো কিছুর যোগ্যতা সম্পর্কে মূল্য বিচার করার জন্য পরিমাণগত তথ্য (পরীক্ষা এবং পরিমাপ থেকে) এবং গুণগত তথ্য (পর্যবেক্ষণ, সাক্ষাৎকার থেকে) উভয়ই অন্তর্ভুক্ত করে।

3. The primary purpose of formative evaluation is:
গঠনমূলক মূল্যায়নের প্রাথমিক উদ্দেশ্য হল:

A) To grade students at the end of the course / কোর্সের শেষে শিক্ষার্থীদের গ্রেড দেওয়া B) To improve teaching and learning during the process / শিক্ষণ-শিখন প্রক্রিয়া চলাকালীন তার উন্নতি করা C) To select students for a specific program / একটি নির্দিষ্ট প্রোগ্রামের জন্য শিক্ষার্থী নির্বাচন করা D) To certify student achievement / শিক্ষার্থীর কৃতিত্বের প্রমাণপত্র দেওয়া

Correct Answer: B) To improve teaching and learning during the process / শিক্ষণ-শিখন প্রক্রিয়া চলাকালীন তার উন্নতি করা

Explanation: Formative evaluation is conducted during the instructional process to provide feedback to teachers and students to improve learning. It is ‘evaluation for learning’.
ব্যাখ্যা: গঠনমূলক মূল্যায়ন শিক্ষণ প্রক্রিয়া চলাকালীন পরিচালিত হয় যাতে শিক্ষক এবং শিক্ষার্থীদের শেখার উন্নতির জন্য মতামত প্রদান করা যায়। এটি ‘শিখনের জন্য মূল্যায়ন’।

4. Summative evaluation is typically conducted:
সার্বিক বা অন্তিম মূল্যায়ন সাধারণত পরিচালিত হয়:

A) At the beginning of a course / একটি কোর্সের শুরুতে B) During a course / একটি কোর্স চলাকালীন C) At the end of a course or unit / একটি কোর্স বা ইউনিটের শেষে D) On a daily basis / প্রতিদিনের ভিত্তিতে

Correct Answer: C) At the end of a course or unit / একটি কোর্স বা ইউনিটের শেষে

Explanation: Summative evaluation is used at the end of an instructional period (e.g., end of a chapter, semester, or year) to assess the final outcome of learning. It is ‘evaluation of learning’.
ব্যাখ্যা: সার্বিক মূল্যায়ন একটি নির্দেশনামূলক সময়কালের শেষে (যেমন, অধ্যায়, সেমিস্টার বা বছরের শেষে) শেখার চূড়ান্ত ফলাফল মূল্যায়ন করতে ব্যবহৃত হয়। এটি ‘শিখনের মূল্যায়ন’।

5. Which of the following is a tool for evaluation, not a technique?
নিচের কোনটি মূল্যায়নের একটি উপকরণ (tool), কৌশল (technique) নয়?

A) Observation / পর্যবেক্ষণ B) Interview / সাক্ষাৎকার C) Questionnaire / প্রশ্নাবলী D) Sociometry / সমাজমিতি

Correct Answer: C) Questionnaire / প্রশ্নাবলী

Explanation: A tool is a specific instrument used to collect data (e.g., a test, a checklist, a questionnaire). A technique is a method or procedure for gathering information (e.g., observation, interview). A questionnaire is a physical or digital tool containing questions.
ব্যাখ্যা: একটি উপকরণ (tool) হল তথ্য সংগ্রহের জন্য ব্যবহৃত একটি নির্দিষ্ট যন্ত্র (যেমন, একটি পরীক্ষা, চেকলিস্ট, প্রশ্নাবলী)। একটি কৌশল (technique) হল তথ্য সংগ্রহের একটি পদ্ধতি (যেমন, পর্যবেক্ষণ, সাক্ষাৎকার)। প্রশ্নাবলী হল প্রশ্ন সম্বলিত একটি ভৌত বা ডিজিটাল উপকরণ।

6. Reliability of a test refers to its:
একটি পরীক্ষার নির্ভরযোগ্যতা (Reliability) বলতে বোঝায় তার:

A) Usefulness / উপযোগিতা B) Consistency / সামঞ্জস্যতা C) Truthfulness or Accuracy / সত্যতা বা যথার্থতা D) Objectivity / বস্তুনিষ্ঠতা

Correct Answer: B) Consistency / সামঞ্জস্যতা

Explanation: Reliability means the consistency of scores obtained by the same individuals on different occasions or with different sets of equivalent items. A reliable test yields consistent results.
ব্যাখ্যা: নির্ভরযোগ্যতা মানে একই ব্যক্তি দ্বারা বিভিন্ন সময়ে বা সমতুল্য আইটেমের বিভিন্ন সেটের মাধ্যমে প্রাপ্ত স্কোরের সামঞ্জস্যতা। একটি নির্ভরযোগ্য পরীক্ষা সামঞ্জস্যপূর্ণ ফলাফল দেয়।

7. Validity of a test refers to the extent to which it:
একটি পরীক্ষার বৈধতা (Validity) বলতে বোঝায় যে এটি কতটা:

A) Measures what it intends to measure / যা পরিমাপ করার উদ্দেশ্যে তৈরি, তা-ই পরিমাপ করে B) Is free from scoring bias / স্কোরিং-এর পক্ষপাত থেকে মুক্ত C) Gives consistent scores / সামঞ্জস্যপূর্ণ স্কোর দেয় D) Is easy to administer / পরিচালনা করা সহজ

Correct Answer: A) Measures what it intends to measure / যা পরিমাপ করার উদ্দেশ্যে তৈরি, তা-ই পরিমাপ করে

Explanation: Validity is the most important characteristic of a good test. It ensures that the test is measuring the specific trait, skill, or knowledge it was designed to measure.
ব্যাখ্যা: বৈধতা একটি ভালো পরীক্ষার সবচেয়ে গুরুত্বপূর্ণ বৈশিষ্ট্য। এটি নিশ্চিত করে যে পরীক্ষাটি সেই নির্দিষ্ট বৈশিষ্ট্য, দক্ষতা বা জ্ঞান পরিমাপ করছে যা পরিমাপ করার জন্য এটি ডিজাইন করা হয়েছিল।

8. The first step in constructing an achievement test is:
একটি পারদর্শিতার অভীক্ষা (Achievement Test) তৈরির প্রথম ধাপ হল:

A) Writing the test items / পরীক্ষার প্রশ্নগুলি লেখা B) Preparing a blueprint / একটি ব্লুপ্রিন্ট প্রস্তুত করা C) Determining the test objectives / পরীক্ষার উদ্দেশ্য নির্ধারণ করা D) Administering the test / পরীক্ষা পরিচালনা করা

Correct Answer: C) Determining the test objectives / পরীক্ষার উদ্দেশ্য নির্ধারণ করা

Explanation: Before creating any test, it is crucial to first define what you want to measure. The instructional objectives guide the entire test construction process, from item writing to scoring.
ব্যাখ্যা: যেকোনো পরীক্ষা তৈরির আগে, আপনি কী পরিমাপ করতে চান তা নির্ধারণ করা অত্যন্ত গুরুত্বপূর্ণ। নির্দেশনামূলক উদ্দেশ্যগুলি প্রশ্ন লেখা থেকে শুরু করে স্কোরিং পর্যন্ত পুরো পরীক্ষা নির্মাণ প্রক্রিয়াকে পথ দেখায়।

9. A blueprint (table of specifications) for a test helps in ensuring:
একটি পরীক্ষার জন্য ব্লুপ্রিন্ট (নির্দিষ্টকরণের সারণী) কী নিশ্চিত করতে সাহায্য করে?

A) Content validity / বিষয়বস্তুগত বৈধতা (Content validity) B) Reliability / নির্ভরযোগ্যতা C) Objectivity / বস্তুনিষ্ঠতা D) Usability / ব্যবহারযোগ্যতা

Correct Answer: A) Content validity / বিষয়বস্তুগত বৈধতা (Content validity)

Explanation: A blueprint is a two-way chart that relates instructional objectives to the content areas, ensuring that the test items are a balanced and representative sample of the content taught and the objectives set.
ব্যাখ্যা: একটি ব্লুপ্রিন্ট হল একটি দ্বি-মাত্রিক চার্ট যা নির্দেশনামূলক উদ্দেশ্যগুলিকে বিষয়বস্তুর সাথে সম্পর্কিত করে, এটি নিশ্চিত করে যে পরীক্ষার প্রশ্নগুলি শেখানো বিষয়বস্তু এবং নির্ধারিত উদ্দেশ্যগুলির একটি ভারসাম্যপূর্ণ ও প্রতিনিধিত্বমূলক নমুনা।

10. A major defect of the traditional examination system is its emphasis on:
প্রচলিত পরীক্ষা ব্যবস্থার একটি প্রধান ত্রুটি হল এর উপর জোর দেওয়া:

A) Rote memorization / মুখস্থ বিদ্যার উপর B) Critical thinking / সমালোচনামূলক চিন্তাভাবনার উপর C) Practical skills / ব্যবহারিক দক্ষতার উপর D) Continuous assessment / নিরবচ্ছিন্ন মূল্যায়নের উপর

Correct Answer: A) Rote memorization / মুখস্থ বিদ্যার উপর

Explanation: One of the most significant criticisms of traditional examination systems is that they often reward the ability to recall facts (rote memorization) rather than higher-order thinking skills like analysis, synthesis, and evaluation.
ব্যাখ্যা: প্রচলিত পরীক্ষা ব্যবস্থার অন্যতম গুরুত্বপূর্ণ সমালোচনা হল যে এটি প্রায়শই বিশ্লেষণ, সংশ্লেষণ এবং মূল্যায়নের মতো উচ্চ-স্তরের চিন্তাভাবনার দক্ষতার পরিবর্তে তথ্য মনে রাখার ক্ষমতাকে (মুখস্থ বিদ্যা) পুরস্কৃত করে।

11. A Norm-Referenced Test (NRT) compares a student’s performance with:
একটি নর্ম-ভিত্তিক অভীক্ষা (NRT) একজন শিক্ষার্থীর পারফরম্যান্সকে কার সাথে তুলনা করে?

A) A pre-defined standard of mastery / দক্ষতার একটি পূর্ব-নির্ধারিত মানের সাথে B) The performance of other students / অন্য শিক্ষার্থীদের পারফরম্যান্সের সাথে C) The student’s own previous performance / শিক্ষার্থীর নিজের পূর্ববর্তী পারফরম্যান্সের সাথে D) The teacher’s expectations / শিক্ষকের প্রত্যাশার সাথে

Correct Answer: B) The performance of other students / অন্য শিক্ষার্থীদের পারফরম্যান্সের সাথে

Explanation: Norm-Referenced Tests are designed to rank students and compare their performance against a ‘norm’ group, which is a representative sample of other students.
ব্যাখ্যা: নর্ম-ভিত্তিক অভীক্ষাগুলি শিক্ষার্থীদের র‍্যাঙ্ক করার জন্য এবং তাদের পারফরম্যান্সকে একটি ‘নর্ম’ গ্রুপের সাথে তুলনা করার জন্য ডিজাইন করা হয়েছে, যা অন্যান্য শিক্ষার্থীদের একটি প্রতিনিধিত্বমূলক নমুনা।

12. A Criterion-Referenced Test (CRT) is designed to measure:
একটি নির্ণায়ক-ভিত্তিক অভীক্ষা (CRT) কী পরিমাপ করার জন্য ডিজাইন করা হয়েছে?

A) How a student ranks among peers / একজন শিক্ষার্থী তার সহপাঠীদের মধ্যে কোথায় স্থান পেয়েছে B) A student’s mastery of specific skills or knowledge / নির্দিষ্ট দক্ষতা বা জ্ঞানে শিক্ষার্থীর পারদর্শিতা C) A student’s general intelligence / একজন শিক্ষার্থীর সাধারণ বুদ্ধিমত্তা D) A student’s interest in a subject / একটি বিষয়ে শিক্ষার্থীর আগ্রহ

Correct Answer: B) A student’s mastery of specific skills or knowledge / নির্দিষ্ট দক্ষতা বা জ্ঞানে শিক্ষার্থীর পারদর্শিতা

Explanation: Criterion-Referenced Tests compare a student’s performance to a fixed standard or criterion of mastery. It tells what a student can or cannot do, irrespective of how others perform.
ব্যাখ্যা: নির্ণায়ক-ভিত্তিক অভীক্ষা একজন শিক্ষার্থীর পারফরম্যান্সকে একটি নির্দিষ্ট মান বা পারদর্শিতার নির্ণায়কের সাথে তুলনা করে। এটি বলে যে একজন শিক্ষার্থী কী করতে পারে বা পারে না, অন্যরা কেমন পারফর্ম করছে তা নির্বিশেষে।
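
Illustration: the contrast between norm-referenced and criterion-referenced interpretation can be sketched in a few lines of Python. This is only a minimal sketch. The function names, the sample scores, and the 80% mastery cutoff are assumptions made for illustration, and percentile rank is computed here with one simple convention (percentage of the norm group scoring strictly below the student).

```python
def percentile_rank(score, norm_group):
    """Norm-referenced view: percentage of the norm group scoring below `score`."""
    below = sum(1 for s in norm_group if s < score)
    return 100 * below / len(norm_group)


def has_mastered(score, max_score, cutoff=0.8):
    """Criterion-referenced view: compare the score to a fixed standard.

    The 0.8 cutoff is a hypothetical mastery criterion, not an official one.
    """
    return score / max_score >= cutoff


norm_group = [50, 60, 70, 80, 90]        # hypothetical norm-group scores
print(percentile_rank(70, norm_group))    # depends on how other students scored
print(has_mastered(70, 100))              # depends only on the fixed criterion
```

The same raw score of 70 gets a different percentile rank whenever the norm group changes, while the mastery decision stays the same as long as the criterion is unchanged.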

13. The Stanford-Binet test is primarily a measure of:
স্ট্যানফোর্ড-বিনেট পরীক্ষাটি মূলত কীসের পরিমাপ?

A) Personality / ব্যক্তিত্ব B) Aptitude / প্রবণতা C) Intelligence / বুদ্ধিমত্তা D) Interest / আগ্রহ

Correct Answer: C) Intelligence / বুদ্ধিমত্তা

Explanation: The Stanford-Binet Intelligence Scales is one of the most famous and widely used individual intelligence tests. It was a pioneering test in the field of intelligence measurement.
ব্যাখ্যা: স্ট্যানফোর্ড-বিনেট ইন্টেলিজেন্স স্কেল হল সবচেয়ে বিখ্যাত এবং বহুল ব্যবহৃত ব্যক্তিগত বুদ্ধিমত্তার পরীক্ষাগুলির মধ্যে একটি। এটি বুদ্ধিমত্তা পরিমাপের ক্ষেত্রে একটি অগ্রণী পরীক্ষা ছিল।

14. The Rorschach Inkblot Test is an example of a:
রোরশ্যাক ইংকব্লট টেস্ট কিসের উদাহরণ?

A) Projective personality test / প্রক্ষেপণমূলক ব্যক্তিত্বের অভীক্ষা B) Objective personality test / বস্তুনিষ্ঠ ব্যক্তিত্বের অভীক্ষা C) Intelligence test / বুদ্ধিমত্তার অভীক্ষা D) Interest inventory / আগ্রহের তালিকা

Correct Answer: A) Projective personality test / প্রক্ষেপণমূলক ব্যক্তিত্বের অভীক্ষা

Explanation: Projective tests present ambiguous stimuli (like inkblots) and ask the individual to interpret them. The idea is that the individual will “project” their unconscious thoughts, feelings, and conflicts onto the stimuli.
ব্যাখ্যা: প্রক্ষেপণমূলক অভীক্ষাগুলি অস্পষ্ট উদ্দীপনা (যেমন কালির দাগ) উপস্থাপন করে এবং ব্যক্তিকে সেগুলি ব্যাখ্যা করতে বলে। এর মূল ধারণাটি হল যে ব্যক্তি তার অচেতন চিন্তা, অনুভূতি এবং দ্বন্দ্বগুলিকে উদ্দীপনার উপর “প্রক্ষেপ” করবে।

15. Thematic Apperception Test (TAT) was developed by:
থিমেটিক অ্যাপারসেপশন টেস্ট (TAT) কে তৈরি করেন?

A) Hermann Rorschach / হারম্যান রোরশ্যাক B) Henry Murray and Christiana Morgan / হেনরি মারে এবং ক্রিস্টিয়ানা মরগান C) Alfred Binet / আলফ্রেড বিনে D) Raymond Cattell / রেমন্ড ক্যাটেল

Correct Answer: B) Henry Murray and Christiana Morgan / হেনরি মারে এবং ক্রিস্টিয়ানা মরগান

Explanation: The Thematic Apperception Test (TAT), a projective personality test, was developed by Henry A. Murray and Christiana D. Morgan at Harvard University in the 1930s.
ব্যাখ্যা: থিমেটিক অ্যাপারসেপশন টেস্ট (TAT), একটি প্রক্ষেপণমূলক ব্যক্তিত্বের অভীক্ষা, ১৯৩০-এর দশকে হার্ভার্ড বিশ্ববিদ্যালয়ে হেনরি এ. মারে এবং ক্রিস্টিয়ানা ডি. মরগান দ্বারা তৈরি হয়েছিল।

16. The Strong Vocational Interest Blank (SVIB) is used to measure:
স্ট্রং ভোকেশনাল ইন্টারেস্ট ব্ল্যাঙ্ক (SVIB) কী পরিমাপ করতে ব্যবহৃত হয়?

A) Personality traits / ব্যক্তিত্বের বৈশিষ্ট্য B) Academic achievement / একাডেমিক কৃতিত্ব C) Occupational interests / পেশাগত আগ্রহ D) Mental disorders / মানসিক ব্যাধি

Correct Answer: C) Occupational interests / পেশাগত আগ্রহ

Explanation: The SVIB is one of the most widely used interest inventories. It helps individuals identify careers and occupations that are most likely to be satisfying for them by comparing their interests with those of people happily employed in various fields.
ব্যাখ্যা: SVIB সবচেয়ে বহুল ব্যবহৃত আগ্রহের তালিকাগুলির মধ্যে একটি। এটি ব্যক্তিদের সেই সব পেশা চিহ্নিত করতে সাহায্য করে যা তাদের জন্য সন্তোষজনক হওয়ার সম্ভাবনা বেশি, কারণ এটি তাদের আগ্রহকে বিভিন্ন ক্ষেত্রে সফলভাবে কর্মরত ব্যক্তিদের আগ্রহের সাথে তুলনা করে।

17. Which of the following is an objective type of test item?
নিচের কোনটি একটি বস্তুনিষ্ঠ ধরনের পরীক্ষার প্রশ্ন?

A) Essay type / প্রবন্ধমূলক B) Short answer type / সংক্ষিপ্ত উত্তরধর্মী C) Multiple choice type / বহুনির্বাচনী D) Oral examination / মৌখিক পরীক্ষা

Correct Answer: C) Multiple choice type / বহুনির্বাচনী

Explanation: Objective test items are those that can be scored without any subjective judgment from the scorer. Multiple choice, true/false, and matching items are examples. The answer is fixed and not open to interpretation.
ব্যাখ্যা: বস্তুনিষ্ঠ পরীক্ষার প্রশ্ন সেগুলিই যেগুলি পরীক্ষকের কোনো ব্যক্তিগত বিচার ছাড়াই স্কোর করা যায়। বহুনির্বাচনী, সত্য/মিথ্যা এবং মেলানো প্রশ্নগুলি এর উদাহরণ। উত্তর নির্দিষ্ট এবং ব্যাখ্যার জন্য উন্মুক্ত নয়।

18. The concept of Intelligence Quotient (IQ) was first suggested by:
বুদ্ধ্যাঙ্ক বা ইন্টেলিজেন্স কোশেন্ট (IQ)-এর ধারণাটি প্রথম কে প্রস্তাব করেন?

A) William Stern / উইলিয়াম স্টার্ন B) Alfred Binet / আলফ্রেড বিনে C) Lewis Terman / লুইস টারম্যান D) David Wechsler / ডেভিড ওয়েক্সলার

Correct Answer: A) William Stern / উইলিয়াম স্টার্ন

Explanation: While Binet developed the concept of mental age, it was the German psychologist William Stern who, in 1912, proposed the formula for the Intelligence Quotient (IQ) as Mental Age (MA) divided by Chronological Age (CA). Lewis Terman later multiplied this by 100.
ব্যাখ্যা: যদিও বিনে মানসিক বয়সের ধারণাটি তৈরি করেছিলেন, তবে জার্মান মনোবিজ্ঞানী উইলিয়াম স্টার্ন ১৯১২ সালে মানসিক বয়সকে (MA) প্রকৃত বয়স (CA) দ্বারা ভাগ করে বুদ্ধ্যাঙ্কের (IQ) সূত্রটি প্রস্তাব করেন। পরে লুইস টারম্যান এটিকে ১০০ দ্বারা গুণ করেন।
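
Illustration: the ratio IQ described above (MA divided by CA, multiplied by 100) can be written as a small Python function. The function name and the sample ages are assumptions made for illustration only.

```python
def ratio_iq(mental_age, chronological_age):
    """Ratio IQ = (Mental Age / Chronological Age) x 100."""
    if chronological_age <= 0:
        raise ValueError("chronological age must be positive")
    return 100 * mental_age / chronological_age


# A child whose mental age equals their chronological age scores 100.
print(ratio_iq(8, 8))    # 100.0
# A child of 8 performing like a typical 10-year-old scores above 100.
print(ratio_iq(10, 8))   # 125.0
```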

19. Standardization of a test involves establishing:
একটি পরীক্ষার মাননির্ণয় (Standardization) কী প্রতিষ্ঠা করাকে বোঝায়?

A) Only validity / শুধুমাত্র বৈধতা B) Only reliability / শুধুমাত্র নির্ভরযোগ্যতা C) Norms, validity, and reliability / নর্ম, বৈধতা এবং নির্ভরযোগ্যতা D) Only the scoring key / শুধুমাত্র স্কোরিং কী

Correct Answer: C) Norms, validity, and reliability / নর্ম, বৈধতা এবং নির্ভরযোগ্যতা

Explanation: Standardization is a comprehensive process that includes developing uniform procedures for administration and scoring, and establishing norms (standards for comparison) by administering the test to a large, representative sample. This process also critically involves checking for validity and reliability.
ব্যাখ্যা: মাননির্ণয় একটি ব্যাপক প্রক্রিয়া যা পরিচালনা ও স্কোরিংয়ের জন্য অভিন্ন পদ্ধতি তৈরি করা এবং একটি বৃহৎ, প্রতিনিধিত্বমূলক নমুনার উপর পরীক্ষা পরিচালনা করে নর্ম (তুলনার জন্য মান) প্রতিষ্ঠা করাকে অন্তর্ভুক্ত করে। এই প্রক্রিয়াটি বৈধতা এবং নির্ভরযোগ্যতা যাচাইয়ের সাথেও জড়িত।

20. The grading system (e.g., A, B, C) is a suggestion to improve the examination system because it:
গ্রেডিং সিস্টেম (যেমন, A, B, C) পরীক্ষা ব্যবস্থার উন্নতির জন্য একটি পরামর্শ কারণ এটি:

A) Increases competition among students / শিক্ষার্থীদের মধ্যে প্রতিযোগিতা বাড়ায় B) Reduces the fine-grained, often misleading, distinctions of numerical marks / সংখ্যার সূক্ষ্ম এবং প্রায়শই বিভ্রান্তিকর পার্থক্য হ্রাস করে C) Is easier for teachers to calculate / শিক্ষকদের গণনা করা সহজ D) Promotes rote learning / মুখস্থ বিদ্যাকে উৎসাহিত করে

Correct Answer: B) Reduces the fine-grained, often misleading, distinctions of numerical marks / সংখ্যার সূক্ষ্ম এবং প্রায়শই বিভ্রান্তিকর পার্থক্য হ্রাস করে

Explanation: A major criticism of numerical marking is the illusion of precision. Is a score of 75 truly different from 76? Grading groups students into broader performance bands, reducing undue stress and unhealthy competition over minor differences in marks.
ব্যাখ্যা: সংখ্যাসূচক নম্বরের একটি প্রধান সমালোচনা হল নির্ভুলতার বিভ্রম। ৭৫ স্কোর কি সত্যিই ৭৬ থেকে আলাদা? গ্রেডিং শিক্ষার্থীদের পারফরম্যান্সের বৃহত্তর ব্যান্ডে বিভক্ত করে, যা নম্বরের সামান্য পার্থক্যের উপর ভিত্তি করে অপ্রয়োজনীয় চাপ এবং অস্বাস্থ্যকর প্রতিযোগিতা হ্রাস করে।
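
Illustration: a short Python sketch of how grading collapses nearby marks into one band. The band cutoffs (80, 60, 40) and the grade letters are hypothetical and do not come from any particular board's scheme.

```python
def to_grade(mark, bands=((80, "A"), (60, "B"), (40, "C"))):
    """Map a numerical mark to a letter grade using descending cutoffs.

    The default bands are assumed values for illustration.
    """
    for cutoff, grade in bands:
        if mark >= cutoff:
            return grade
    return "D"


# 75 and 76 differ as marks but fall in the same performance band.
print(to_grade(75), to_grade(76))
```

Marks that sit on either side of a cutoff (for example 79 and 80 here) still end up in different bands, so the choice of band boundaries matters.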

21. Which tool is most suitable for assessing social relationships and group structure within a classroom?
একটি শ্রেণিকক্ষের মধ্যে সামাজিক সম্পর্ক এবং গোষ্ঠী কাঠামো মূল্যায়নের জন্য কোন উপকরণটি সবচেয়ে উপযুক্ত?

A) Anecdotal Record / ঘটনাপঞ্জী B) Checklist / চেকলিস্ট C) Rating Scale / রেটিং স্কেল D) Sociometry / সমাজমিতি

Correct Answer: D) Sociometry / সমাজমিতি

Explanation: Sociometry is a technique used to measure the social choices and interpersonal relationships of members within a group. It helps identify leaders (‘stars’), isolates, and cliques within the classroom.
ব্যাখ্যা: সমাজমিতি একটি গোষ্ঠীর সদস্যদের সামাজিক পছন্দ এবং আন্তঃব্যক্তিক সম্পর্ক পরিমাপ করার জন্য ব্যবহৃত একটি কৌশল। এটি শ্রেণিকক্ষের মধ্যে নেতা (‘স্টার’), বিচ্ছিন্ন ব্যক্তি এবং গোষ্ঠী সনাক্ত করতে সাহায্য করে।
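
Illustration: sociometric data is often summarised by counting how many times each student is chosen by classmates. The sketch below assumes a simple dictionary of nominations. The student names, the function name, and the rule that an "isolate" is anyone who receives no choices are assumptions made for illustration.

```python
from collections import Counter


def choice_counts(choices):
    """Count how many times each student is chosen by classmates.

    `choices` maps each student to the list of classmates they nominate.
    """
    counts = Counter({name: 0 for name in choices})
    for chosen in choices.values():
        for name in chosen:
            counts[name] += 1
    return counts


# Hypothetical nominations from four students.
choices = {
    "Asha": ["Bina", "Rafi"],
    "Bina": ["Rafi"],
    "Rafi": ["Bina"],
    "Tuli": ["Rafi"],
}
counts = choice_counts(choices)
star = max(counts, key=counts.get)                      # most-chosen student
isolates = sorted(n for n, c in counts.items() if c == 0)  # chosen by no one
print(star, isolates)
```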

22. An anecdotal record is a:
একটি ঘটনাপঞ্জী (Anecdotal record) হল:

A) Quantitative summary of behavior / আচরণের একটি পরিমাণগত সারাংশ B) Brief, objective description of a significant student behavior / একজন শিক্ষার্থীর একটি গুরুত্বপূর্ণ আচরণের সংক্ষিপ্ত, বস্তুনিষ্ঠ বর্ণনা C) A checklist of skills mastered / আয়ত্ত করা দক্ষতার একটি চেকলিস্ট D) A rating of a student’s personality / একজন শিক্ষার্থীর ব্যক্তিত্বের একটি রেটিং

Correct Answer: B) Brief, objective description of a significant student behavior / একজন শিক্ষার্থীর একটি গুরুত্বপূর্ণ আচরণের সংক্ষিপ্ত, বস্তুনিষ্ঠ বর্ণনা

Explanation: An anecdotal record is a factual, narrative description of a specific incident of a student’s behavior. It should be objective and focus on what was said and done, without interpretation.
ব্যাখ্যা: একটি ঘটনাপঞ্জী হল একজন শিক্ষার্থীর আচরণের একটি নির্দিষ্ট ঘটনার বাস্তবসম্মত, বর্ণনামূলক বিবরণ। এটি বস্তুনিষ্ঠ হওয়া উচিত এবং ব্যাখ্যা ছাড়াই কী বলা হয়েছিল ও করা হয়েছিল তার উপর ফোকাস করা উচিত।

23. The scope of evaluation is ____ than the scope of measurement.
মূল্যায়নের পরিধি পরিমাপের পরিধির চেয়ে ____।

A) Narrower / সংকীর্ণ B) The same / একই C) Broader / ব্যাপক D) Less significant / কম তাৎপর্যপূর্ণ

Correct Answer: C) Broader / ব্যাপক

Explanation: Evaluation is a comprehensive term. It uses the results of measurement (quantitative data) along with other qualitative information to make a value judgment. Measurement is just one part of the evaluation process.
ব্যাখ্যা: মূল্যায়ন একটি ব্যাপক পরিভাষা। এটি মূল্য বিচার করার জন্য পরিমাপের ফলাফলের (পরিমাণগত তথ্য) সাথে অন্যান্য গুণগত তথ্য ব্যবহার করে। পরিমাপ হল মূল্যায়ন প্রক্রিয়ার একটি অংশ মাত্র।

24. Wechsler Adult Intelligence Scale (WAIS) provides scores for:
ওয়েক্সলার অ্যাডাল্ট ইন্টেলিজেন্স স্কেল (WAIS) কিসের জন্য স্কোর প্রদান করে?

A) Only Verbal IQ / শুধুমাত্র ভার্বাল আইকিউ B) Only Performance IQ / শুধুমাত্র পারফরম্যান্স আইকিউ C) Verbal IQ, Performance IQ, and a Full-Scale IQ / ভার্বাল আইকিউ, পারফরম্যান্স আইকিউ, এবং একটি ফুল-স্কেল আইকিউ D) Emotional IQ / ইমোশনাল আইকিউ

Correct Answer: C) Verbal IQ, Performance IQ, and a Full-Scale IQ / ভার্বাল আইকিউ, পারফরম্যান্স আইকিউ, এবং একটি ফুল-স্কেল আইকিউ

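Explanation: A key feature of the classic Wechsler scales (including earlier editions of WAIS for adults and WISC for children) is that they are divided into verbal and performance subtests, which yield separate scores and a combined Full-Scale IQ score. The most recent editions report index scores in place of separate Verbal and Performance IQs.
ব্যাখ্যা: ওয়েক্সলার স্কেলগুলির প্রচলিত সংস্করণগুলির (প্রাপ্তবয়স্কদের জন্য WAIS এবং শিশুদের জন্য WISC-এর আগের সংস্করণ সহ) একটি প্রধান বৈশিষ্ট্য হল যে এগুলি ভার্বাল এবং পারফরম্যান্স উপ-পরীক্ষায় বিভক্ত, যা পৃথক স্কোর এবং একটি সম্মিলিত ফুল-স্কেল আইকিউ স্কোর প্রদান করে। সাম্প্রতিক সংস্করণগুলিতে পৃথক ভার্বাল ও পারফরম্যান্স আইকিউ-এর পরিবর্তে সূচক স্কোর দেওয়া হয়।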

25. A portfolio is a tool for evaluation that primarily assesses:
পোর্টফোলিও মূল্যায়নের একটি উপকরণ যা প্রাথমিকভাবে কী মূল্যায়ন করে?

A) A student’s performance on a single day / একদিনে একজন শিক্ষার্থীর পারফরম্যান্স B) A student’s growth and achievement over time / সময়ের সাথে সাথে একজন শিক্ষার্থীর বৃদ্ধি এবং কৃতিত্ব C) A student’s rank compared to others / অন্যদের তুলনায় একজন শিক্ষার্থীর র‍্যাঙ্ক D) A student’s innate intelligence / একজন শিক্ষার্থীর সহজাত বুদ্ধিমত্তা

Correct Answer: B) A student’s growth and achievement over time / সময়ের সাথে সাথে একজন শিক্ষার্থীর বৃদ্ধি এবং কৃতিত্ব

Explanation: A portfolio is a purposeful collection of a student’s work that demonstrates their efforts, progress, and achievements in one or more areas. It provides a more holistic view of learning than a single test.
ব্যাখ্যা: একটি পোর্টফোলিও হল একজন শিক্ষার্থীর কাজের একটি উদ্দেশ্যপূর্ণ সংগ্রহ যা এক বা একাধিক ক্ষেত্রে তাদের প্রচেষ্টা, অগ্রগতি এবং কৃতিত্ব প্রদর্শন করে। এটি একটি একক পরীক্ষার চেয়ে শেখার একটি অধিক সামগ্রিক চিত্র প্রদান করে।

26. “The test appears to be a good measure of the content just by looking at it.” This statement refers to:
“পরীক্ষাটি শুধু দেখেই বিষয়বস্তুর একটি ভালো পরিমাপক বলে মনে হচ্ছে।” এই বিবৃতিটি কী বোঝায়?

A) Content Validity / বিষয়বস্তুগত বৈধতা B) Face Validity / আপাত বৈধতা C) Construct Validity / গঠনগত বৈধতা D) Predictive Validity / ভবিষ্যদ্বাণীমূলক বৈধতা

Correct Answer: B) Face Validity / আপাত বৈধতা

Explanation: Face validity refers to the superficial appearance of a test. It’s about whether the test “looks like” it measures what it’s supposed to measure. While not a technical form of validity, it can be important for test-taker motivation.
ব্যাখ্যা: আপাত বৈধতা একটি পরীক্ষার বাহ্যিক চেহারা বোঝায়। এটি হল যে পরীক্ষাটি “দেখতে” যা পরিমাপ করার কথা তা পরিমাপ করছে বলে মনে হচ্ছে কিনা। যদিও এটি বৈধতার একটি প্রযুক্তিগত রূপ নয়, এটি পরীক্ষার্থীদের অনুপ্রেরণার জন্য গুরুত্বপূর্ণ হতে পারে।

27. Which type of test is best for measuring higher-order thinking skills like analysis and synthesis?
বিশ্লেষণ এবং সংশ্লেষণের মতো উচ্চ-স্তরের চিন্তাভাবনার দক্ষতা পরিমাপের জন্য কোন ধরনের পরীক্ষা সবচেয়ে ভালো?

A) True-False items / সত্য-মিথ্যা প্রশ্ন B) Matching items / মেলানো প্রশ্ন C) Multiple-choice items / বহুনির্বাচনী প্রশ্ন D) Essay type items / প্রবন্ধমূলক প্রশ্ন

Correct Answer: D) Essay type items / প্রবন্ধমূলক প্রশ্ন

Explanation: Essay questions require students to organize their thoughts, construct arguments, analyze information, and express ideas in their own words. This makes them well-suited for assessing complex cognitive skills, whereas objective items are better for testing recall and recognition.
ব্যাখ্যা: প্রবন্ধমূলক প্রশ্নগুলিতে শিক্ষার্থীদের তাদের চিন্তাভাবনা সংগঠিত করতে, যুক্তি তৈরি করতে, তথ্য বিশ্লেষণ করতে এবং নিজের ভাষায় ধারণা প্রকাশ করতে হয়। এটি তাদের জটিল জ্ঞানীয় দক্ষতা মূল্যায়নের জন্য উপযুক্ত করে তোলে, যেখানে বস্তুনিষ্ঠ প্রশ্নগুলি স্মরণ এবং সনাক্তকরণ পরীক্ষার জন্য ভালো।

28. The difficulty value of a test item is considered ideal when it is around:
একটি পরীক্ষার প্রশ্নের কাঠিন্য মান (difficulty value) কখন আদর্শ বলে মনে করা হয়?

A) 0.90 B) 0.10 C) 0.50 D) 1.00

Correct Answer: C) 0.50

Explanation: The difficulty value (or p-value) is the proportion of students who answer an item correctly. A value of 0.50 (meaning 50% got it right) provides the maximum discrimination between high and low-achieving students. Items that are too easy (p close to 1.0) or too hard (p close to 0.0) are less effective at differentiating students.
ব্যাখ্যা: কাঠিন্য মান (বা p-মান) হল সেই শিক্ষার্থীদের অনুপাত যারা একটি প্রশ্নের সঠিক উত্তর দেয়। 0.50-এর একটি মান (অর্থাৎ ৫০% সঠিক উত্তর দিয়েছে) উচ্চ এবং নিম্ন পারফর্মিং শিক্ষার্থীদের মধ্যে সর্বোচ্চ পার্থক্য তৈরি করে। যে প্রশ্নগুলি খুব সহজ (p-মান 1.0 এর কাছাকাছি) বা খুব কঠিন (p-মান 0.0 এর কাছাকাছি) সেগুলি শিক্ষার্থীদের মধ্যে পার্থক্য করতে কম কার্যকর।
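
Illustration: the difficulty value is simply the proportion of correct responses, and one common way to see why 0.50 is favoured is that the variance of a right/wrong item, p(1 - p), is largest at p = 0.50. The Python sketch below uses assumed function names and made-up response data.

```python
def difficulty_value(responses):
    """Proportion of students answering the item correctly (1 = right, 0 = wrong)."""
    return sum(responses) / len(responses)


def item_variance(p):
    """Variance of a right/wrong item with difficulty value p, i.e. p * (1 - p)."""
    return p * (1 - p)


# Hypothetical responses of ten students to one item.
responses = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]
p = difficulty_value(responses)
print(p)                          # 0.5

# The item spreads students out most when p is near 0.5.
for value in (0.1, 0.5, 0.9):
    print(value, item_variance(value))
```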

29. A test given at the beginning of an instructional unit to determine students’ prior knowledge is called a:
একটি নির্দেশনামূলক ইউনিটের শুরুতে শিক্ষার্থীদের পূর্ববর্তী জ্ঞান নির্ধারণের জন্য যে পরীক্ষা নেওয়া হয় তাকে বলে:

A) Summative test / সার্বিক অভীক্ষা B) Diagnostic test / নির্ণায়ক অভীক্ষা C) Placement test / স্থাননির্ণায়ক অভীক্ষা D) Achievement test / পারদর্শিতার অভীক্ষা

Correct Answer: C) Placement test / স্থাননির্ণায়ক অভীক্ষা

Explanation: Placement evaluation (often done with a placement test) is used to determine a student’s entry-level performance and knowledge to decide where they should be ‘placed’ in an instructional sequence. Diagnostic tests are more focused on identifying specific learning difficulties.
ব্যাখ্যা: স্থাননির্ণায়ক মূল্যায়ন (প্রায়ই একটি স্থাননির্ণায়ক পরীক্ষার মাধ্যমে করা হয়) একজন শিক্ষার্থীর প্রবেশ-স্তরের পারফরম্যান্স এবং জ্ঞান নির্ধারণ করতে ব্যবহৃত হয়, যাতে সিদ্ধান্ত নেওয়া যায় যে তাকে একটি নির্দেশনামূলক অনুক্রমের কোথায় ‘স্থান’ দেওয়া উচিত। নির্ণায়ক অভীক্ষাগুলি নির্দিষ্ট শেখার অসুবিধাগুলি চিহ্নিত করার উপর বেশি মনোনিবেশ করে।

30. The Minnesota Multiphasic Personality Inventory (MMPI) is what type of test?
মিনেসোটা মাল্টিফেজিক পার্সোনালিটি ইনভেন্টরি (MMPI) কোন ধরনের পরীক্ষা?

A) A projective test / একটি প্রক্ষেপণমূলক পরীক্ষা B) An interest inventory / একটি আগ্রহের তালিকা C) An objective personality inventory (self-report) / একটি বস্তুনিষ্ঠ ব্যক্তিত্বের তালিকা (স্ব-বিবরণী) D) An intelligence test / একটি বুদ্ধিমত্তার পরীক্ষা

Correct Answer: C) An objective personality inventory (self-report) / একটি বস্তুনিষ্ঠ ব্যক্তিত্বের তালিকা (স্ব-বিবরণী)

Explanation: The MMPI is a standardized, self-report questionnaire where individuals respond to a large number of true/false statements. It’s considered an objective test because the scoring is standardized and does not rely on the scorer’s interpretation, unlike projective tests.
ব্যাখ্যা: MMPI একটি মানসম্মত, স্ব-বিবরণী প্রশ্নাবলী যেখানে ব্যক্তিরা প্রচুর সত্য/মিথ্যা বিবৃতির প্রতিক্রিয়া জানায়। এটি একটি বস্তুনিষ্ঠ পরীক্ষা হিসাবে বিবেচিত হয় কারণ স্কোরিং মানসম্মত এবং প্রক্ষেপণমূলক পরীক্ষার মতো পরীক্ষকের ব্যাখ্যার উপর নির্ভর করে না।

31. Continuous and Comprehensive Evaluation (CCE) aims to:
নিরবচ্ছিন্ন এবং ব্যাপক মূল্যায়নের (CCE) লক্ষ্য হল:

A) Evaluate only scholastic aspects / শুধুমাত্র পুঁথিগত দিক মূল্যায়ন করা B) Reduce the workload of teachers / শিক্ষকদের কাজের চাপ কমানো C) Evaluate both scholastic and co-scholastic aspects of a child’s growth / একটি শিশুর বৃদ্ধির পুঁথিগত এবং সহ-পুঁথিগত উভয় দিক মূল্যায়ন করা D) Conduct examinations more frequently / আরও ঘন ঘন পরীক্ষা পরিচালনা করা

Correct Answer: C) Evaluate both scholastic and co-scholastic aspects of a child’s growth / একটি শিশুর বৃদ্ধির পুঁথিগত এবং সহ-পুঁথিগত উভয় দিক মূল্যায়ন করা

Explanation: CCE is a system of evaluation that covers all aspects of a student’s development. ‘Continuous’ refers to evaluation throughout the year, and ‘Comprehensive’ refers to covering both academic (scholastic) and non-academic (co-scholastic) areas like life skills, attitudes, and values.
ব্যাখ্যা: CCE হল মূল্যায়নের একটি ব্যবস্থা যা একজন শিক্ষার্থীর বিকাশের সমস্ত দিককে অন্তর্ভুক্ত করে। ‘নিরবচ্ছিন্ন’ বলতে সারা বছর ধরে মূল্যায়ন বোঝায়, এবং ‘ব্যাপক’ বলতে জীবন দক্ষতা, মনোভাব এবং মূল্যের মতো পুঁথিগত এবং সহ-পুঁথিগত উভয় ক্ষেত্রকে অন্তর্ভুক্ত করা বোঝায়।

32. The term ‘measurement’ is primarily concerned with:
‘পরিমাপ’ শব্দটি প্রাথমিকভাবে কীসের সাথে সম্পর্কিত?

A) Value judgment / মূল্য বিচার B) Assigning a numerical value / একটি সংখ্যাসূচক মান নির্ধারণ C) Overall development / সামগ্রিক উন্নয়ন D) Future performance / ভবিষ্যৎ কর্মক্ষমতা

Correct Answer: B) Assigning a numerical value / একটি সংখ্যাসূচক মান নির্ধারণ

Explanation: At its core, measurement is the process of quantifying a characteristic. It provides the “data” or “score” (e.g., 75 out of 100), but does not, by itself, judge whether that score is good or bad. That judgment is part of evaluation.
ব্যাখ্যা: মূলতঃ, পরিমাপ হল একটি বৈশিষ্ট্যের পরিমাণ নির্ধারণের প্রক্রিয়া। এটি “তথ্য” বা “স্কোর” প্রদান করে (যেমন, ১০০ এর মধ্যে ৭৫), কিন্তু এই স্কোরটি ভালো না খারাপ, তা নিজে থেকে বিচার করে না। সেই বিচারটি মূল্যায়নের অংশ।

33. If a test is reliable, it means that:
যদি একটি পরীক্ষা নির্ভরযোগ্য হয়, তার মানে হল:

A) It is also valid / এটি বৈধও বটে B) It yields consistent results / এটি সামঞ্জস্যপূর্ণ ফলাফল দেয় C) It measures what it is supposed to measure / এটি যা পরিমাপ করার কথা তা-ই পরিমাপ করে D) It is easy to score / এটি স্কোর করা সহজ

Correct Answer: B) It yields consistent results / এটি সামঞ্জস্যপূর্ণ ফলাফল দেয়

Explanation: Reliability is synonymous with consistency. A reliable test will produce similar scores for a person who takes it multiple times. Note that a test can be reliable without being valid (e.g., a scale that consistently shows the wrong weight is reliable but not valid).
ব্যাখ্যা: নির্ভরযোগ্যতা সামঞ্জস্যের সমার্থক। একটি নির্ভরযোগ্য পরীক্ষা এমন একজন ব্যক্তির জন্য অনুরূপ স্কোর তৈরি করবে যে এটি একাধিকবার দেয়। মনে রাখবেন যে একটি পরীক্ষা বৈধ না হয়েও নির্ভরযোগ্য হতে পারে (যেমন, একটি স্কেল যা ধারাবাহিকভাবে ভুল ওজন দেখায় তা নির্ভরযোগ্য কিন্তু বৈধ নয়)।

34. Diagnostic evaluation is aimed at:
নির্ণায়ক মূল্যায়নের লক্ষ্য হল:

A) Grading the students / শিক্ষার্থীদের গ্রেড দেওয়া B) Finding out the learning difficulties of students / শিক্ষার্থীদের শেখার অসুবিধাগুলি খুঁজে বের করা C) Assessing the suitability of a candidate / একজন প্রার্থীর উপযুক্ততা মূল্যায়ন করা D) Comparing students’ performance / শিক্ষার্থীদের পারফরম্যান্স তুলনা করা

Correct Answer: B) Finding out the learning difficulties of students / শিক্ষার্থীদের শেখার অসুবিধাগুলি খুঁজে বের করা

Explanation: Diagnostic evaluation goes deeper than formative evaluation. Its purpose is to identify the specific causes of persistent learning problems and to formulate a plan for remedial action.
ব্যাখ্যা: নির্ণায়ক মূল্যায়ন গঠনমূলক মূল্যায়নের চেয়ে গভীরে যায়। এর উদ্দেশ্য হল ক্রমাগত শেখার সমস্যার নির্দিষ্ট কারণগুলি চিহ্নিত করা এবং প্রতিকারমূলক ব্যবস্থার জন্য একটি পরিকল্পনা তৈরি করা।

35. The 16 Personality Factor Questionnaire (16PF) was developed by:
16 পার্সোনালিটি ফ্যাক্টর কোয়েশ্চনেয়ার (16PF) কে তৈরি করেন?

A) Gordon Allport / গর্ডন অলপোর্ট B) Raymond Cattell / রেমন্ড ক্যাটেল C) Carl Jung / কার্ল ইয়ুং D) Sigmund Freud / সিগমুন্ড ফ্রয়েড

Correct Answer: B) Raymond Cattell / রেমন্ড ক্যাটেল

Explanation: The 16PF is a self-report personality inventory developed by Raymond B. Cattell, Maurice Tatsuoka and Herbert Eber. It is based on Cattell’s trait theory of personality and measures 16 primary personality traits.
ব্যাখ্যা: 16PF হল রেমন্ড বি. ক্যাটেল, মরিস তাতসুওকা এবং হার্বার্ট ইবার দ্বারা তৈরি একটি স্ব-বিবরণী ব্যক্তিত্বের তালিকা। এটি ক্যাটেলের ব্যক্তিত্বের বৈশিষ্ট্য তত্ত্বের উপর ভিত্তি করে এবং ১৬টি প্রাথমিক ব্যক্তিত্বের বৈশিষ্ট্য পরিমাপ করে।

36. Which of the following is NOT a characteristic of a good test?
নিম্নলিখিত কোনটি একটি ভালো পরীক্ষার বৈশিষ্ট্য নয়?

A) Objectivity / বস্তুনিষ্ঠতা B) Reliability / নির্ভরযোগ্যতা C) Subjectivity / ব্যক্তিনিষ্ঠতা D) Validity / বৈধতা

Correct Answer: C) Subjectivity / ব্যক্তিনিষ্ঠতা

Explanation: A good test should be as objective as possible, meaning the scoring is not influenced by the scorer’s personal judgment or bias. Subjectivity, where scoring can vary from person to person, is considered a weakness in testing, especially in large-scale assessments.
ব্যাখ্যা: একটি ভালো পরীক্ষা যতটা সম্ভব বস্তুনিষ্ঠ হওয়া উচিত, যার অর্থ হল স্কোরিং পরীক্ষকের ব্যক্তিগত বিচার বা পক্ষপাত দ্বারা প্রভাবিত হয় না। ব্যক্তিনিষ্ঠতা, যেখানে স্কোরিং ব্যক্তিভেদে পরিবর্তিত হতে পারে, তা পরীক্ষায় একটি দুর্বলতা হিসাবে বিবেচিত হয়, বিশেষ করে বড় আকারের মূল্যায়নে।

37. An oral test (viva-voce) is a technique of:
মৌখিক পরীক্ষা (viva-voce) কিসের একটি কৌশল?

A) Measurement / পরিমাপ B) Evaluation / মূল্যায়ন C) Observation / পর্যবেক্ষণ D) Examination / পরীক্ষা

Correct Answer: B) Evaluation / মূল্যায়ন

Explanation: An oral test is a comprehensive evaluation technique. While it involves measurement (assigning a score), its primary strength is in evaluating qualities that are hard to assess on paper, such as communication skills, confidence, and depth of understanding. ‘Examination’ is a broad term, but ‘Evaluation’ best describes the holistic process of a viva-voce.
ব্যাখ্যা: মৌখিক পরীক্ষা একটি ব্যাপক মূল্যায়ন কৌশল। যদিও এটি পরিমাপের সাথে জড়িত (একটি স্কোর নির্ধারণ), এর প্রধান শক্তি হল সেই সব গুণাবলী মূল্যায়ন করা যা কাগজে মূল্যায়ন করা কঠিন, যেমন যোগাযোগ দক্ষতা, আত্মবিশ্বাস এবং বোঝার গভীরতা। ‘পরীক্ষা’ একটি ব্যাপক শব্দ, কিন্তু ‘মূল্যায়ন’ একটি ভাইভা-ভোস-এর সামগ্রিক প্রক্রিয়াকে সবচেয়ে ভালোভাবে বর্ণনা করে।

38. Test-retest method is used to determine a test’s:
টেস্ট-রিটেস্ট পদ্ধতি একটি পরীক্ষার কী নির্ধারণ করতে ব্যবহৃত হয়?

A) Validity / বৈধতা B) Reliability / নির্ভরযোগ্যতা C) Objectivity / বস্তুনিষ্ঠতা D) Norms / নর্ম

Correct Answer: B) Reliability / নির্ভরযোগ্যতা

Explanation: The test-retest method is a common way to measure a test’s reliability (specifically, its stability over time). It involves administering the same test to the same group of individuals on two different occasions and then correlating the two sets of scores.
ব্যাখ্যা: টেস্ট-রিটেস্ট পদ্ধতি একটি পরীক্ষার নির্ভরযোগ্যতা (বিশেষত, সময়ের সাথে এর স্থিতিশীলতা) পরিমাপ করার একটি সাধারণ উপায়। এতে একই গোষ্ঠীর ব্যক্তিদের দুটি ভিন্ন সময়ে একই পরীক্ষা দেওয়া হয় এবং তারপর দুই সেট স্কোরের মধ্যে সহসম্পর্ক নির্ণয় করা হয়।
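
As a rough illustration of the correlation step described above, the sketch below computes a Pearson coefficient between two sittings of the same test. The score lists and the helper name `pearson_r` are hypothetical, chosen only for this example; a real study would use a much larger group.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

# Hypothetical scores of the same five students on two occasions.
first_sitting = [12, 15, 9, 18, 14]
second_sitting = [13, 14, 10, 19, 15]

# The closer this value is to 1, the more stable the scores are over time.
test_retest_reliability = pearson_r(first_sitting, second_sitting)
print(round(test_retest_reliability, 2))
```

A coefficient near 1 suggests that students keep roughly the same relative standing across the two occasions.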

39. The main suggestion for improving the present examination system is to make it more:
বর্তমান পরীক্ষা ব্যবস্থার উন্নতির জন্য প্রধান পরামর্শ হল এটিকে আরও ____ করা।

A) Frequent and lengthy / ঘন ঘন এবং দীর্ঘ B) Valid, reliable and comprehensive / বৈধ, নির্ভরযোগ্য এবং ব্যাপক C) Based on rote memorization / মুখস্থ বিদ্যার উপর ভিত্তি করে D) Subjective and flexible / ব্যক্তিনিষ্ঠ এবং নমনীয়

Correct Answer: B) Valid, reliable and comprehensive / বৈধ, নির্ভরযোগ্য এবং ব্যাপক

Explanation: Key reforms aim to move away from memory-based, one-shot exams towards a system that is technically sound (valid and reliable) and holistic (comprehensive), assessing a wider range of skills and abilities over time.
ব্যাখ্যা: প্রধান সংস্কারগুলির লক্ষ্য হল স্মৃতি-ভিত্তিক, এককালীন পরীক্ষা থেকে সরে এসে এমন একটি ব্যবস্থার দিকে যাওয়া যা প্রযুক্তিগতভাবে সঠিক (বৈধ এবং নির্ভরযোগ্য) এবং সামগ্রিক (ব্যাপক), যা সময়ের সাথে সাথে বিস্তৃত দক্ষতা এবং ক্ষমতা মূল্যায়ন করে।

40. The Kuder Preference Record is a type of:
কুডার প্রেফারেন্স রেকর্ড কীসের একটি প্রকার?

A) Intelligence Test / বুদ্ধিমত্তার অভীক্ষা B) Personality Test / ব্যক্তিত্বের অভীক্ষা C) Interest Inventory / আগ্রহের তালিকা D) Achievement Test / পারদর্শিতার অভীক্ষা

Correct Answer: C) Interest Inventory / আগ্রহের তালিকা

Explanation: The Kuder Preference Record is a widely used vocational interest inventory that measures an individual’s preferences for activities across various broad interest areas (e.g., Outdoor, Mechanical, Computational, Artistic, Literary).
ব্যাখ্যা: কুডার প্রেফারেন্স রেকর্ড একটি বহুল ব্যবহৃত বৃত্তিমূলক আগ্রহের তালিকা যা বিভিন্ন বিস্তৃত আগ্রহের ক্ষেত্রে (যেমন, আউটডোর, যান্ত্রিক, গণনামূলক, শৈল্পিক, সাহিত্যিক) একজন ব্যক্তির ক্রিয়াকলাপের প্রতি পছন্দ পরিমাপ করে।

41. Which of the following is considered a ‘supply type’ test item?
নিম্নলিখিত কোনটি ‘সাপ্লাই টাইপ’ পরীক্ষার প্রশ্ন হিসাবে বিবেচিত হয়?

A) Multiple Choice / বহুনির্বাচনী B) True/False / সত্য/মিথ্যা C) Matching / মেলানো D) Completion (Fill-in-the-blanks) / শূন্যস্থান পূরণ

Correct Answer: D) Completion (Fill-in-the-blanks) / শূন্যস্থান পূরণ

Explanation: Test items are broadly classified into ‘selection type’ (where the student chooses from given options, like MCQ, T/F, Matching) and ‘supply type’ (where the student has to supply the answer, like short answer, completion, and essay).
ব্যাখ্যা: পরীক্ষার প্রশ্নগুলিকে বিস্তৃতভাবে ‘নির্বাচন প্রকার’ (যেখানে শিক্ষার্থী প্রদত্ত বিকল্পগুলি থেকে বেছে নেয়, যেমন MCQ, T/F, মেলানো) এবং ‘সাপ্লাই প্রকার’ (যেখানে শিক্ষার্থীকে উত্তর সরবরাহ করতে হয়, যেমন সংক্ষিপ্ত উত্তর, শূন্যস্থান পূরণ এবং প্রবন্ধ) হিসাবে শ্রেণীবদ্ধ করা হয়।

42. A test designed to predict future performance in a specific area is called a(n):
একটি নির্দিষ্ট ক্ষেত্রে ভবিষ্যতের কর্মক্ষমতা ভবিষ্যদ্বাণী করার জন্য ডিজাইন করা একটি পরীক্ষাকে কী বলা হয়?

A) Achievement Test / পারদর্শিতার অভীক্ষা B) Diagnostic Test / নির্ণায়ক অভীক্ষা C) Aptitude Test / প্রবণতা অভীক্ষা D) Summative Test / সার্বিক অভীক্ষা

Correct Answer: C) Aptitude Test / প্রবণতা অভীক্ষা

Explanation: Aptitude tests measure a person’s potential to learn or develop proficiency in a certain area. They are forward-looking and used for prediction, unlike achievement tests which measure past learning.
ব্যাখ্যা: প্রবণতা অভীক্ষা একটি নির্দিষ্ট ক্ষেত্রে শেখার বা দক্ষতা বিকাশের জন্য একজন ব্যক্তির সম্ভাবনা পরিমাপ করে। এগুলি ভবিষ্যৎমুখী এবং ভবিষ্যদ্বাণীর জন্য ব্যবহৃত হয়, পারদর্শিতার অভীক্ষার মতো নয়, যা অতীতের শেখা পরিমাপ করে।

43. The objectivity of a test is most affected by:
একটি পরীক্ষার বস্তুনিষ্ঠতা সবচেয়ে বেশি প্রভাবিত হয় কীসের দ্বারা?

A) The length of the test / পরীক্ষার দৈর্ঘ্য B) The type of questions (e.g., essay vs. MCQ) / প্রশ্নের ধরন (যেমন, প্রবন্ধ বনাম MCQ) C) The time given for the test / পরীক্ষার জন্য দেওয়া সময় D) The difficulty of the test / পরীক্ষার কাঠিন্য

Correct Answer: B) The type of questions (e.g., essay vs. MCQ) / প্রশ্নের ধরন (যেমন, প্রবন্ধ বনাম MCQ)

Explanation: Objectivity refers to the degree to which a test is free from scorer bias. Multiple-choice questions (MCQs) are highly objective because the scoring key is fixed. Essay questions are highly subjective because different scorers can award different marks for the same answer.
ব্যাখ্যা: বস্তুনিষ্ঠতা বলতে বোঝায় একটি পরীক্ষা কতটা পরীক্ষকের পক্ষপাত থেকে মুক্ত। বহুনির্বাচনী প্রশ্ন (MCQ) অত্যন্ত বস্তুনিষ্ঠ কারণ স্কোরিং কী নির্দিষ্ট। প্রবন্ধমূলক প্রশ্নগুলি অত্যন্ত ব্যক্তিনিষ্ঠ কারণ বিভিন্ন পরীক্ষক একই উত্তরের জন্য বিভিন্ন নম্বর দিতে পারেন।

44. “Evaluation is the assignment of symbols to phenomena in order to characterize the worth of a phenomenon.” Who said this?
“মূল্যায়ন হল কোনো ঘটনার মূল্য চিহ্নিত করার জন্য সেই ঘটনাকে প্রতীক বরাদ্দ করা।” – এই উক্তিটি কার?

A) Cronbach / ক্রনব্যাক B) Bradfield and Moredock / ব্র্যাডফিল্ড ও মোরডক C) Gronlund / গ্রোনলান্ড D) Ebel / ইবেল

Correct Answer: B) Bradfield and Moredock / ব্র্যাডফিল্ড ও মোরডক

Explanation: This is a classic definition of evaluation, generally attributed to J. M. Bradfield and H. S. Moredock (1957). It emphasizes the “value judgment” or “worth” aspect of evaluation.
ব্যাখ্যা: এটি মূল্যায়নের একটি ক্লাসিক সংজ্ঞা, যা সাধারণত জে. এম. ব্র্যাডফিল্ড এবং এইচ. এস. মোরডক (১৯৫৭)-এর বলে গণ্য করা হয়। এটি মূল্যায়নের “মূল্য বিচার” বা “যোগ্যতা”র দিকটির উপর জোর দেয়।

45. A checklist is essentially a:
একটি চেকলিস্ট মূলত একটি:

A) List of traits with a scale to rate them / বৈশিষ্ট্যগুলির একটি তালিকা যা রেট করার জন্য একটি স্কেল সহ থাকে B) Method of recording subjective impressions / ব্যক্তিগত ধারণা রেকর্ড করার একটি পদ্ধতি C) List of behaviors or characteristics where the observer just marks their presence or absence / আচরণ বা বৈশিষ্ট্যের একটি তালিকা যেখানে পর্যবেক্ষক কেবল তাদের উপস্থিতি বা অনুপস্থিতি চিহ্নিত করেন D) Tool for sociometric analysis / সমাজমিতিক বিশ্লেষণের জন্য একটি উপকরণ

Correct Answer: C) List of behaviors or characteristics where the observer just marks their presence or absence / আচরণ বা বৈশিষ্ট্যের একটি তালিকা যেখানে পর্যবেক্ষক কেবল তাদের উপস্থিতি বা অনুপস্থিতি চিহ্নিত করেন

Explanation: A checklist is a simple tool where an observer checks off items from a list as they are observed. It is a binary (yes/no, present/absent) method and does not indicate the quality or degree of the behavior, unlike a rating scale.
ব্যাখ্যা: একটি চেকলিস্ট একটি সাধারণ উপকরণ যেখানে একজন পর্যবেক্ষক একটি তালিকা থেকে আইটেমগুলি পর্যবেক্ষণ করার সাথে সাথে চিহ্নিত করেন। এটি একটি বাইনারি (হ্যাঁ/না, উপস্থিত/অনুপস্থিত) পদ্ধতি এবং একটি রেটিং স্কেলের মতো আচরণের গুণমান বা মাত্রা নির্দেশ করে না।

46. The formula IQ = (MA/CA) x 100 was popularized by:
IQ = (MA/CA) x 100 সূত্রটি কে জনপ্রিয় করেন?

A) Alfred Binet / আলফ্রেড বিনে B) William Stern / উইলিয়াম স্টার্ন C) Lewis Terman / লুইস টারম্যান D) David Wechsler / ডেভিড ওয়েক্সলার

Correct Answer: C) Lewis Terman / লুইস টারম্যান

Explanation: While William Stern first proposed dividing Mental Age (MA) by Chronological Age (CA), it was Lewis Terman, in his 1916 revision of the Binet-Simon scale (the Stanford-Binet), who multiplied the ratio by 100 to remove the decimal and create the now-famous IQ formula.
ব্যাখ্যা: যদিও উইলিয়াম স্টার্ন প্রথমে মানসিক বয়সকে (MA) প্রকৃত বয়স (CA) দ্বারা ভাগ করার প্রস্তাব দিয়েছিলেন, তবে লুইস টারম্যান ১৯১৬ সালে বিনে-সাইমন স্কেলের (স্ট্যানফোর্ড-বিনে) সংশোধনে এই অনুপাতকে ১০০ দ্বারা গুণ করে দশমিক অপসারণ করেন এবং বর্তমানে বিখ্যাত আইকিউ সূত্রটি তৈরি করেন।
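
The ratio formula above can be checked with a small worked example. The function name `ratio_iq` and the ages used are illustrative assumptions, not part of the original question.

```python
def ratio_iq(mental_age, chronological_age):
    """Ratio IQ = (MA / CA) x 100, with both ages in the same unit."""
    if chronological_age <= 0:
        raise ValueError("chronological age must be positive")
    return mental_age / chronological_age * 100

# Hypothetical child: mental age 10 years, chronological age 8 years.
print(ratio_iq(10, 8))  # 125.0

# When mental age equals chronological age, the ratio IQ is exactly 100.
print(ratio_iq(8, 8))   # 100.0
```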

47. Split-half method is a technique to estimate:
স্প্লিট-হাফ পদ্ধতি কী অনুমান করার একটি কৌশল?

A) Content Validity / বিষয়বস্তুগত বৈধতা B) Predictive Validity / ভবিষ্যদ্বাণীমূলক বৈধতা C) Internal Consistency Reliability / অভ্যন্তরীণ সঙ্গতি নির্ভরযোগ্যতা D) Objectivity / বস্তুনিষ্ঠতা

Correct Answer: C) Internal Consistency Reliability / অভ্যন্তরীণ সঙ্গতি নির্ভরযোগ্যতা

Explanation: The split-half method measures a test’s internal consistency. The test is administered once, then divided into two equivalent halves (e.g., odd vs. even items), and the scores on the two halves are correlated. It assesses how consistently the items within a test measure the same construct.
ব্যাখ্যা: স্প্লিট-হাফ পদ্ধতি একটি পরীক্ষার অভ্যন্তরীণ সঙ্গতি পরিমাপ করে। পরীক্ষাটি একবার পরিচালনা করা হয়, তারপর দুটি সমতুল্য অর্ধে (যেমন, বিজোড় বনাম জোড় প্রশ্ন) ভাগ করা হয় এবং দুটি অর্ধেকের স্কোরের মধ্যে পারস্পরিক সম্পর্ক স্থাপন করা হয়। এটি মূল্যায়ন করে যে একটি পরীক্ষার মধ্যে প্রশ্নগুলি কতটা সঙ্গতভাবে একই গঠন পরিমাপ করে।
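
The odd/even procedure described above might be sketched as follows. The response matrix is hypothetical. The final Spearman-Brown step is an addition that the explanation does not mention: it is commonly applied because the correlation between two half-tests understates the reliability of the full-length test.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

def split_half(item_scores):
    """item_scores holds one list of 0/1 item scores per student."""
    odd_totals = [sum(row[0::2]) for row in item_scores]   # items 1, 3, 5, ...
    even_totals = [sum(row[1::2]) for row in item_scores]  # items 2, 4, 6, ...
    r_half = pearson_r(odd_totals, even_totals)
    # Spearman-Brown step-up to full test length (assumed extra step).
    r_full = 2 * r_half / (1 + r_half)
    return r_half, r_full

# Hypothetical responses of five students to a six-item test (1 = correct).
responses = [
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 0, 1, 1],
    [1, 0, 1, 1, 0, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
]

r_half, r_full = split_half(responses)
print(round(r_half, 2), round(r_full, 2))
```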

48. Which of the following is NOT a projective technique of personality measurement?
নিম্নলিখিত কোনটি ব্যক্তিত্ব পরিমাপের একটি প্রক্ষেপণমূলক কৌশল নয়?

A) Rorschach Inkblot Test / রোরশ্যাক ইংকব্লট টেস্ট B) Thematic Apperception Test (TAT) / থিমেটিক অ্যাপারসেপশন টেস্ট (TAT) C) Sentence Completion Test / বাক্য পূরণ পরীক্ষা D) Rating Scale / রেটিং স্কেল

Correct Answer: D) Rating Scale / রেটিং স্কেল

Explanation: Projective techniques involve ambiguous stimuli. Rorschach, TAT, and Sentence Completion tests all ask the subject to respond to ambiguous prompts. A rating scale, however, is a non-projective, structured tool where a rater provides a judgment on a specific trait along a continuum.
ব্যাখ্যা: প্রক্ষেপণমূলক কৌশলগুলিতে অস্পষ্ট উদ্দীপক ব্যবহৃত হয়। রোরশ্যাক, TAT, এবং বাক্য পূরণ পরীক্ষা সবই পরীক্ষার্থীকে অস্পষ্ট উদ্দীপকে সাড়া দিতে বলে। অন্যদিকে, একটি রেটিং স্কেল হল একটি অ-প্রক্ষেপণমূলক, কাঠামোবদ্ধ উপকরণ যেখানে একজন রেটার একটি নির্দিষ্ট বৈশিষ্ট্য সম্পর্কে একটি ধারাবাহিক মাত্রায় বিচার প্রদান করেন।

49. The process of item analysis in test construction primarily helps to:
পরীক্ষা নির্মাণে প্রশ্ন বিশ্লেষণের প্রক্রিয়াটি প্রাথমিকভাবে কী করতে সাহায্য করে?

A) Determine the length of the test / পরীক্ষার দৈর্ঘ্য নির্ধারণ করতে B) Select the best items and reject the poor ones / সেরা প্রশ্নগুলি নির্বাচন করতে এবং দুর্বলগুলি বাতিল করতে C) Set the time limit for the test / পরীক্ষার জন্য সময়সীমা নির্ধারণ করতে D) Standardize the administration procedure / পরিচালনা পদ্ধতিকে মানসম্মত করতে

Correct Answer: B) Select the best items and reject the poor ones / সেরা প্রশ্নগুলি নির্বাচন করতে এবং দুর্বলগুলি বাতিল করতে

Explanation: Item analysis involves calculating the difficulty value and discrimination index for each item. This data allows the test constructor to identify and discard items that are too easy, too hard, or do not effectively differentiate between high and low-performing students, thereby improving the overall quality of the test.
ব্যাখ্যা: প্রশ্ন বিশ্লেষণে প্রতিটি প্রশ্নের জন্য কাঠিন্য মান এবং পৃথকীকরণ সূচক গণনা করা জড়িত। এই ডেটা পরীক্ষা নির্মাতাকে সেইসব প্রশ্ন শনাক্ত করতে এবং বাতিল করতে দেয় যা খুব সহজ, খুব কঠিন, বা উচ্চ এবং নিম্ন-পারফর্মিং শিক্ষার্থীদের মধ্যে কার্যকরভাবে পার্থক্য করে না, এইভাবে পরীক্ষার সামগ্রিক মান উন্নত হয়।
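
To make the difficulty-value part of item analysis concrete, here is a minimal sketch that treats the difficulty value as the proportion of students answering an item correctly. The response data and the cut-offs of 0.2 and 0.8 are illustrative assumptions, not fixed standards.

```python
def difficulty_values(item_scores):
    """Proportion of students answering each item correctly."""
    n_students = len(item_scores)
    n_items = len(item_scores[0])
    return [sum(row[i] for row in item_scores) / n_students
            for i in range(n_items)]

def flag_items(p_values, low=0.2, high=0.8):
    """Label each item; low and high are assumed, illustrative cut-offs."""
    flags = []
    for p in p_values:
        if p < low:
            flags.append("too hard")
        elif p > high:
            flags.append("too easy")
        else:
            flags.append("keep")
    return flags

# Hypothetical responses of five students to four items (1 = correct).
responses = [
    [1, 1, 0, 0],
    [1, 0, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 1, 0],
    [1, 1, 1, 0],
]

p = difficulty_values(responses)
print(p)              # [1.0, 0.6, 0.6, 0.0]
print(flag_items(p))  # ['too easy', 'keep', 'keep', 'too hard']
```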

50. A major advantage of objective tests over essay tests is their:
প্রবন্ধমূলক পরীক্ষার তুলনায় বস্তুনিষ্ঠ পরীক্ষার একটি প্রধান সুবিধা হল তাদের:

A) Ability to measure complex thinking / জটিল চিন্তাভাবনা পরিমাপের ক্ষমতা B) Higher reliability in scoring / স্কোরিংয়ে উচ্চতর নির্ভরযোগ্যতা C) Freedom from guessing / অনুমান থেকে মুক্তি D) Ease of construction / নির্মাণের সহজতা

Correct Answer: B) Higher reliability in scoring / স্কোরিংয়ে উচ্চতর নির্ভরযোগ্যতা

Explanation: Because objective tests have a single correct answer and can be scored with a key, the scoring is highly reliable and consistent across different scorers. Essay tests suffer from scorer subjectivity, which lowers their scoring reliability.
ব্যাখ্যা: কারণ বস্তুনিষ্ঠ পরীক্ষার একটিমাত্র সঠিক উত্তর থাকে এবং একটি কী দিয়ে স্কোর করা যায়, তাই স্কোরিং অত্যন্ত নির্ভরযোগ্য এবং বিভিন্ন পরীক্ষকদের মধ্যে সামঞ্জস্যপূর্ণ। প্রবন্ধমূলক পরীক্ষাগুলি পরীক্ষকের ব্যক্তিনিষ্ঠতার কারণে ক্ষতিগ্রস্ত হয়, যা তাদের স্কোরিং নির্ভরযোগ্যতা হ্রাস করে।

51. The concept “Evaluation is a continuous process” is a core principle of:
“মূল্যায়ন একটি নিরবচ্ছিন্ন প্রক্রিয়া” ধারণাটি কিসের মূল নীতি?

A) Traditional Examination System / প্রচলিত পরীক্ষা ব্যবস্থা B) Continuous and Comprehensive Evaluation (CCE) / নিরবচ্ছিন্ন এবং ব্যাপক মূল্যায়ন (CCE) C) Norm-Referenced Testing / নর্ম-ভিত্তিক অভীক্ষা D) Standardized Testing / মানসম্মত অভীক্ষা

Correct Answer: B) Continuous and Comprehensive Evaluation (CCE) / নিরবচ্ছিন্ন এবং ব্যাপক মূল্যায়ন (CCE)

Explanation: CCE emphasizes that evaluation should not be a one-time event at the end of a term but an integral part of the teaching-learning process, occurring continuously throughout the academic year to track progress and provide timely feedback.
ব্যাখ্যা: CCE এই বিষয়ে জোর দেয় যে মূল্যায়ন একটি মেয়াদের শেষে এককালীন ঘটনা হওয়া উচিত নয়, বরং শিক্ষণ-শিখন প্রক্রিয়ার একটি অবিচ্ছেদ্য অংশ হওয়া উচিত, যা অগ্রগতি ট্র্যাক করতে এবং সময়মত প্রতিক্রিয়া প্রদানের জন্য শিক্ষাবর্ষ জুড়ে ক্রমাগত ঘটে।

52. A test that covers a wide range of content but in less detail is a:
একটি পরীক্ষা যা বিস্তৃত বিষয়বস্তু কভার করে কিন্তু কম বিস্তারিতভাবে, তা হল একটি:

A) Diagnostic Test / নির্ণায়ক অভীক্ষা B) Survey Test / সমীক্ষা অভীক্ষা C) Mastery Test / মাস্টারি অভীক্ষা D) Prognostic Test / প্রাগনস্টিক অভীক্ষা

Correct Answer: B) Survey Test / সমীক্ষা অভীক্ষা

Explanation: A survey test is a type of achievement test designed to measure a student’s general achievement across several broad areas of content. It provides an overall picture rather than an in-depth analysis of specific skills.
ব্যাখ্যা: একটি সমীক্ষা অভীক্ষা হল এক ধরণের পারদর্শিতার অভীক্ষা যা বিভিন্ন বিস্তৃত বিষয়বস্তু জুড়ে একজন শিক্ষার্থীর সাধারণ কৃতিত্ব পরিমাপ করার জন্য ডিজাইন করা হয়েছে। এটি নির্দিষ্ট দক্ষতার গভীর বিশ্লেষণের পরিবর্তে একটি সামগ্রিক চিত্র প্রদান করে।

53. The halo effect is a potential error in which evaluation technique?
হ্যালো প্রভাব (Halo effect) কোন মূল্যায়ন কৌশলে একটি সম্ভাব্য ত্রুটি?

A) Multiple Choice Test / বহুনির্বাচনী পরীক্ষা B) Rating Scale / রেটিং স্কেল C) Sociogram / সোসিওগ্রাম D) Checklist / চেকলিস্ট

Correct Answer: B) Rating Scale / রেটিং স্কেল

Explanation: The halo effect is a cognitive bias where a rater’s overall impression of a person influences their judgment of that person’s specific traits. For instance, if a teacher thinks a student is “good,” they might rate them highly on all traits (e.g., creativity, discipline) on a rating scale, regardless of actual performance.
ব্যাখ্যা: হ্যালো প্রভাব হল একটি জ্ঞানীয় পক্ষপাত যেখানে একজন রেটারের একজন ব্যক্তি সম্পর্কে সামগ্রিক ধারণা সেই ব্যক্তির নির্দিষ্ট বৈশিষ্ট্যের বিচারকে প্রভাবিত করে। উদাহরণস্বরূপ, যদি একজন শিক্ষক মনে করেন একজন শিক্ষার্থী “ভালো”, তবে তিনি তাকে রেটিং স্কেলে সমস্ত বৈশিষ্ট্যে (যেমন, সৃজনশীলতা, শৃঙ্খলা) উচ্চ রেটিং দিতে পারেন, প্রকৃত পারফরম্যান্স নির্বিশেষে।

54. An intelligence test is a type of:
একটি বুদ্ধিমত্তার পরীক্ষা হল এক ধরণের:

A) Maximum Performance Test / সর্বোচ্চ কর্মক্ষমতা পরীক্ষা B) Typical Performance Test / সাধারণ কর্মক্ষমতা পরীক্ষা C) Formative Test / গঠনমূলক পরীক্ষা D) Subjective Test / ব্যক্তিনিষ্ঠ পরীক্ষা

Correct Answer: A) Maximum Performance Test / সর্বোচ্চ কর্মক্ষমতা পরীক্ষা

Explanation: Tests can be classified based on the performance they measure. Maximum performance tests (like achievement and intelligence tests) are designed to see how well individuals can perform when they are motivated to do their best. Typical performance tests (like personality and interest inventories) assess what individuals usually do or feel.
ব্যাখ্যা: পরীক্ষাগুলিকে তারা যে কর্মক্ষমতা পরিমাপ করে তার উপর ভিত্তি করে শ্রেণীবদ্ধ করা যেতে পারে। সর্বোচ্চ কর্মক্ষমতা পরীক্ষা (যেমন পারদর্শিতা এবং বুদ্ধিমত্তার পরীক্ষা) দেখতে ডিজাইন করা হয়েছে যে ব্যক্তিরা যখন তাদের সেরাটা করার জন্য অনুপ্রাণিত হয় তখন তারা কতটা ভালো পারফর্ম করতে পারে। সাধারণ কর্মক্ষমতা পরীক্ষা (যেমন ব্যক্তিত্ব এবং আগ্রহের তালিকা) মূল্যায়ন করে যে ব্যক্তিরা সাধারণত কী করে বা অনুভব করে।

55. ‘Norms’ in the context of test standardization refer to:
পরীক্ষার মাননির্ণয়ের প্রেক্ষাপটে ‘নর্মস’ বলতে কী বোঝায়?

A) The rules for administering the test / পরীক্ষা পরিচালনার নিয়ম B) The average or typical scores of a specific group / একটি নির্দিষ্ট গোষ্ঠীর গড় বা সাধারণ স্কোর C) The difficulty level of the test items / পরীক্ষার প্রশ্নগুলির কাঠিন্যের স্তর D) The validity evidence for the test / পরীক্ষার জন্য বৈধতার প্রমাণ

Correct Answer: B) The average or typical scores of a specific group / একটি নির্দিষ্ট গোষ্ঠীর গড় বা সাধারণ স্কোর

Explanation: Norms are sets of scores derived from administering a test to a large, representative sample (the norm group). These scores serve as a frame of reference for interpreting the scores of individuals who take the test later.
ব্যাখ্যা: নর্মস হল একটি বৃহৎ, প্রতিনিধিত্বমূলক নমুনায় (নর্ম গ্রুপ) একটি পরীক্ষা পরিচালনা করে প্রাপ্ত স্কোরগুলির সেট। এই স্কোরগুলি পরে পরীক্ষা দেওয়া ব্যক্তিদের স্কোর ব্যাখ্যা করার জন্য একটি রেফারেন্স ফ্রেম হিসাবে কাজ করে।
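
One common way norms are used as a frame of reference is the percentile rank. The sketch below is an illustration only: the ten-score norm group is hypothetical and far smaller than a real norm sample, and counting tied scores as half is one convention among several.

```python
def percentile_rank(score, norm_scores):
    """Percentage of the norm group below the score; ties count as half."""
    below = sum(1 for s in norm_scores if s < score)
    equal = sum(1 for s in norm_scores if s == score)
    return 100 * (below + 0.5 * equal) / len(norm_scores)

# Hypothetical norm-group scores collected during standardization.
norm_group = [35, 40, 42, 45, 48, 50, 52, 55, 60, 65]

# A later test-taker's raw score of 52 is interpreted against the norm group.
print(percentile_rank(52, norm_group))  # 65.0
```

The raw score of 52 carries little meaning alone; the norm group shows that it sits above roughly two thirds of the reference sample.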

56. Which is the broadest term?
কোনটি সবচেয়ে ব্যাপক শব্দ?

A) Test / পরীক্ষা B) Measurement / পরিমাপ C) Assessment / অ্যাসেসমেন্ট D) Evaluation / মূল্যায়ন

Correct Answer: D) Evaluation / মূল্যায়ন

Explanation: The hierarchy is generally Test -> Measurement -> Assessment -> Evaluation. A test is a tool. Measurement is assigning numbers. Assessment is the process of collecting and interpreting data. Evaluation is the broadest term, involving all the others plus a value judgment about the outcome.
ব্যাখ্যা: অনুক্রমটি সাধারণত পরীক্ষা -> পরিমাপ -> অ্যাসেসমেন্ট -> মূল্যায়ন। একটি পরীক্ষা একটি উপকরণ। পরিমাপ হল সংখ্যা নির্ধারণ। অ্যাসেসমেন্ট হল তথ্য সংগ্রহ এবং ব্যাখ্যা করার প্রক্রিয়া। মূল্যায়ন হল সবচেয়ে ব্যাপক শব্দ, যা অন্য সবগুলিকে এবং ফলাফলের সম্পর্কে একটি মূল্য বিচারকে অন্তর্ভুক্ত করে।

57. Which type of validity is most important for an achievement test?
একটি পারদর্শিতার অভীক্ষার জন্য কোন ধরণের বৈধতা সবচেয়ে গুরুত্বপূর্ণ?

A) Content Validity / বিষয়বস্তুগত বৈধতা B) Construct Validity / গঠনগত বৈধতা C) Predictive Validity / ভবিষ্যদ্বাণীমূলক বৈধতা D) Concurrent Validity / সহগামী বৈধতা

Correct Answer: A) Content Validity / বিষয়বস্তুগত বৈধতা

Explanation: Content validity is crucial for achievement tests because their primary purpose is to measure how much a student has learned from a specific curriculum or content domain. The test must be a representative sample of that content.
ব্যাখ্যা: পারদর্শিতার অভীক্ষার জন্য বিষয়বস্তুগত বৈধতা অত্যন্ত গুরুত্বপূর্ণ কারণ তাদের প্রাথমিক উদ্দেশ্য হল একজন শিক্ষার্থী একটি নির্দিষ্ট পাঠ্যক্রম বা বিষয়বস্তুর ক্ষেত্র থেকে কতটা শিখেছে তা পরিমাপ করা। পরীক্ষাটি অবশ্যই সেই বিষয়বস্তুর একটি প্রতিনিধিত্বমূলক নমুনা হতে হবে।

58. The main purpose of evaluation in education is to:
শিক্ষায় মূল্যায়নের প্রধান উদ্দেশ্য হল:

A) Rank students / শিক্ষার্থীদের র‍্যাঙ্ক করা B) Label students as ‘pass’ or ‘fail’ / শিক্ষার্থীদের ‘পাশ’ বা ‘ফেল’ হিসাবে লেবেল করা C) Make judgments about the quality of learning and teaching / শিখন এবং শিক্ষণের গুণমান সম্পর্কে বিচার করা D) Punish students for not learning / না শেখার জন্য শিক্ষার্থীদের শাস্তি দেওয়া

Correct Answer: C) Make judgments about the quality of learning and teaching / শিখন এবং শিক্ষণের গুণমান সম্পর্কে বিচার করা

Explanation: The ultimate goal of evaluation is not just to measure, but to use the information gathered to make informed decisions and judgments about the effectiveness of the entire educational process, including student learning, teaching methods, and curriculum.
ব্যাখ্যা: মূল্যায়নের চূড়ান্ত লক্ষ্য কেবল পরিমাপ করা নয়, বরং সংগৃহীত তথ্য ব্যবহার করে শিক্ষার্থীর শিখন, শিক্ষণ পদ্ধতি এবং পাঠ্যক্রম সহ সমগ্র শিক্ষাগত প্রক্রিয়ার কার্যকারিতা সম্পর্কে অবহিত সিদ্ধান্ত এবং বিচার করা।

59. Verbal and non-verbal tests are classifications of which type of test?
বাচনিক এবং অবাচনিক পরীক্ষা কোন ধরণের পরীক্ষার শ্রেণিবিভাগ?

A) Personality Tests / ব্যক্তিত্বের অভীক্ষা B) Interest Tests / আগ্রহের অভীক্ষা C) Intelligence Tests / বুদ্ধিমত্তার অভীক্ষা D) Achievement Tests / পারদর্শিতার অভীক্ষা

Correct Answer: C) Intelligence Tests / বুদ্ধিমত্তার অভীক্ষা

Explanation: Intelligence tests are often categorized based on the medium used. Verbal tests rely heavily on language, while non-verbal (or performance) tests use pictures, diagrams, or objects, making them suitable for people with language barriers or young children.
ব্যাখ্যা: বুদ্ধিমত্তার পরীক্ষাগুলিকে প্রায়শই ব্যবহৃত মাধ্যমের উপর ভিত্তি করে শ্রেণীবদ্ধ করা হয়। বাচনিক পরীক্ষাগুলি ভাষার উপর খুব বেশি নির্ভর করে, অন্যদিকে অবাচনিক (বা পারফরম্যান্স) পরীক্ষাগুলি ছবি, চিত্র বা বস্তু ব্যবহার করে, যা সেগুলিকে ভাষাগত বাধা থাকা ব্যক্তি বা ছোট শিশুদের জন্য উপযুক্ত করে তোলে।

60. A major limitation of essay-type tests is their:
প্রবন্ধমূলক পরীক্ষার একটি প্রধান সীমাবদ্ধতা হল তাদের:

A) Low validity / কম বৈধতা B) Limited content sampling / সীমিত বিষয়বস্তু নমুনা C) Inability to measure higher-order thinking / উচ্চ-স্তরের চিন্তাভাবনা পরিমাপের অক্ষমতা D) High objectivity in scoring / স্কোরিংয়ে উচ্চ বস্তুনিষ্ঠতা

Correct Answer: B) Limited content sampling / সীমিত বিষয়বস্তু নমুনা

Explanation: Since essay questions take a long time to answer, an essay test can only include a few questions. This means it can only sample a small, often unrepresentative, portion of the total course content, which can lower its content validity.
ব্যাখ্যা: যেহেতু প্রবন্ধমূলক প্রশ্নের উত্তর দিতে অনেক সময় লাগে, তাই একটি প্রবন্ধ পরীক্ষায় মাত্র কয়েকটি প্রশ্ন অন্তর্ভুক্ত করা যায়। এর মানে হল এটি মোট কোর্স বিষয়বস্তুর একটি ছোট, প্রায়শই অ-প্রতিনিধিত্বমূলক, অংশ নমুনা করতে পারে, যা এর বিষয়বস্তুগত বৈধতা হ্রাস করতে পারে।

61. Which evaluation occurs at the end of instruction?
কোন মূল্যায়নটি নির্দেশনার শেষে ঘটে?

A) Formative / গঠনমূলক B) Summative / সার্বিক C) Diagnostic / নির্ণায়ক D) Placement / স্থাননির্ণায়ক

Correct Answer: B) Summative / সার্বিক

Explanation: Summative evaluation is conducted at the end of a unit, course, or program to determine the extent to which instructional objectives have been achieved. It is an “evaluation of learning.”
ব্যাখ্যা: সার্বিক মূল্যায়ন একটি ইউনিট, কোর্স বা প্রোগ্রামের শেষে পরিচালিত হয় যাতে নির্দেশনামূলক উদ্দেশ্যগুলি কতটা অর্জিত হয়েছে তা নির্ধারণ করা যায়। এটি “শিখনের মূল্যায়ন”।

62. The ‘discrimination index’ of a test item indicates how well the item:
একটি পরীক্ষার প্রশ্নের ‘পৃথকীকরণ সূচক’ নির্দেশ করে যে প্রশ্নটি কতটা ভালো:

A) Is understood by all students / সমস্ত শিক্ষার্থী দ্বারা বোঝা যায় B) Differentiates between high-achievers and low-achievers / উচ্চ-কৃতিত্ব এবং নিম্ন-কৃতিত্বের শিক্ষার্থীদের মধ্যে পার্থক্য করে C) Covers the syllabus / পাঠ্যক্রম কভার করে D) Is easy to score / স্কোর করা সহজ

Correct Answer: B) Differentiates between high-achievers and low-achievers / উচ্চ-কৃতিত্ব এবং নিম্ন-কৃতিত্বের শিক্ষার্থীদের মধ্যে পার্থক্য করে

Explanation: The discrimination index is a key statistic in item analysis. A good test item should be answered correctly by more students in the high-scoring group than in the low-scoring group. This shows the item is effectively measuring the trait the test is designed to measure.
ব্যাখ্যা: পৃথকীকরণ সূচক হল প্রশ্ন বিশ্লেষণের একটি মূল পরিসংখ্যান। একটি ভালো পরীক্ষার প্রশ্ন নিম্ন-স্কোরিং গ্রুপের চেয়ে উচ্চ-স্কোরিং গ্রুপের বেশি শিক্ষার্থী দ্বারা সঠিকভাবে উত্তর দেওয়া উচিত। এটি দেখায় যে প্রশ্নটি কার্যকরভাবে সেই বৈশিষ্ট্য পরিমাপ করছে যা পরিমাপ করার জন্য পরীক্ষাটি ডিজাইন করা হয়েছে।
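
The comparison between high-scoring and low-scoring groups can be sketched as below. This assumes the simple upper-minus-lower index, D = (correct in upper group − correct in lower group) / group size, with groups formed from the top and bottom 27% of total scores. Both the 27% fraction and the response data are assumptions made for illustration.

```python
def discrimination_index(item_scores, item, fraction=0.27):
    """Upper-minus-lower discrimination index for one item."""
    # Rank students by total score, highest first.
    ranked = sorted(item_scores, key=sum, reverse=True)
    group_size = max(1, round(len(ranked) * fraction))
    upper = ranked[:group_size]
    lower = ranked[-group_size:]
    correct_upper = sum(row[item] for row in upper)
    correct_lower = sum(row[item] for row in lower)
    return (correct_upper - correct_lower) / group_size

# Hypothetical responses of ten students to four items (1 = correct).
responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 1],
    [1, 0, 1, 0],
    [1, 1, 0, 0],
    [0, 1, 1, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 1],
    [0, 0, 0, 1],
    [0, 0, 0, 0],
]

print(discrimination_index(responses, 0))  # 1.0: item separates the groups well
print(discrimination_index(responses, 3))  # 0.0: item does not discriminate
```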

63. Individual tests of intelligence are administered to:
ব্যক্তিগত বুদ্ধিমত্তার পরীক্ষা কাদের উপর পরিচালিত হয়?

A) A large group of people at once / একবারে একদল লোকের উপর B) One individual at a time / একবারে একজন ব্যক্তির উপর C) Only children / শুধুমাত্র শিশুদের উপর D) Only adults / শুধুমাত্র প্রাপ্তবয়স্কদের উপর

Correct Answer: B) One individual at a time / একবারে একজন ব্যক্তির উপর

Explanation: Individual tests, like the Stanford-Binet or Wechsler scales, require one-on-one administration by a trained examiner. This allows for detailed observation of the test-taker’s behavior and approach to problems, providing richer information than group tests.
ব্যাখ্যা: ব্যক্তিগত পরীক্ষা, যেমন স্ট্যানফোর্ড-বিনে বা ওয়েক্সলার স্কেল, একজন প্রশিক্ষিত পরীক্ষক দ্বারা একবারে একজন পরীক্ষার্থীর উপর পরিচালনার প্রয়োজন হয়। এটি পরীক্ষার্থীর আচরণ এবং সমস্যার প্রতি তার পদ্ধতির বিস্তারিত পর্যবেক্ষণের সুযোগ দেয়, যা গ্রুপ পরীক্ষার চেয়ে সমৃদ্ধ তথ্য প্রদান করে।

64. A major drawback of the current examination system is that it encourages:
বর্তমান পরীক্ষা ব্যবস্থার একটি প্রধান অসুবিধা হল এটি উৎসাহিত করে:

A) Holistic development / সামগ্রিক উন্নয়ন B) Selective study and cramming / বেছে বেছে পড়া এবং মুখস্থ করা C) Problem-solving skills / সমস্যা সমাধান দক্ষতা D) Creativity and originality / সৃজনশীলতা এবং মৌলিকতা

Correct Answer: B) Selective study and cramming / বেছে বেছে পড়া এবং মুখস্থ করা

Explanation: When students know that questions will come from a predictable set of topics, they tend to focus only on those “important” topics and cram information just before the exam, rather than aiming for a deep and comprehensive understanding of the entire subject.
ব্যাখ্যা: যখন শিক্ষার্থীরা জানে যে প্রশ্নগুলি একটি অনুমানযোগ্য বিষয় থেকে আসবে, তখন তারা পুরো বিষয়টির গভীর এবং ব্যাপক বোঝার লক্ষ্য না রেখে শুধুমাত্র সেই “গুরুত্বপূর্ণ” বিষয়গুলির উপর মনোযোগ দেয় এবং পরীক্ষার ঠিক আগে তথ্য মুখস্থ করে।

65. What is the main difference between an interest test and an aptitude test?
একটি আগ্রহ পরীক্ষা এবং একটি প্রবণতা পরীক্ষার মধ্যে প্রধান পার্থক্য কী?

A) Interest tests measure what you like to do; aptitude tests measure what you are capable of doing. / আগ্রহ পরীক্ষা পরিমাপ করে আপনি কী করতে পছন্দ করেন; প্রবণতা পরীক্ষা পরিমাপ করে আপনি কী করতে সক্ষম। B) Interest tests are for careers; aptitude tests are for school subjects. / আগ্রহ পরীক্ষা পেশার জন্য; প্রবণতা পরীক্ষা স্কুলের বিষয়গুলির জন্য। C) Interest tests are objective; aptitude tests are subjective. / আগ্রহ পরীক্ষা বস্তুনিষ্ঠ; প্রবণতা পরীক্ষা ব্যক্তিনিষ্ঠ। D) There is no difference. / কোন পার্থক্য নেই।

Correct Answer: A) Interest tests measure what you like to do; aptitude tests measure what you are capable of doing. / আগ্রহ পরীক্ষা পরিমাপ করে আপনি কী করতে পছন্দ করেন; প্রবণতা পরীক্ষা পরিমাপ করে আপনি কী করতে সক্ষম।

Explanation: This is the fundamental distinction. An interest inventory assesses one’s preferences and likes/dislikes (what one would enjoy). An aptitude test assesses one’s potential or capacity to succeed in a certain area (what one could be good at). A person might have an aptitude for something they have no interest in, and vice versa.
ব্যাখ্যা: এটিই হল মৌলিক পার্থক্য। একটি আগ্রহের তালিকা একজনের পছন্দ এবং অপছন্দ মূল্যায়ন করে (যা একজন উপভোগ করবে)। একটি প্রবণতা পরীক্ষা একটি নির্দিষ্ট ক্ষেত্রে সফল হওয়ার জন্য একজনের সম্ভাবনা বা ক্ষমতা মূল্যায়ন করে (যা একজন ভালো করতে পারে)। একজন ব্যক্তির এমন কিছুতে প্রবণতা থাকতে পারে যাতে তার কোন আগ্রহ নেই, এবং এর বিপরীতও হতে পারে।

66. A teacher prepares a test. After administering the test, he finds that the objectives of testing are not met. What was lacking in the test?
একজন শিক্ষক একটি পরীক্ষা প্রস্তুত করেন। পরীক্ষা পরিচালনার পর, তিনি দেখেন যে পরীক্ষার উদ্দেশ্য পূরণ হয়নি। পরীক্ষাটিতে কীসের অভাব ছিল?

A) Reliability / নির্ভরযোগ্যতা B) Validity / বৈধতা C) Objectivity / বস্তুনিষ্ঠতা D) Usability / ব্যবহারযোগ্যতা

Correct Answer: B) Validity / বৈধতা

Explanation: Validity is the degree to which a test measures what it is intended to measure. If the test did not meet its objectives, it means it was not a valid measure of the intended learning outcomes.
ব্যাখ্যা: বৈধতা হল সেই মাত্রা যে পর্যন্ত একটি পরীক্ষা তা-ই পরিমাপ করে যা পরিমাপ করার জন্য এটি তৈরি। যদি পরীক্ষাটি তার উদ্দেশ্য পূরণ না করে, এর মানে হল এটি অভীষ্ট শিখন ফলাফলের একটি বৈধ পরিমাপক ছিল না।

67. The term ‘usability’ of a test refers to its:
একটি পরীক্ষার ‘ব্যবহারযোগ্যতা’ বলতে কী বোঝায়?

A) Consistency and accuracy / সামঞ্জস্য এবং নির্ভুলতা B) Practical aspects like ease of administration, scoring, and interpretation / ব্যবহারিক দিক যেমন পরিচালনা, স্কোরিং এবং ব্যাখ্যার সহজতা C) Ability to predict future success / ভবিষ্যতের সাফল্য ভবিষ্যদ্বাণী করার ক্ষমতা D) Freedom from cultural bias / সাংস্কৃতিক পক্ষপাত থেকে মুক্তি

Correct Answer: B) Practical aspects like ease of administration, scoring, and interpretation / ব্যবহারিক দিক যেমন পরিচালনা, স্কোরিং এবং ব্যাখ্যার সহজতা

Explanation: Usability (or practicability) is a practical consideration. A test might be valid and reliable, but if it is too long, too expensive, or too difficult to score and interpret, its usability is low.
ব্যাখ্যা: ব্যবহারযোগ্যতা (বা প্রায়োগিকতা) একটি ব্যবহারিক বিবেচনা। একটি পরীক্ষা বৈধ এবং নির্ভরযোগ্য হতে পারে, কিন্তু যদি এটি খুব দীর্ঘ, খুব ব্যয়বহুল, বা স্কোর এবং ব্যাখ্যা করা খুব কঠিন হয়, তবে এর ব্যবহারযোগ্যতা কম।

68. In a criterion-referenced test, the emphasis is on:
একটি মানদণ্ড-ভিত্তিক (criterion-referenced) পরীক্ষায়, জোর দেওয়া হয়:

A) Comparing students with each other / শিক্ষার্থীদের একে অপরের সাথে তুলনা করার উপর B) Identifying an individual’s specific level of performance / একজন ব্যক্তির পারফরম্যান্সের নির্দিষ্ট স্তর চিহ্নিত করার উপর C) A wide range of generalized skills / বিস্তৃত সাধারণীকৃত দক্ষতার উপর D) The normal distribution of scores / স্কোরের স্বাভাবিক বন্টনের উপর

Correct Answer: B) Identifying an individual’s specific level of performance / একজন ব্যক্তির পারফরম্যান্সের নির্দিষ্ট স্তর চিহ্নিত করার উপর

Explanation: The goal of a CRT is to determine what a student knows and can do in relation to a specific set of learning objectives or a performance standard (the criterion). The focus is absolute (what they can do) rather than relative (how they compare to others).
ব্যাখ্যা: একটি CRT-এর লক্ষ্য হল একটি নির্দিষ্ট শিখন উদ্দেশ্যের সেট বা পারফরম্যান্সের মান (মানদণ্ড) সাপেক্ষে একজন শিক্ষার্থী কী জানে এবং কী করতে পারে তা নির্ধারণ করা। এখানে জোর দেওয়া হয় পরম মানের উপর (তারা কী করতে পারে), আপেক্ষিক মানের উপর নয় (অন্যদের তুলনায় তারা কেমন)।
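
The absolute versus relative distinction can be illustrated with a small sketch. The student labels, scores, and the cut-off of 70 are hypothetical. The first function interprets each score against a fixed standard, while the second only ranks students against one another.

```python
def criterion_referenced(scores, cutoff):
    """Mastery decision for each student against a fixed standard."""
    return {name: score >= cutoff for name, score in scores.items()}

def norm_referenced(scores):
    """Rank of each student relative to the others (1 = highest score)."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    return {name: rank for rank, name in enumerate(ordered, start=1)}

# Hypothetical scores for three students.
scores = {"A": 82, "B": 74, "C": 68}

print(criterion_referenced(scores, cutoff=70))  # {'A': True, 'B': True, 'C': False}
print(norm_referenced(scores))                  # {'A': 1, 'B': 2, 'C': 3}
```

If every student scored above 70, the criterion-referenced reading would report mastery for all of them, whereas the ranking would still place someone last.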

69. A group test of intelligence:
একটি দলগত বুদ্ধিমত্তার পরীক্ষা:

A) Is more reliable than an individual test / একটি ব্যক্তিগত পরীক্ষার চেয়ে বেশি নির্ভরযোগ্য B) Provides more in-depth information / আরও গভীর তথ্য প্রদান করে C) Can be administered to many people at the same time / একই সময়ে অনেক লোকের উপর পরিচালনা করা যায় D) Requires a highly trained examiner for administration / পরিচালনার জন্য একজন উচ্চ প্রশিক্ষিত পরীক্ষকের প্রয়োজন

Correct Answer: C) Can be administered to many people at the same time / একই সময়ে অনেক লোকের উপর পরিচালনা করা যায়

Explanation: The primary advantage of group tests is their efficiency. They are typically paper-and-pencil or computer-based and can be given to large groups simultaneously, making them cost-effective and time-saving for large-scale screening.
ব্যাখ্যা: দলগত পরীক্ষার প্রাথমিক সুবিধা হল তাদের কার্যকারিতা। এগুলি সাধারণত কাগজ-কলম বা কম্পিউটার-ভিত্তিক এবং একই সাথে বড় গোষ্ঠীকে দেওয়া যেতে পারে, যা তাদের বড় আকারের স্ক্রীনিংয়ের জন্য ব্যয়-কার্যকর এবং সময়-সাশ্রয়ী করে তোলে।

70. An open book examination is a suggestion to improve the current system because it aims to reduce:
একটি খোলা বই পরীক্ষা বর্তমান ব্যবস্থার উন্নতির জন্য একটি পরামর্শ কারণ এটি কমাতে লক্ষ্য রাখে:

A) The importance of understanding / বোঝার গুরুত্ব B) The stress on rote memorization / মুখস্থ বিদ্যার উপর চাপ C) The time required for evaluation / মূল্যায়নের জন্য প্রয়োজনীয় সময় D) The cost of conducting exams / পরীক্ষা পরিচালনার খরচ

Correct Answer: B) The stress on rote memorization / মুখস্থ বিদ্যার উপর চাপ

Explanation: Open book exams shift the focus from recalling information to applying it. Since students can access their notes and books, the questions must be designed to test higher-order skills like analysis, synthesis, and problem-solving, rather than simple memory.
ব্যাখ্যা: খোলা বই পরীক্ষা তথ্য স্মরণ করা থেকে তা প্রয়োগ করার দিকে মনোযোগ সরিয়ে দেয়। যেহেতু শিক্ষার্থীরা তাদের নোট এবং বই অ্যাক্সেস করতে পারে, তাই প্রশ্নগুলি অবশ্যই সাধারণ স্মৃতির পরিবর্তে বিশ্লেষণ, সংশ্লেষণ এবং সমস্যা সমাধানের মতো উচ্চ-স্তরের দক্ষতা পরীক্ষা করার জন্য ডিজাইন করা উচিত।

71. Which one is different from the other three with reference to the type of test?
পরীক্ষার ধরনের নিরিখে কোনটি অন্য তিনটি থেকে আলাদা?

A) 16 PF / ১৬ পিএফ B) TAT / টিএটি C) Rorschach Test / রোরশ্যাক পরীক্ষা D) Sentence Completion Test / বাক্য পূরণ পরীক্ষা

Correct Answer: A) 16 PF / ১৬ পিএফ

Explanation: TAT, Rorschach, and Sentence Completion are all projective tests of personality. The 16 PF (16 Personality Factor Questionnaire) is a non-projective, objective, self-report inventory.
ব্যাখ্যা: TAT, রোরশ্যাক, এবং বাক্য পূরণ পরীক্ষা সবই ব্যক্তিত্বের প্রক্ষেপণমূলক পরীক্ষা। 16 PF (16 পার্সোনালিটি ফ্যাক্টর কোয়েশ্চনেয়ার) হল একটি অ-প্রক্ষেপণমূলক, বস্তুনিষ্ঠ, স্ব-বিবরণী তালিকা।

72. The final product of measurement is:
পরিমাপের চূড়ান্ত ফল হল:

A) A score or a numerical value / একটি স্কোর বা একটি সংখ্যাসূচক মান B) A grade / একটি গ্রেড C) A value judgment / একটি মূল্য বিচার D) A decision / একটি সিদ্ধান্ত

Correct Answer: A) A score or a numerical value / একটি স্কোর বা একটি সংখ্যাসূচক মান

Explanation: Measurement is the process of quantification. Its output is data in the form of numbers or scores. The interpretation of these scores (grades, judgments, decisions) is the role of evaluation.
ব্যাখ্যা: পরিমাপ হল পরিমাণ নির্ধারণের প্রক্রিয়া। এর আউটপুট হল সংখ্যা বা স্কোরের আকারে ডেটা। এই স্কোরগুলির ব্যাখ্যা (গ্রেড, বিচার, সিদ্ধান্ত) হল মূল্যায়নের ভূমিকা।

73. A teacher-made test is generally a:
একটি শিক্ষক-নির্মিত পরীক্ষা সাধারণত একটি:

A) Standardized test / মানসম্মত পরীক্ষা B) Non-standardized test / অ-মানসম্মত পরীক্ষা C) Norm-referenced test / নর্ম-ভিত্তিক পরীক্ষা D) Aptitude test / প্রবণতা পরীক্ষা

Correct Answer: B) Non-standardized test / অ-মানসম্মত পরীক্ষা

Explanation: Standardized tests are developed by experts, undergo rigorous item analysis, and have established norms. Teacher-made tests are typically created for a specific classroom and do not go through this extensive process, so they are considered non-standardized or informal.
ব্যাখ্যা: মানসম্মত পরীক্ষাগুলি বিশেষজ্ঞদের দ্বারা তৈরি করা হয়, কঠোর প্রশ্ন বিশ্লেষণের মধ্য দিয়ে যায় এবং প্রতিষ্ঠিত নর্ম থাকে। শিক্ষক-নির্মিত পরীক্ষাগুলি সাধারণত একটি নির্দিষ্ট শ্রেণিকক্ষের জন্য তৈরি করা হয় এবং এই ব্যাপক প্রক্রিয়ার মধ্য দিয়ে যায় না, তাই এগুলিকে অ-মানসম্মত বা অনানুষ্ঠানিক হিসাবে বিবেচনা করা হয়।

74. The “need for evaluation” in education arises because we want to:
শিক্ষায় “মূল্যায়নের প্রয়োজন” দেখা দেয় কারণ আমরা চাই:

A) Increase the number of examinations / পরীক্ষার সংখ্যা বাড়াতে B) Determine the effectiveness of the educational process / শিক্ষাগত প্রক্রিয়ার কার্যকারিতা নির্ধারণ করতে C) Eliminate weak students from the system / দুর্বল শিক্ষার্থীদের সিস্টেম থেকে বাদ দিতে D) Make the curriculum more difficult / পাঠ্যক্রমকে আরও কঠিন করতে

Correct Answer: B) Determine the effectiveness of the educational process / শিক্ষাগত প্রক্রিয়ার কার্যকারিতা নির্ধারণ করতে

Explanation: The fundamental need for evaluation is to gather evidence to see if the educational goals are being met. It helps in assessing student learning, improving teaching methods, and making decisions about curriculum, thereby checking the overall effectiveness.
ব্যাখ্যা: মূল্যায়নের মৌলিক প্রয়োজন হল শিক্ষাগত লক্ষ্যগুলি পূরণ হচ্ছে কিনা তা দেখার জন্য প্রমাণ সংগ্রহ করা। এটি শিক্ষার্থীর শিখন মূল্যায়ন, শিক্ষণ পদ্ধতির উন্নতি এবং পাঠ্যক্রম সম্পর্কে সিদ্ধান্ত নিতে সাহায্য করে, যার ফলে সামগ্রিক কার্যকারিতা যাচাই করা হয়।

75. The term ‘intelligence’ is best described as:
‘বুদ্ধিমত্তা’ শব্দটি সবচেয়ে ভালোভাবে বর্ণনা করা হয় এভাবে:

A) The ability to score high on tests / পরীক্ষায় উচ্চ স্কোর করার ক্ষমতা B) The capacity to acquire and apply knowledge and skills / জ্ঞান এবং দক্ষতা অর্জন ও প্রয়োগ করার ক্ষমতা C) The amount of information a person knows / একজন ব্যক্তি যে পরিমাণ তথ্য জানে D) The speed of reading and writing / পড়া এবং লেখার গতি

Correct Answer: B) The capacity to acquire and apply knowledge and skills / জ্ঞান এবং দক্ষতা অর্জন ও প্রয়োগ করার ক্ষমতা

Explanation: Modern definitions of intelligence go beyond just knowing facts. They emphasize the ability to reason, solve problems, think abstractly, comprehend complex ideas, learn quickly, and learn from experience.
ব্যাখ্যা: বুদ্ধিমত্তার আধুনিক সংজ্ঞাগুলি কেবল তথ্য জানার বাইরেও যায়। এগুলি যুক্তি, সমস্যা সমাধান, বিমূর্তভাবে চিন্তা করা, জটিল ধারণা বোঝা, দ্রুত শেখা এবং অভিজ্ঞতা থেকে শেখার ক্ষমতার উপর জোর দেয়।

76. The construction of a test begins with:
একটি পরীক্ষা নির্মাণ শুরু হয় কী দিয়ে?

A) Item writing / প্রশ্ন লেখা B) Scoring key preparation / স্কোরিং কী প্রস্তুতি C) Planning and specifying objectives / পরিকল্পনা এবং উদ্দেশ্য নির্দিষ্টকরণ D) Try-out / ট্রাই-আউট

Correct Answer: C) Planning and specifying objectives / পরিকল্পনা এবং উদ্দেশ্য নির্দিষ্টকরণ

Explanation: The very first and most critical stage is planning. This includes defining the purpose of the test, identifying the content to be covered, and clearly specifying the instructional objectives that will be measured. All other steps follow from this plan.
ব্যাখ্যা: সবচেয়ে প্রথম এবং সবচেয়ে গুরুত্বপূর্ণ পর্যায় হল পরিকল্পনা। এর মধ্যে পরীক্ষার উদ্দেশ্য সংজ্ঞায়িত করা, কভার করা হবে এমন বিষয়বস্তু চিহ্নিত করা এবং পরিমাপ করা হবে এমন নির্দেশনামূলক উদ্দেশ্যগুলি স্পষ্টভাবে নির্দিষ্ট করা অন্তর্ভুক্ত। অন্যান্য সমস্ত পদক্ষেপ এই পরিকল্পনা থেকে অনুসরণ করে।

77. Which of the following is a performance test?
নিম্নলিখিত কোনটি একটি পারফরম্যান্স পরীক্ষা?

A) A multiple-choice history test / একটি বহুনির্বাচনী ইতিহাস পরীক্ষা B) A science lab experiment examination / একটি বিজ্ঞান ল্যাবের ব্যবহারিক পরীক্ষা C) A true-false geography quiz / একটি সত্য-মিথ্যা ভূগোল কুইজ D) A literature essay / একটি সাহিত্য প্রবন্ধ

Correct Answer: B) A science lab experiment examination / একটি বিজ্ঞান ল্যাবের ব্যবহারিক পরীক্ষা

Explanation: A performance test requires the student to actually perform a task or create a product, rather than just selecting an answer. A lab experiment, a typing test, or a driving test are all examples of performance tests.
ব্যাখ্যা: একটি পারফরম্যান্স পরীক্ষায় শিক্ষার্থীকে কেবল একটি উত্তর নির্বাচন করার পরিবর্তে বাস্তবে একটি কাজ সম্পাদন করতে বা একটি পণ্য তৈরি করতে হয়। একটি ল্যাব পরীক্ষা, একটি টাইপিং পরীক্ষা, বা একটি ড্রাইভিং পরীক্ষা সবই পারফরম্যান্স পরীক্ষার উদাহরণ।

78. The “scope of evaluation” covers:
“মূল্যায়নের পরিধি” কী কভার করে?

A) Only cognitive domain / শুধুমাত্র জ্ঞানীয় ক্ষেত্র B) Only affective domain / শুধুমাত্র অনুভূতিমূলক ক্ষেত্র C) Only psychomotor domain / শুধুমাত্র মনশ্চালকমূলক ক্ষেত্র D) Cognitive, affective, and psychomotor domains / জ্ঞানীয়, অনুভূতিমূলক, এবং মনশ্চালকমূলক ক্ষেত্র

Correct Answer: D) Cognitive, affective, and psychomotor domains / জ্ঞানীয়, অনুভূতিমূলক, এবং মনশ্চালকমূলক ক্ষেত্র

Explanation: Comprehensive evaluation aims to assess the whole child. This includes the cognitive domain (knowledge, thinking), the affective domain (attitudes, values, interests), and the psychomotor domain (physical skills).
ব্যাখ্যা: ব্যাপক মূল্যায়নের লক্ষ্য হল সমগ্র শিশুকে মূল্যায়ন করা। এর মধ্যে রয়েছে জ্ঞানীয় ক্ষেত্র (জ্ঞান, চিন্তাভাবনা), অনুভূতিমূলক ক্ষেত্র (মনোভাব, মূল্যবোধ, আগ্রহ), এবং মনশ্চালকমূলক ক্ষেত্র (শারীরিক দক্ষতা)।

79. The term “personality” originates from the Latin word ‘persona’, which means:
‘পার্সোনালিটি’ শব্দটি ল্যাটিন শব্দ ‘পার্সোনা’ থেকে এসেছে, যার অর্থ:

A) Self / স্বয়ং B) Individual / ব্যক্তি C) Mask / মুখোশ D) Character / চরিত্র

Correct Answer: C) Mask / মুখোশ

Explanation: The word ‘persona’ referred to the theatrical masks worn by actors in ancient Greek and Roman drama to portray different roles or characters. This origin hints at the idea of the outward appearance or social self.
ব্যাখ্যা: ‘পার্সোনা’ শব্দটি প্রাচীন গ্রীক এবং রোমান নাটকে অভিনেতাদের দ্বারা বিভিন্ন ভূমিকা বা চরিত্র চিত্রিত করার জন্য পরা নাট্য মুখোশকে বোঝায়। এই উৎসটি বাহ্যিক চেহারা বা সামাজিক স্ব-এর ধারণার ইঙ্গিত দেয়।

80. If a student gets a similar rank in two equivalent tests, the tests are said to be:
যদি একজন শিক্ষার্থী দুটি সমতুল্য পরীক্ষায় একই রকম র‍্যাঙ্ক পায়, তবে পরীক্ষা দুটিকে বলা হয়:

A) Valid / বৈধ B) Reliable / নির্ভরযোগ্য C) Objective / বস্তুনিষ্ঠ D) Comprehensive / ব্যাপক

Correct Answer: B) Reliable / নির্ভরযোগ্য

Explanation: This scenario describes parallel-forms reliability (or equivalent-forms reliability). It assesses the consistency of results across different but equivalent versions of a test. If scores are similar, the tests are considered reliable.
ব্যাখ্যা: এই পরিস্থিতিটি সমান্তরাল-ফর্ম নির্ভরযোগ্যতা (বা সমতুল্য-ফর্ম নির্ভরযোগ্যতা) বর্ণনা করে। এটি একটি পরীক্ষার বিভিন্ন কিন্তু সমতুল্য সংস্করণ জুড়ে ফলাফলের সামঞ্জস্যতা মূল্যায়ন করে। যদি স্কোর একই রকম হয়, তবে পরীক্ষাগুলিকে নির্ভরযোগ্য হিসাবে বিবেচনা করা হয়।
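
As a rough illustration of the parallel-forms idea above, the sketch below correlates the ranks that the same students obtain on two equivalent forms (a Spearman rank correlation). The scores and the names `form_a` and `form_b` are invented for illustration; a value close to 1 would suggest that the two forms rank students consistently.

```python
from statistics import mean

def ranks(scores):
    """Rank scores from highest (rank 1) downward, averaging ranks for ties."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    result = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of students tied with the one at position i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg_rank
        i = j + 1
    return result

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length lists."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5

def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))

# Hypothetical scores of five students on two equivalent forms
form_a = [42, 35, 50, 28, 39]
form_b = [40, 33, 48, 30, 41]
print(round(spearman(form_a, form_b), 2))  # 0.9
```

Here only two students swap adjacent ranks between the forms, so the coefficient stays high. How high a coefficient must be before the forms count as reliable is a judgment the source does not specify.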

81. “A test can be reliable without being valid.” This statement is:
“একটি পরীক্ষা বৈধ না হয়েও নির্ভরযোগ্য হতে পারে।” এই বিবৃতিটি:

A) True / সত্য B) False / মিথ্যা C) Partially True / আংশিক সত্য D) Irrelevant / অপ্রাসঙ্গিক

Correct Answer: A) True / সত্য

Explanation: A test can consistently measure something (be reliable), but that “something” may not be what it’s supposed to measure (making it invalid). For example, a test of “intelligence” that only measures reading speed would be invalid, but it could be very reliable in consistently measuring reading speed.
ব্যাখ্যা: একটি পরীক্ষা ধারাবাহিকভাবে কিছু পরিমাপ করতে পারে (নির্ভরযোগ্য হতে পারে), কিন্তু সেই “কিছু” যা পরিমাপ করার কথা তা নাও হতে পারে (যা এটিকে অবৈধ করে তোলে)। উদাহরণস্বরূপ, “বুদ্ধিমত্তার” একটি পরীক্ষা যা কেবল পড়ার গতি পরিমাপ করে তা অবৈধ হবে, তবে এটি ধারাবাহিকভাবে পড়ার গতি পরিমাপ করতে খুব নির্ভরযোগ্য হতে পারে।

82. “A test cannot be valid unless it is reliable.” This statement is:
“একটি পরীক্ষা নির্ভরযোগ্য না হলে বৈধ হতে পারে না।” এই বিবৃতিটি:

A) True / সত্য B) False / মিথ্যা C) Partially True / আংশিক সত্য D) Irrelevant / অপ্রাসঙ্গিক

Correct Answer: A) True / সত্য

Explanation: Reliability is a necessary, but not sufficient, condition for validity. If a test gives inconsistent, random scores (is unreliable), it cannot possibly be measuring any specific trait accurately (be valid). A valid test must first be reliable.
ব্যাখ্যা: নির্ভরযোগ্যতা বৈধতার জন্য একটি প্রয়োজনীয় শর্ত, কিন্তু যথেষ্ট শর্ত নয়। যদি একটি পরীক্ষা অসামঞ্জস্যপূর্ণ, এলোমেলো স্কোর দেয় (অনির্ভরযোগ্য হয়), তবে এটি কোনোভাবেই কোনো নির্দিষ্ট বৈশিষ্ট্য সঠিকভাবে পরিমাপ করতে পারে না (বৈধ হতে পারে না)। একটি বৈধ পরীক্ষাকে অবশ্যই প্রথমে নির্ভরযোগ্য হতে হবে।

83. Essay tests are superior to objective tests in terms of:
প্রবন্ধ পরীক্ষা বস্তুনিষ্ঠ পরীক্ষার চেয়ে কোন দিক থেকে উন্নত?

A) Reliability of scoring / স্কোরিং এর নির্ভরযোগ্যতা B) Sampling of content / বিষয়বস্তুর নমুনা C) Measuring originality and organization of ideas / মৌলিকতা এবং ধারণার সংগঠন পরিমাপ করা D) Objectivity of scoring / স্কোরিং এর বস্তুনিষ্ঠতা

Correct Answer: C) Measuring originality and organization of ideas / মৌলিকতা এবং ধারণার সংগঠন পরিমাপ করা

Explanation: The main strength of essay tests lies in their ability to assess higher-level cognitive processes. They allow students to demonstrate their ability to synthesize information, develop logical arguments, and express themselves creatively, which is difficult to measure with objective formats.
ব্যাখ্যা: প্রবন্ধ পরীক্ষার প্রধান শক্তি হল উচ্চ-স্তরের জ্ঞানীয় প্রক্রিয়াগুলি মূল্যায়ন করার ক্ষমতা। এগুলি শিক্ষার্থীদের তথ্য সংশ্লেষণ, যৌক্তিক যুক্তি বিকাশ এবং সৃজনশীলভাবে নিজেদের প্রকাশ করার ক্ষমতা প্রদর্শন করতে দেয়, যা বস্তুনিষ্ঠ বিন্যাসে পরিমাপ করা কঠিন।

84. Which of the following is not a tool of evaluation?
নিম্নলিখিত কোনটি মূল্যায়নের উপকরণ নয়?

A) Interview / সাক্ষাৎকার B) Checklist / চেকলিস্ট C) Rating scale / রেটিং স্কেল D) Achievement test / পারদর্শিতার অভীক্ষা

Correct Answer: A) Interview / সাক্ষাৎকার

Explanation: A distinction is often made between tools (the instruments) and techniques (the methods). A checklist, rating scale, and achievement test are all physical or conceptual instruments (tools). An interview is a method or process (a technique) for gathering information, which might use a tool like an interview schedule.
ব্যাখ্যা: প্রায়শই উপকরণ (যন্ত্র) এবং কৌশল (পদ্ধতি) এর মধ্যে একটি পার্থক্য করা হয়। একটি চেকলিস্ট, রেটিং স্কেল, এবং পারদর্শিতার অভীক্ষা সবই ভৌত বা ধারণাগত যন্ত্র (উপকরণ)। একটি সাক্ষাৎকার হল তথ্য সংগ্রহের একটি পদ্ধতি বা প্রক্রিয়া (একটি কৌশল), যা সাক্ষাৎকার অনুসূচির (interview schedule) মতো একটি উপকরণ ব্যবহার করতে পারে।

85. The term ‘Prognostic Test’ is another name for:
‘প্রোগনস্টিক টেস্ট’ শব্দটি কিসের অন্য নাম?

A) Aptitude Test / প্রবণতা অভীক্ষা B) Achievement Test / পারদর্শিতার অভীক্ষা C) Diagnostic Test / নির্ণায়ক অভীক্ষা D) Personality Test / ব্যক্তিত্বের অভীক্ষা

Correct Answer: A) Aptitude Test / প্রবণতা অভীক্ষা

Explanation: Prognosis means predicting the likely course of a situation. A prognostic test, therefore, is one that aims to predict future performance or success, which is the primary function of an aptitude test.
ব্যাখ্যা: প্রোগনোসিস মানে একটি পরিস্থিতির সম্ভাব্য গতিপথ ভবিষ্যদ্বাণী করা। একটি প্রোগনস্টিক পরীক্ষা, অতএব, এমন একটি পরীক্ষা যা ভবিষ্যতের কর্মক্ষমতা বা সাফল্য ভবিষ্যদ্বাণী করার লক্ষ্য রাখে, যা একটি প্রবণতা পরীক্ষার প্রাথমিক কাজ।

86. The concept of “Mental Age” was introduced by:
“মানসিক বয়স” ধারণাটি কে প্রবর্তন করেন?

A) Terman / টারম্যান B) Stern / স্টার্ন C) Wechsler / ওয়েক্সলার D) Binet / বিনে

Correct Answer: D) Binet / বিনে

Explanation: Alfred Binet and his collaborator Théodore Simon developed the concept of mental age as part of their work on the first practical intelligence scale. Mental age refers to the level of intellectual performance typically associated with a particular chronological age.
ব্যাখ্যা: আলফ্রেড বিনে এবং তার সহযোগী থিওডোর সাইমন প্রথম ব্যবহারিক বুদ্ধিমত্তা স্কেলে তাদের কাজের অংশ হিসাবে মানসিক বয়সের ধারণাটি তৈরি করেন। মানসিক বয়স বলতে একটি নির্দিষ্ট কালানুক্রমিক বয়সের সাথে সাধারণত যুক্ত বৌদ্ধিক কর্মক্ষমতার স্তরকে বোঝায়।

87. A major suggestion for examination reform is the introduction of a:
পরীক্ষা সংস্কারের জন্য একটি প্রধান পরামর্শ হল কিসের প্রবর্তন?

A) More rigid and fixed examination schedule / আরও কঠোর এবং নির্দিষ্ট পরীক্ষার সময়সূচী B) Single, high-stakes final examination / একটি একক, উচ্চ-ঝুঁকির চূড়ান্ত পরীক্ষা C) Question bank system / প্রশ্ন ব্যাংক ব্যবস্থা D) System with only essay questions / শুধুমাত্র প্রবন্ধমূলক প্রশ্ন সহ একটি ব্যবস্থা

Correct Answer: C) Question bank system / প্রশ্ন ব্যাংক ব্যবস্থা

Explanation: Using a large, scientifically developed question bank can improve the quality, comparability, and fairness of examinations. It helps in creating balanced papers, reduces the scope for selective study, and allows for more flexible and frequent testing.
ব্যাখ্যা: একটি বৃহৎ, বৈজ্ঞানিকভাবে বিকশিত প্রশ্ন ব্যাংক ব্যবহার করে পরীক্ষার গুণমান, তুলনামূলকতা এবং ন্যায্যতা উন্নত করা যায়। এটি ভারসাম্যপূর্ণ প্রশ্নপত্র তৈরি করতে সাহায্য করে, বেছে বেছে পড়ার সুযোগ কমায় এবং আরও নমনীয় ও ঘন ঘন পরীক্ষার সুযোগ দেয়।

88. Who is considered the father of modern personality testing?
কাকে আধুনিক ব্যক্তিত্ব পরীক্ষার জনক হিসাবে বিবেচনা করা হয়?

A) Sigmund Freud / সিগমুন্ড ফ্রয়েড B) Gordon Allport / গর্ডন অলপোর্ট C) Raymond Cattell / রেমন্ড ক্যাটেল D) Hermann Rorschach / হারম্যান রোরশ্যাক

Correct Answer: B) Gordon Allport / গর্ডন অলপোর্ট

Explanation: Gordon Allport is often regarded as a foundational figure in the study of personality. His trait theory, which identified thousands of personality traits and categorized them into cardinal, central, and secondary traits, laid the groundwork for many modern personality tests.
ব্যাখ্যা: গর্ডন অলপোর্টকে প্রায়শই ব্যক্তিত্বের অধ্যয়নের ক্ষেত্রে একজন ভিত্তিস্থাপক ব্যক্তিত্ব হিসাবে বিবেচনা করা হয়। তার বৈশিষ্ট্য তত্ত্ব, যা হাজার হাজার ব্যক্তিত্বের বৈশিষ্ট্য চিহ্নিত করে এবং সেগুলিকে কার্ডিনাল, কেন্দ্রীয় এবং গৌণ বৈশিষ্ট্যে শ্রেণীবদ্ধ করে, অনেক আধুনিক ব্যক্তিত্ব পরীক্ষার ভিত্তি স্থাপন করে।

89. The step ‘Try-out’ in test construction is done to:
পরীক্ষা নির্মাণে ‘ট্রাই-আউট’ ধাপটি করা হয় কেন?

A) Finalize the scoring procedure / স্কোরিং পদ্ধতি চূড়ান্ত করতে B) Print the final version of the test / পরীক্ষার চূড়ান্ত সংস্করণ মুদ্রণ করতে C) Evaluate the test items and instructions / পরীক্ষার প্রশ্ন এবং নির্দেশাবলী মূল্যায়ন করতে D) Announce the test results / পরীক্ষার ফলাফল ঘোষণা করতে

Correct Answer: C) Evaluate the test items and instructions / পরীক্ষার প্রশ্ন এবং নির্দেশাবলী মূল্যায়ন করতে

Explanation: The try-out, or pilot testing, involves administering the preliminary draft of the test to a small, representative sample of the target population. The data from the try-out is then used for item analysis to identify flawed items, check for ambiguity in instructions, and determine the appropriate time limit.
ব্যাখ্যা: ট্রাই-আউট বা পাইলট টেস্টিং-এ পরীক্ষার প্রাথমিক খসড়াটি লক্ষ্য জনসংখ্যার একটি ছোট, প্রতিনিধিত্বমূলক নমুনায় পরিচালনা করা হয়। ট্রাই-আউট থেকে প্রাপ্ত ডেটা ত্রুটিপূর্ণ প্রশ্ন শনাক্ত করতে, নির্দেশাবলীতে অস্পষ্টতা পরীক্ষা করতে এবং উপযুক্ত সময়সীমা নির্ধারণ করতে প্রশ্ন বিশ্লেষণের জন্য ব্যবহৃত হয়।
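
The item analysis mentioned above can be sketched in a few lines. The example below is a minimal sketch that computes two commonly used statistics for each item: a difficulty index (the proportion of all students answering correctly) and a discrimination index (the proportion correct in the top-scoring group minus that in the bottom-scoring group). The try-out data, the function name `item_analysis`, and the 27% group size are assumptions chosen for illustration; the source does not prescribe a particular formula.

```python
def item_analysis(responses, group_fraction=0.27):
    """Return (difficulty, discrimination) for each item.

    responses: one list per student, holding 1 (correct) or 0 (incorrect)
    for every item. group_fraction is the assumed share of students placed
    in the upper and in the lower comparison groups.
    """
    n_students = len(responses)
    n_items = len(responses[0])
    # Order students by total score, best first
    ordered = sorted(responses, key=sum, reverse=True)
    g = max(1, round(n_students * group_fraction))
    upper, lower = ordered[:g], ordered[-g:]
    results = []
    for item in range(n_items):
        difficulty = sum(r[item] for r in responses) / n_students
        p_upper = sum(r[item] for r in upper) / g
        p_lower = sum(r[item] for r in lower) / g
        results.append((difficulty, p_upper - p_lower))
    return results

# Hypothetical try-out data: six students, three items
tryout = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 1],
    [1, 0, 0],
    [0, 0, 1],
    [0, 0, 0],
]
for number, (p, d) in enumerate(item_analysis(tryout), start=1):
    print(f"Item {number}: difficulty={p:.2f}, discrimination={d:.2f}")
```

In this invented data set the third item is answered equally well by the upper and lower groups, so its discrimination index is 0. That is the kind of item a try-out would flag for revision.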

90. Which of these is a non-standardized evaluation tool?
এর মধ্যে কোনটি একটি অ-মানসম্মত মূল্যায়ন উপকরণ?

A) WAIS-IV / WAIS-IV B) Stanford-Binet Test / স্ট্যানফোর্ড-বিনেট পরীক্ষা C) MMPI-2 / MMPI-2 D) Teacher-made observation checklist / শিক্ষক-নির্মিত পর্যবেক্ষণ চেকলিস্ট

Correct Answer: D) Teacher-made observation checklist / শিক্ষক-নির্মিত পর্যবেক্ষণ চেকলিস্ট

Explanation: WAIS, Stanford-Binet, and MMPI are all classic examples of standardized tests that have been developed through rigorous research and have established norms. A checklist created by a teacher for their specific classroom use is a non-standardized, informal tool.
ব্যাখ্যা: WAIS, স্ট্যানফোর্ড-বিনেট, এবং MMPI সবই মানসম্মত পরীক্ষার ক্লাসিক উদাহরণ যা কঠোর গবেষণার মাধ্যমে তৈরি করা হয়েছে এবং প্রতিষ্ঠিত নর্ম রয়েছে। একজন শিক্ষকের দ্বারা তাদের নির্দিষ্ট শ্রেণিকক্ষের ব্যবহারের জন্য তৈরি করা একটি চেকলিস্ট একটি অ-মানসম্মত, অনানুষ্ঠানিক উপকরণ।

91. The main difference between Measurement and Evaluation is that Evaluation involves:
পরিমাপ এবং মূল্যায়নের মধ্যে প্রধান পার্থক্য হল মূল্যায়ন জড়িত করে:

A) Numbers / সংখ্যা B) Description / বর্ণনা C) Value Judgment / মূল্য বিচার D) Tools / উপকরণ

Correct Answer: C) Value Judgment / মূল্য বিচার

Explanation: Measurement provides the ‘what’ (e.g., a score of 80). Evaluation provides the ‘so what’ by making a judgment about the worth or value of that score (e.g., “80 is an excellent score”). It is this act of judging value that separates evaluation from mere measurement.
ব্যাখ্যা: পরিমাপ ‘কী’ তা প্রদান করে (যেমন, ৮০-র একটি স্কোর)। মূল্যায়ন সেই স্কোরের যোগ্যতা বা মূল্য সম্পর্কে একটি বিচার করে ‘তাতে কী’ তা প্রদান করে (যেমন, “৮০ একটি চমৎকার স্কোর”)। এই মূল্য বিচার করার কাজটিই মূল্যায়নকে নিছক পরিমাপ থেকে আলাদা করে।

92. Tests used for career counseling are primarily:
কেরিয়ার কাউন্সেলিংয়ের জন্য ব্যবহৃত পরীক্ষাগুলি প্রাথমিকভাবে:

A) Achievement tests / পারদর্শিতার অভীক্ষা B) Interest inventories and aptitude tests / আগ্রহের তালিকা এবং প্রবণতা অভীক্ষা C) Summative evaluations / সার্বিক মূল্যায়ন D) Diagnostic tests / নির্ণায়ক অভীক্ষা

Correct Answer: B) Interest inventories and aptitude tests / আগ্রহের তালিকা এবং প্রবণতা অভীক্ষা

Explanation: Career counseling aims to match an individual’s profile with suitable career paths. This is best achieved by assessing what they like to do (interest) and what they have the potential to do well (aptitude).
ব্যাখ্যা: কেরিয়ার কাউন্সেলিংয়ের লক্ষ্য হল একজন ব্যক্তির প্রোফাইলকে উপযুক্ত কেরিয়ার পথের সাথে মেলানো। এটি সবচেয়ে ভালোভাবে অর্জন করা যায় তারা কী করতে পছন্দ করে (আগ্রহ) এবং তারা কী ভালো করতে পারে (প্রবণতা) তা মূল্যায়ন করে।

93. The “table of specifications” is another name for:
“নির্দিষ্টকরণের সারণী” কিসের অন্য নাম?

A) The test paper itself / স্বয়ং পরীক্ষা পত্র B) The scoring key / স্কোরিং কী C) The test blueprint / পরীক্ষার ব্লুপ্রিন্ট D) The list of students taking the test / পরীক্ষা দেওয়া শিক্ষার্থীদের তালিকা

Correct Answer: C) The test blueprint / পরীক্ষার ব্লুপ্রিন্ট

Explanation: A blueprint, or table of specifications, is a chart that guides test construction. It specifies the number of items for each content area and each instructional objective, ensuring a balanced and valid test.
ব্যাখ্যা: একটি ব্লুপ্রিন্ট, বা নির্দিষ্টকরণের সারণী, একটি চার্ট যা পরীক্ষা নির্মাণে পথ দেখায়। এটি প্রতিটি বিষয়বস্তু ক্ষেত্র এবং প্রতিটি নির্দেশনামূলক উদ্দেশ্যের জন্য প্রশ্নের সংখ্যা নির্দিষ্ট করে, যা একটি ভারসাম্যপূর্ণ এবং বৈধ পরীক্ষা নিশ্চিত করে।
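
A table of specifications can be pictured as a small two-way grid. The sketch below stores one as a nested dictionary and totals the items by content area and by objective. The content areas, objective labels, and item counts are hypothetical and are not taken from any official blueprint.

```python
# Hypothetical blueprint: content areas (rows) x instructional objectives (columns)
blueprint = {
    "Measurement concepts": {"Knowledge": 4, "Understanding": 3, "Application": 1},
    "Reliability and validity": {"Knowledge": 3, "Understanding": 4, "Application": 3},
    "Test construction": {"Knowledge": 2, "Understanding": 3, "Application": 2},
}

def totals_by_content(bp):
    """Number of items allotted to each content area."""
    return {area: sum(cells.values()) for area, cells in bp.items()}

def totals_by_objective(bp):
    """Number of items allotted to each instructional objective."""
    totals = {}
    for cells in bp.values():
        for objective, count in cells.items():
            totals[objective] = totals.get(objective, 0) + count
    return totals

def weightage(bp):
    """Percentage of the whole test given to each content area."""
    per_area = totals_by_content(bp)
    total = sum(per_area.values())
    return {area: round(100 * n / total, 1) for area, n in per_area.items()}

print(totals_by_content(blueprint))
print(totals_by_objective(blueprint))
print(weightage(blueprint))
```

Checking the row and column totals before writing items is one simple way to see whether the planned test is balanced across content and objectives.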

94. A test which is administered and scored in a consistent, or “standard,” manner is called:
একটি পরীক্ষা যা একটি সামঞ্জস্যপূর্ণ, বা “মান,” পদ্ধতিতে পরিচালিত এবং স্কোর করা হয় তাকে বলা হয়:

A) A subjective test / একটি ব্যক্তিনিষ্ঠ পরীক্ষা B) A standardized test / একটি মানসম্মত পরীক্ষা C) A teacher-made test / একটি শিক্ষক-নির্মিত পরীক্ষা D) An informal test / একটি অনানুষ্ঠানিক পরীক্ষা

Correct Answer: B) A standardized test / একটি মানসম্মত পরীক্ষা

Explanation: The key feature of a standardized test is its uniform procedure for administration (e.g., same instructions, same time limits) and scoring. This ensures that results are comparable across different individuals and groups.
ব্যাখ্যা: একটি মানসম্মত পরীক্ষার প্রধান বৈশিষ্ট্য হল এর পরিচালনা (যেমন, একই নির্দেশাবলী, একই সময়সীমা) এবং স্কোরিংয়ের জন্য অভিন্ন পদ্ধতি। এটি নিশ্চিত করে যে ফলাফলগুলি বিভিন্ন ব্যক্তি এবং গোষ্ঠীর মধ্যে তুলনামূলক।

95. The most comprehensive and time-consuming method of evaluation is:
মূল্যায়নের সবচেয়ে ব্যাপক এবং সময়সাপেক্ষ পদ্ধতি হল:

A) Checklist / চেকলিস্ট B) Multiple Choice Test / বহুনির্বাচনী পরীক্ষা C) Case Study / কেস স্টাডি D) Rating Scale / রেটিং স্কেল

Correct Answer: C) Case Study / কেস স্টাডি

Explanation: A case study is an in-depth, intensive investigation of a single individual, group, or situation. It uses multiple sources of data (interviews, observations, records) to build a detailed, holistic picture, making it very comprehensive but also very time-consuming.
ব্যাখ্যা: একটি কেস স্টাডি হল একজন একক ব্যক্তি, গোষ্ঠী বা পরিস্থিতির একটি গভীর, নিবিড় তদন্ত। এটি একটি বিস্তারিত, সামগ্রিক চিত্র তৈরি করতে একাধিক ডেটা উৎস (সাক্ষাৎকার, পর্যবেক্ষণ, রেকর্ড) ব্যবহার করে, যা এটিকে খুব ব্যাপক কিন্তু খুব সময়সাপেক্ষ করে তোলে।

96. What does a percentile rank of 70 (P70) mean?
৭০ পার্সেন্টাইল র‍্যাঙ্ক (P70) এর মানে কী?

A) The student answered 70% of the questions correctly. / শিক্ষার্থী ৭০% প্রশ্নের সঠিক উত্তর দিয়েছে। B) The student scored better than 70% of the people in the norm group. / শিক্ষার্থী নর্ম গ্রুপের ৭০% লোকের চেয়ে ভালো স্কোর করেছে। C) 70 students scored higher than this student. / ৭০ জন শিক্ষার্থী এই শিক্ষার্থীর চেয়ে বেশি স্কোর করেছে। D) The test has a difficulty level of 0.70. / পরীক্ষার কাঠিন্যের স্তর ০.৭০।

Correct Answer: B) The student scored better than 70% of the people in the norm group. / শিক্ষার্থী নর্ম গ্রুপের ৭০% লোকের চেয়ে ভালো স্কোর করেছে।

Explanation: A percentile rank is a norm-referenced score that indicates the percentage of people in the reference group who scored at or below a particular score. So, P70 means the individual’s score is equal to or higher than the scores of 70% of the norm group.
ব্যাখ্যা: একটি পার্সেন্টাইল র‍্যাঙ্ক একটি নর্ম-ভিত্তিক স্কোর যা রেফারেন্স গ্রুপের সেই শতাংশ নির্দেশ করে যারা একটি নির্দিষ্ট স্কোরের সমান বা তার নিচে স্কোর করেছে। সুতরাং, P70 মানে ব্যক্তির স্কোর নর্ম গ্রুপের ৭০% এর স্কোরের সমান বা তার চেয়ে বেশি।
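
The definition in the explanation above translates directly into a short calculation. The sketch below counts the share of a norm group scoring at or below a given score. The norm-group scores are invented for illustration, and other conventions exist (for example, counting only scores strictly below, or counting half of the tied scores).

```python
def percentile_rank(score, norm_scores):
    """Percentage of the norm group scoring at or below `score`."""
    at_or_below = sum(1 for s in norm_scores if s <= score)
    return 100 * at_or_below / len(norm_scores)

# Hypothetical norm group of ten scores
norm_group = [35, 42, 48, 51, 55, 58, 62, 67, 73, 80]
print(percentile_rank(62, norm_group))  # 70.0
```

Seven of the ten norm-group scores are at or below 62, so a score of 62 has a percentile rank of 70. The result says nothing about how many questions were answered correctly.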

97. “Bluffing” or writing irrelevant material is a common problem in:
“ব্লাফিং” বা অপ্রাসঙ্গিক বিষয় লেখা একটি সাধারণ সমস্যা কিসে?

A) Multiple Choice Tests / বহুনির্বাচনী পরীক্ষায় B) True-False Tests / সত্য-মিথ্যা পরীক্ষায় C) Essay Type Tests / প্রবন্ধমূলক পরীক্ষায় D) Matching Tests / মেলানো পরীক্ষায়

Correct Answer: C) Essay Type Tests / প্রবন্ধমূলক পরীক্ষায়

Explanation: In essay tests, students who do not know the exact answer may try to “bluff” by writing long, vague, or irrelevant answers in the hope of getting some partial credit. This is not possible in objective test formats.
ব্যাখ্যা: প্রবন্ধমূলক পরীক্ষায়, যে শিক্ষার্থীরা সঠিক উত্তর জানে না তারা কিছু আংশিক ক্রেডিট পাওয়ার আশায় দীর্ঘ, অস্পষ্ট বা অপ্রাসঙ্গিক উত্তর লিখে “ব্লাফ” করার চেষ্টা করতে পারে। এটি বস্তুনিষ্ঠ পরীক্ষার বিন্যাসে সম্ভব নয়।

98. A test for measuring interest is a type of:
আগ্রহ পরিমাপের জন্য একটি পরীক্ষা হল এক ধরণের:

A) Cognitive test / জ্ঞানীয় পরীক্ষা B) Affective test / অনুভূতিমূলক পরীক্ষা C) Psychomotor test / মনশ্চালকমূলক পরীক্ষা D) Performance test / পারফরম্যান্স পরীক্ষা

Correct Answer: B) Affective test / অনুভূতিমূলক পরীক্ষা

Explanation: The affective domain deals with emotions, feelings, attitudes, values, and interests. Therefore, a test designed to measure a person’s interests falls under the category of affective assessment.
ব্যাখ্যা: অনুভূতিমূলক ক্ষেত্র আবেগ, অনুভূতি, মনোভাব, মূল্যবোধ এবং আগ্রহ নিয়ে কাজ করে। অতএব, একজন ব্যক্তির আগ্রহ পরিমাপের জন্য ডিজাইন করা একটি পরীক্ষা অনুভূতিমূলক মূল্যায়নের বিভাগের অধীনে পড়ে।

99. If a test measures a single, specific skill (e.g., addition of two-digit numbers), it is likely a:
যদি একটি পরীক্ষা একটি একক, নির্দিষ্ট দক্ষতা (যেমন, দুই-অঙ্কের সংখ্যার যোগ) পরিমাপ করে, তবে এটি সম্ভবত একটি:

A) Survey Test / সমীক্ষা অভীক্ষা B) Mastery Test / পারদর্শিতা (মাস্টারি) অভীক্ষা C) Aptitude Test / প্রবণতা অভীক্ষা D) Personality Test / ব্যক্তিত্বের অভীক্ষা

Correct Answer: B) Mastery Test / পারদর্শিতা (মাস্টারি) অভীক্ষা

Explanation: A mastery test is a type of criterion-referenced test that focuses on determining whether a student has mastered a specific, narrowly defined skill or objective. The result is typically a pass/fail or mastery/non-mastery decision.
ব্যাখ্যা: একটি মাস্টারি পরীক্ষা হল এক ধরণের নির্ণায়ক-ভিত্তিক পরীক্ষা যা একজন শিক্ষার্থী একটি নির্দিষ্ট, সংকীর্ণভাবে সংজ্ঞায়িত দক্ষতা বা উদ্দেশ্য আয়ত্ত করেছে কিনা তা নির্ধারণের উপর দৃষ্টি নিবদ্ধ করে। ফলাফল সাধারণত একটি পাস/ফেল বা মাস্টারি/নন-মাস্টারি সিদ্ধান্ত।
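
The mastery/non-mastery decision described above amounts to comparing a score with a fixed criterion. The sketch below assumes an 80% cutoff, which is an illustrative choice and not a value given in the source; in practice the cutoff is set for each skill.

```python
def mastery_decision(correct, total_items, cutoff=0.8):
    """Classify a student against a fixed criterion, not against other students."""
    proportion = correct / total_items
    return "mastery" if proportion >= cutoff else "non-mastery"

# Hypothetical results on a 10-item test of two-digit addition
print(mastery_decision(9, 10))  # mastery
print(mastery_decision(7, 10))  # non-mastery
```

The decision depends only on the individual's score and the criterion, which is what distinguishes this from a norm-referenced interpretation such as a percentile rank.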

100. The ultimate goal of test construction and standardization is to create a tool that is:
পরীক্ষা নির্মাণ এবং মাননির্ণয়ের চূড়ান্ত লক্ষ্য হল এমন একটি উপকরণ তৈরি করা যা হল:

A) Easy for all students to pass / সমস্ত শিক্ষার্থীদের জন্য পাস করা সহজ B) As valid and reliable as possible / যতটা সম্ভব বৈধ এবং নির্ভরযোগ্য C) Very short and quick to administer / খুব ছোট এবং দ্রুত পরিচালনা করা যায় D) Difficult enough to fail most students / বেশিরভাগ শিক্ষার্থীকে ফেল করানোর জন্য যথেষ্ট কঠিন

Correct Answer: B) As valid and reliable as possible / যতটা সম্ভব বৈধ এবং নির্ভরযোগ্য

Explanation: The entire scientific process of developing a test—from writing items to conducting item analysis and establishing norms—is aimed at ensuring the final instrument measures what it’s supposed to measure (validity) and does so consistently (reliability). These are the two most fundamental qualities of a good test.
ব্যাখ্যা: একটি পরীক্ষা বিকাশের সমগ্র বৈজ্ঞানিক প্রক্রিয়া—প্রশ্ন লেখা থেকে শুরু করে প্রশ্ন বিশ্লেষণ এবং নর্ম প্রতিষ্ঠা পর্যন্ত—এর লক্ষ্য হল চূড়ান্ত উপকরণটি যা পরিমাপ করার কথা তা পরিমাপ করে (বৈধতা) এবং তা ধারাবাহিকভাবে করে (নির্ভরযোগ্যতা) তা নিশ্চিত করা। এগুলি একটি ভালো পরীক্ষার দুটি সবচেয়ে মৌলিক গুণ।