Bem-vindo(a) ao Scribd!

Pular no carrossel

Dryrun

Enviado por

zollic

0% acharam este documento útil (0 voto)

57 visualizações3 páginas

dryrun

Título original

dryrun

Direitos autorais

Formatos disponíveis

PDF, TXT ou leia online no Scribd

Compartilhar este documento

Compartilhar ou incorporar documento

Opções de compartilhamento

Você considera este documento útil?

Este conteúdo é inapropriado?

Denunciar este documento

dryrun

Direitos autorais:

Formatos disponíveis

Baixe no formato PDF, TXT ou leia online no Scribd

Sinalizar o conteúdo como inadequado

0% acharam este documento útil (0 voto)

57 visualizações3 páginas

Dryrun

Enviado por

zollic

dryrun

Direitos autorais:

Formatos disponíveis

Baixe no formato PDF, TXT ou leia online no Scribd

Sinalizar o conteúdo como inadequado

Pular para a página

Você está na página 1de 3

Pesquisar no documento

IR&TM Probeklausur

Zugelassene Hilfsmittel: Taschenrechner; sonst keine.

Bearbeitungszeit: 60 Minuten
It will be impossible to complete all assignments. Dont worry, this is
not required for getting the top grade. The final exam will have a few more
exercises, but this does not influence the number of points needed to pass.
You may answer the questions in English or German.

Questions

Each question is 2 points.

1.0.1

Question

Define the number of types/tokens in a sentence.

1.0.2

Question

What is lemmatization? Give an example.

1.0.3

Question

Define Levenshtein edit distance.

1.0.4

Question

What is the feast or famine problem?

1.0.5

Question

Write down the formula for cosine similarity between query q and document
d.
1.0.6

Question

What are the components of an information retrieval benchmark?

1.0.7

Question

Define the kappa measure.

1.0.8

Question

What does marginal relevance measure?

1.0.9

Question

What is the main independence assumption of Naive Bayes?

1.0.10

Question

Why is feature selection used? Give the two main reasons?

1.0.11

Question

For linearly separable problem, how many different linear decision boundaries are there that separate the two classes of the training set perfectly?
1.0.12

Question

How does an SVM classify a test set point in the margin?

1.0.13

Question

Does K-means find the global optimum? Why (not)?

1.0.14

Question

What are the advantages of a search engine ad compared to other types of

ads (radio, television, newspaper)?
1.0.15

Question

What does politeness mean for a crawler?

Problems

2.0.16

Problem 10 points

(i) Compute the Levenshtein matrix and the distance l for the distance
between the strings apfel (input) and poems (output). Assume that
all operations have the same cost. (ii) How many different sequences of l
Levenshtein operations are there? List them.
2.0.17

Problem 12 points

Compute the similarity between the query smart phones and the document smart phones and video phones at smart prices by filling out the
empty columns in the table below. Use the similarity function ddd.qqq =
lnc.ltn (as given in the table). Assume N = 10,000,000. Treat and and at
as stop words. What is the final similarity score? When computing length
2

normalized weights you can round the length of a vector to the nearest
integer.
query
document
word
tf-raw tf-wght df
idf weight tf-raw tf-wght nlized
smart
5,000
video
50,000
phones
25,000
prices
30,000
Here are some log values you may need:
x
1 2
3
4
5
6
7
8
9
log10 x 0 0.3 0.5 0.6 0.7 0.8 0.8 0.9 1.0
2.0.18

Problem 15 points

Figure 1: A web graph

Compute PageRank for the web graph in Figure 1 for each of the three
pages.
Assume that at each step of the PageRank random walk, we teleport to
a random page with probability 0.6, with a uniform distribution over which
particular page we teleport to.
Using symmetries to simplify and solving with linear equations might be
easier than using iterative methods.
2.0.19

Problem 10 points

Rank the documents in collection {d1 , d2 } for query q using the language
model approach to IR introduced in class. Use the mixture coefficient =
0.4. Ignore punctuation marks.
d1 : In Financial Crisis, No Prosecution of Top Figures
d2 : Wall Street and the Financial Crisis: Anatomy of a Financial
Collapse
Query q: Financial Crisis

product

Você também pode gostar

Advanced C++ Interview Questions You'll Most Likely Be Asked: Job Interview Questions Series
No Everand
Advanced C++ Interview Questions You'll Most Likely Be Asked: Job Interview Questions Series
Vibrant Publishers
Ainda não há avaliações
VBA Excel BOOK
Documento131 páginas
VBA Excel BOOK
Sokol
100% (8)
Elements of Art
Documento1 página
Elements of Art
samson8cindy8lou
Ainda não há avaliações
Programming Concepts in C++
No Everand
Programming Concepts in C++
Robert Burns
Ainda não há avaliações
Practical Design of Experiments: DoE Made Easy
No Everand
Practical Design of Experiments: DoE Made Easy
Colin Hardwick
Nota: 4.5 de 5 estrelas
4.5/5 (7)
Midterm2006 Sol Csi4107
Documento9 páginas
Midterm2006 Sol Csi4107
martin
100% (2)
Numerical Methods
Documento32 páginas
Numerical Methods
mojiryhamid
Ainda não há avaliações
Week 7 Sex Limited Influenced
Documento19 páginas
Week 7 Sex Limited Influenced
Lorelyn Villamor
Ainda não há avaliações
Grade 7 Exam
Documento3 páginas
Grade 7 Exam
Mikko Gomez
Ainda não há avaliações
Service Manual: SV01-NHX40AX03-01E NHX4000 MSX-853 Axis Adjustment Procedure of Z-Axis Zero Return Position
Documento5 páginas
Service Manual: SV01-NHX40AX03-01E NHX4000 MSX-853 Axis Adjustment Procedure of Z-Axis Zero Return Position
mahdi elmay
100% (3)
WebLMT Help
Documento12 páginas
WebLMT Help
João Lopes
Ainda não há avaliações
Sylvia Langfield, Dave Duddell - Cambridge International As and A Level Computer Science Coursebook (2019, Cambridge University Press) - Libgen - Li
Documento554 páginas
Sylvia Langfield, Dave Duddell - Cambridge International As and A Level Computer Science Coursebook (2019, Cambridge University Press) - Libgen - Li
Sahil Kaundun
Ainda não há avaliações
Hima OPC Server Manual
Documento36 páginas
Hima OPC Server Manual
Ashkan Khajouie
100% (3)
of Thesis Project
Documento2 páginas
of Thesis Project
moon
Ainda não há avaliações
Métodos numéricos aplicados a Ingeniería: Casos de estudio usando MATLAB
No Everand
Métodos numéricos aplicados a Ingeniería: Casos de estudio usando MATLAB
Héctor Jorquera González
Nota: 5 de 5 estrelas
5/5 (1)
Hw1 Sum2013
Documento8 páginas
Hw1 Sum2013
Dylan Erickson
Ainda não há avaliações
Spa Sem 2 PDF
Documento754 páginas
Spa Sem 2 PDF
Aayushi Patel
Ainda não há avaliações
Module 2: Problem Solving Techniques Unit 1
Documento7 páginas
Module 2: Problem Solving Techniques Unit 1
Alipriya Chatterjee
Ainda não há avaliações
Cs201 Midterm Subject
Documento14 páginas
Cs201 Midterm Subject
Azhar Dogar
Ainda não há avaliações
Unit 2
Documento8 páginas
Unit 2
guddinan makure
Ainda não há avaliações
SPA Ebook PDF
Documento375 páginas
SPA Ebook PDF
Abhishek
Ainda não há avaliações
Lec3a Asymptotic Growth Rate
Documento21 páginas
Lec3a Asymptotic Growth Rate
Online Worker
Ainda não há avaliações
Modelling With Linear Prog
Documento22 páginas
Modelling With Linear Prog
vijaydh
Ainda não há avaliações
Cs201 Final Notce by Mohsin Raza (1) - 1
Documento30 páginas
Cs201 Final Notce by Mohsin Raza (1) - 1
Umme Rubab saleem
Ainda não há avaliações
Lab 2
Documento5 páginas
Lab 2
Nathan Wang
Ainda não há avaliações
cs703 Mid
Documento11 páginas
cs703 Mid
Mian Ejaz
Ainda não há avaliações
Homework #1: MA 402 Mathematics of Scientific Computing Due: Wednesday, January 19
Documento3 páginas
Homework #1: MA 402 Mathematics of Scientific Computing Due: Wednesday, January 19
Andrea Stancescu
Ainda não há avaliações
Lab 1e Fixed-Point Output Fall 2010 1e.1
Documento8 páginas
Lab 1e Fixed-Point Output Fall 2010 1e.1
iky77
Ainda não há avaliações
Lab03 (R)
Documento12 páginas
Lab03 (R)
Javaid Musa Bojol
Ainda não há avaliações
Computer Architectur
Documento6 páginas
Computer Architectur
echandana
Ainda não há avaliações
Introduction To Algorithms Second Edition Solutions Manual: Christopher Clark University of Virginia Summer 2010
Documento9 páginas
Introduction To Algorithms Second Edition Solutions Manual: Christopher Clark University of Virginia Summer 2010
Aliga8er
0% (1)
02 Greedy Algorithms Problems
Documento18 páginas
02 Greedy Algorithms Problems
amarjeet
Ainda não há avaliações
Lab02 (R)
Documento10 páginas
Lab02 (R)
Javaid Musa Bojol
Ainda não há avaliações
Jbiet Dept of IT: J.B.Institute of Engg & Technology
Documento18 páginas
Jbiet Dept of IT: J.B.Institute of Engg & Technology
Sreekanth Nara
Ainda não há avaliações
Assignment One: Part One: Warm-up/Practice (Non-Graded) Questions
Documento4 páginas
Assignment One: Part One: Warm-up/Practice (Non-Graded) Questions
Andre Knight
Ainda não há avaliações
Practice Final Soln
Documento17 páginas
Practice Final Soln
Jimmie J Mshumbusi
Ainda não há avaliações
NPC - Print It
Documento22 páginas
NPC - Print It
bayentapas
Ainda não há avaliações
Ee263 Homework Problems
Documento6 páginas
Ee263 Homework Problems
uzmlivznd
100% (1)
Lab - 09 - Nested Loops Functions - BESE-10
Documento6 páginas
Lab - 09 - Nested Loops Functions - BESE-10
Waseem Abbas
Ainda não há avaliações
CENG400 Midterm Fall 2015
Documento10 páginas
CENG400 Midterm Fall 2015
Mohamad Issa
Ainda não há avaliações
Lect 0825
Documento5 páginas
Lect 0825
blabla9999999
Ainda não há avaliações
Finals 2013 Solutions
Documento10 páginas
Finals 2013 Solutions
Вера Оконешникова
Ainda não há avaliações
Assignment 2
Documento2 páginas
Assignment 2
Siddique Rana
Ainda não há avaliações
CS101 Introduction To Computing: Final Term Examination - February 2005
Documento5 páginas
CS101 Introduction To Computing: Final Term Examination - February 2005
Muhammad Ayub
Ainda não há avaliações
CS201 Final Term Subjectivebymoaaz PDF
Documento18 páginas
CS201 Final Term Subjectivebymoaaz PDF
Muhammad Zeeshan
Ainda não há avaliações
Term Work of USIT101:communication Skill I) Ii) Iii) Practical (Case Studies) USIT1P1: I) Ii) Iii)
Documento5 páginas
Term Work of USIT101:communication Skill I) Ii) Iii) Practical (Case Studies) USIT1P1: I) Ii) Iii)
KaranPatil
Ainda não há avaliações
Professor Douglas Troeger The City College of New York Spring, 2016
Documento44 páginas
Professor Douglas Troeger The City College of New York Spring, 2016
Daniel Gaston
Ainda não há avaliações
Input/Output Techniques: Example 1
Documento7 páginas
Input/Output Techniques: Example 1
Vardaan Bajaj
Ainda não há avaliações
Input/Output Techniques: Example 1
Documento7 páginas
Input/Output Techniques: Example 1
Vardaan Bajaj
Ainda não há avaliações
Computational Complexity: Section 1: Why Is It Important?
Documento6 páginas
Computational Complexity: Section 1: Why Is It Important?
Prabhuanand Sivashanmugam
Ainda não há avaliações
Cs 201 Solved Midterm Papers 1
Documento26 páginas
Cs 201 Solved Midterm Papers 1
Rana Sameer Zulfiqar
Ainda não há avaliações
Matlab Marina For Loops Exercises
Documento4 páginas
Matlab Marina For Loops Exercises
Eman Arshad
Ainda não há avaliações
Exam #1 For Computer Networks SOLUTIONS
Documento4 páginas
Exam #1 For Computer Networks SOLUTIONS
87bb
Ainda não há avaliações
Final Exam Su2017
Documento9 páginas
Final Exam Su2017
Juan Santos
Ainda não há avaliações
Sheet 03
Documento3 páginas
Sheet 03
eir.gn
Ainda não há avaliações
Report Indutry
Documento29 páginas
Report Indutry
anshikag1402
Ainda não há avaliações
CS201 - Introduction To Programmming Solved Subjective Questions From Spring 2010 Final Term Papers
Documento15 páginas
CS201 - Introduction To Programmming Solved Subjective Questions From Spring 2010 Final Term Papers
Asjid Naeem
Ainda não há avaliações
01 - CO - Introduction To Visual Studio and C# - PracticeAssignments
Documento8 páginas
01 - CO - Introduction To Visual Studio and C# - PracticeAssignments
Henan Nazeem
Ainda não há avaliações
C Technical
Documento118 páginas
C Technical
Sarath Dasari
Ainda não há avaliações
Chapter 11: Limitations of Algorithms: P, NP, and NP-complete Problems
Documento3 páginas
Chapter 11: Limitations of Algorithms: P, NP, and NP-complete Problems
Mahadev Educations
Ainda não há avaliações
Appendix A: Answers To The Test Your Knowledge Questions
Documento62 páginas
Appendix A: Answers To The Test Your Knowledge Questions
Otebele Jemimah
Ainda não há avaliações
Lecture1 6 PDF
Documento30 páginas
Lecture1 6 PDF
Colin
Ainda não há avaliações
cs251 Quiz4 PDF
Documento13 páginas
cs251 Quiz4 PDF
Yash Gupta
Ainda não há avaliações
Project 02
Documento4 páginas
Project 02
Eldin Lina
Ainda não há avaliações
2013 Apr Cert CNT Report
Documento11 páginas
2013 Apr Cert CNT Report
michaelmikeal
Ainda não há avaliações
Sample Data Structures Questions
Documento69 páginas
Sample Data Structures Questions
rethic
Ainda não há avaliações
Good Habits for Great Coding: Improving Programming Skills with Examples in Python
No Everand
Good Habits for Great Coding: Improving Programming Skills with Examples in Python
Michael Stueben
Ainda não há avaliações
Aleksandrov I Dis 1-50.ru - en
Documento50 páginas
Aleksandrov I Dis 1-50.ru - en
Nabeel Adil
Ainda não há avaliações
Obesity - The Health Time Bomb: ©LTPHN 2008
Documento36 páginas
Obesity - The Health Time Bomb: ©LTPHN 2008
EVA PUTRANTO
100% (2)
KP Tevta Advertisement 16-09-2019
Documento4 páginas
KP Tevta Advertisement 16-09-2019
Ishaq Amin
Ainda não há avaliações
CEE Annual Report 2018
Documento100 páginas
CEE Annual Report 2018
BusinessTech
100% (1)
18 June 2020 12:03: New Section 1 Page 1
Documento4 páginas
18 June 2020 12:03: New Section 1 Page 1
KarthikNayaka
Ainda não há avaliações
Wilcoxon Matched Pairs Signed Rank Test
Documento3 páginas
Wilcoxon Matched Pairs Signed Rank Test
Dawn Ilish Nicole Diez
Ainda não há avaliações
The JHipster Mini Book 2
Documento129 páginas
The JHipster Mini Book 2
tyulist
100% (1)
Farmer Producer Companies in Odisha
Documento34 páginas
Farmer Producer Companies in Odisha
Suraj Gantayat
Ainda não há avaliações
8.ZXSDR B8200 (L200) Principle and Hardware Structure Training Manual-45
Documento45 páginas
8.ZXSDR B8200 (L200) Principle and Hardware Structure Training Manual-45
mehdi_mehdi
Ainda não há avaliações
MGMT Audit Report Writing
Documento28 páginas
MGMT Audit Report Writing
Andrei Iulian
Ainda não há avaliações
A Case Study of Coustomer Satisfaction in Demat Account At: A Summer Training Report
Documento110 páginas
A Case Study of Coustomer Satisfaction in Demat Account At: A Summer Training Report
Deepak Singhal
Ainda não há avaliações
Week 7
Documento24 páginas
Week 7
Priyank Patel
Ainda não há avaliações
B.SC BOTANY Semester 5-6 Syllabus June 2013
Documento33 páginas
B.SC BOTANY Semester 5-6 Syllabus June 2013
Barnali Dutta
Ainda não há avaliações
Borang Ambulans Call
Documento2 páginas
Borang Ambulans Call
leo89azman
100% (1)
Career Essay 1
Documento2 páginas
Career Essay 1
api-572592063
Ainda não há avaliações
Talking Art As The Spirit Moves Us
Documento7 páginas
Talking Art As The Spirit Moves Us
UCLA_SPARC
Ainda não há avaliações
Stearns 87700 Series Parts List
Documento4 páginas
Stearns 87700 Series Parts List
Yorkist
Ainda não há avaliações
Residual Power Series Method For Obstacle Boundary Value Problems
Documento5 páginas
Residual Power Series Method For Obstacle Boundary Value Problems
Sayiqa Jabeen
Ainda não há avaliações
The Magic Drum
Documento185 páginas
The Magic Drum
tanishgiri2012
Ainda não há avaliações
Harper Independent Distributor Tri Fold
Documento2 páginas
Harper Independent Distributor Tri Fold
Yipper Shnipper
Ainda não há avaliações
Agnes de Mille
Documento3 páginas
Agnes de Mille
Marie-Maxence De Rouck
Ainda não há avaliações
Thermally Curable Polystyrene Via Click Chemistry
Documento4 páginas
Thermally Curable Polystyrene Via Click Chemistry
Danesh Az
Ainda não há avaliações
13 Adsorption of Congo Red A Basic Dye by ZnFe-CO3
Documento10 páginas
13 Adsorption of Congo Red A Basic Dye by ZnFe-CO3
Jorellie Petalver
Ainda não há avaliações