...

Welcome to CLEF 2005 - Consiglio Nazionale delle Ricerche

by user

on
Category: Documents
23

views

Report

Comments

Transcript

Welcome to CLEF 2005 - Consiglio Nazionale delle Ricerche
Welcome to CLEF 2007
Carol Peters
ISTI-CNR Pisa, Italy
CLEF Objectives
 Stimulate the development of multilingual IR systems
for European languages
 To create a CLIR/MLIA community
 Construct publicly available test-suites
 Conducting annual evaluation campaigns
 Designing tracks/tasks to meet emerging needs and
to stimulate research in the”right” direction
CLEF 2007 Workshop, Budapest, Hungary
19-21 September 2007
CLEF Coordination
CLEF is coordinated by the Istituto di Scienza e Tecnologie dell'Informazione, Consiglio Nazionale delle Ricerche, Pisa
The following Institutions are contributing to the organisation of the different tracks of the CLEF 2007campaign:















Inst. For Information technology, Hyderabad, India
Centre for the Evaluation of Human Language and
Multimodal Communication Technologies (CELCT),

Inst. of Formal and Applied Linguistics, Charles
Trento, Italy
University, Czech Rep
College of Information Studies and Institute for

LSI-UNED, Madrid, Spain
Advanced Computer Studies, U. Maryland, USA

Linguateca, Sintef, Oslo, Norway
Dept. of Computer Science, U. Indonesia

Linguistic Modelling Lab., Bulgarian Acad Sci
Depts. of Computer Science & Medical Informatics,

Microsoft Research Asia
RWTH Aachen U., Germany

NIST, USA
Dept. of Computer Science and Information Systems,

Biomedial Informatics, Oregon Health and Science
U. Limerick, Ireland
University, USA
Dept. of Computer Science and Information

Research Computing Center of Moscow State U.
Engineering, National U. Taiwan

Research Institute for Linguistics, Hungarian
Dept. of Information Engineering, U. Padua, Italy
Academy of Sciences
Dept. of Information Sci, U. Hildesheim, Germany

School of Computer Science and Mathematics,
Dept. of Information Studies, U. Sheffield, UK
Victoria U., Australia
Evaluations and Language Resources Distribution

School of Computing, DCU, Ireland
Agency Sarl, Paris, France

UC Data Archive and School of Information
Fondazione Bruno Kessler FBK-irst, Trento, Italy
Management and Systems, UC Berkeley, USA
German Research Centre for Artificial Intelligence,

University "Alexandru Ioan Cuza", IASI, Romania
DFKI, Saarbrücken, Germany

U. Hospitals and U.of Geneva, Switzerland
Information and Language Processing Systems, U.

Vienna University of Technology, Austria
Amsterdam, Netherlands
IZ Bonn, Germany
CLEF 2007 Workshop, Budapest, Hungary
19-21 September 2007
CLEF
Steering Committee




















Maristella Agosti, U.Padove, Italy
Martin Braschler, Zurich, Switzerland
Amedeo Cappelli, ISTI-CNR & CELCT, Italy
Hsin-Hsi Chen, National Taiwan U., Taipei, Taiwan
Khalid Choukri, ELRA/ELDA, Paris, France
Paul Clough, University of Sheffield, UK
Thomas Deselaers, RWTH Aachen University, Germany
Giorgio Di Nunzio, U. Padova, Italy
David A. Evans, Clairvoyance Corporation, USA
Nicola Ferro, U. Padova, Italy
Christian Fluhr, CEA-LIST, Fontenay-aux-Roses, France
Norbert Fuhr, University of Duisburg, Germany
Frederic C. Gey, U.C. Berkeley, USA
Julio Gonzalo, LSI-UNED, Madrid, Spain
Donna Harman, NIST, USA
Gareth Jones, Dublin City University, Ireland
Franciska de Jong, University of Twente, Netherlands
Noriko Kando, NII, Tokyo, Japan
Jussi Karlgren, SICS, Sweden
Michael Kluck, German Institute for International and
Security Affairs, Berlin, Germany


















Natalia Loukachevitch, Moscow State University, Russia
Bernardo Magnini, ITC-irst, Trento, Italy
Thomas Mandl, U. Hildesheim, Germany
Paul McNamee, Johns Hopkins University, USA
Henning Müller, University & University Hospitals of
Geneva, Switzerland
Douglas W. Oard, University of Maryland, USA
Anselmo Peňas, LSI-UNED, Madrid, Spain
Maarten de Rijke, University of Amsterdam, Netherlands
Diana Santos, Linguateca, Sintef, Oslo, Norway
Jacques Savoy, University of Neuchatel, Switzerland
Peter Schäuble, Eurospider Information Technologies,
Switzerland
Richard Sutcliffe, University of Limerick, Ireland
Max Stempfhuber, Informationszentrum
Sozialwissenschaften Bonn, Germany
Hans Uszkoreit, German Research Center for Artificial
Intelligence (DFKI), Germany
Felisa Verdejo, LSI-UNED, Madrid, Spain
José Luis Vicedo, University of Alicante, Spain
Ellen Voorhees, NIST, USA
Christa Womser-Hacker, University of Hildesheim, Germany
CLEF 2007 Workshop, Budapest, Hungary
19-21 September 2007
CLEF 2007:
Track Coordinators
 Ad Hoc: Giorgio Di Nunzio, Nicola Ferro and Thomas Mandl
 Domain-Specific: Vivien Petras, Stefan Baerisch, Maximillian
Stempfhuber
 QA@CLEF: Danilo Giampiccolo, Bernardo Magnini, Anselmo
Peñas, Christelle Ayache, Petya Osenova,, Maarten de Rijke,
Bogdan Sacaleanu, Diana Santos and Richard Sutcliffe
 ImageCLEF: Allan Hanbury, Paul Clough, Henning Müller,
Thomas Deselaers , Michael Grubinger, Jayashree Kalpathy–
Cramer, and William Hersh
 CL-SR: Douglas W. Oard, Gareth J. F. Jones, and Pavel Pecina
 Web-CLEF: Valentin Jijkoun and Maarten de Rijke
 GeoCLEF: Thomas Mandl, Fredric Gey, Giorgio Di Nunzio, Nicola
Ferro, Ray Larson, Mark Sanderson, Diana Santos, Christa
Womser-Hacker, Xing Xie
CLEF 2007 Workshop, Budapest, Hungary
19-21 September 2007
CLEF 2007:
Participating Groups





















Brown U., USA
California State U. SanMarcos, USA**
Charles U., Prague, Czech Rep.
Daedalus & Madrid Univs, Spain ****
Ching Yun Univ., Taiwan
DFKI-Artificial Intelligence, DE****
Dokuz Eylul U.,Turkey*
Dublin City U. - Comp.Sci., Ireland ***
Fondazione Bruno Kessler********
Helsinki U. of Technology
Hungarian Acad. Sci.
IDIAP Research Inst., CH
Imperial College, London, UK**
Ist.Nac.Astrofisica, Optica, Electronica,
Mexico**
Indian Statistical Inst., India*
Indian Inst. Technology (IIT-Bombay)
Indian Inst. Technology (IITKharagpur)
Inst.Infocomm Research, Singapore **
Inst. Superior Técníco (DEI-IST)
IPAL-CNRS (IR2), Singapore ****
IRIT / SIG - Toulouse *****
Jadavpur University, Kolkata, India
Johns Hopkins U., USA *******
Language Computer Corp., USA*

LIMSI-CNRS, France ****



























Linguateca-Sintef, Norway ***
Linguit Ltd, UK
Microsoft Asia*
Microsoft India
MRIM Group – LIG, Grenoble*
Nat. Inst.Informatics, Japan ***
Nat.Taiwan U. - Comp-Sci, *****
Open Text Corp.(ex Hummingbird)
Oregon Health & Sci. U., USA **
Priberam Informatica, Portugal *
Research Inst. for AI of Romaian
Academy*
RWTH Aaachen-CS., Germany ***
RWTH Aachen - Med.Inf., DE***
SUNY Buffalo – Informat, USA ****
SYNAPSE Développement, France**
Tech U. Chemnitz, Germany*
Tokyo Inst. Technology, Japan*
U.Alicante, Spain (2) ******
U.AI.I Cuza Iasi, Romania*
U.Amsterdam - Informatics, N ******
U. Basil, Seitzerland
U. Chicago, USA **
U.Concordia - CINDI, Canada**
U.Concordia - CLAK, Canada
U.Coruna & U.Sunderland, ES/UK*





Univ. Evora, Portugal **
U.Freiburg – Pattern Recog., Germany U. & Hospitals
Geneva, CH ***
U.Groningen - Inf.Sci, The Netherlands** (2)
U.Hagen – IICS, Germany ****
U.Hildesheim - Inf.Sci, Germany *** *





















U.Indonesia - Comp.Sci, Indonesia **
U.Jaen - Intell.Systems, Spain ******
U.Liege - Elect.Eng.&CS, Belgium**
U.Lisbon – Informatics, Portugal ***
Univ. Macquarie, Australia
Univ. Nacional Colombia
U.Neuchatel – Informatique, Switzerland ******
Univ. Nottingham, UK
U.Ottawa - IT & Eng, Canada*
U.Politecnica Catalunya – TALP, Spain**
U.Politecnica Valencia - Comp.Sci, Spain**
U. Porto, Portugal*
U.Salamanca – REINA, Spain *****
U.Stockholm, NLP, Sweden ***
U.Tampere, Fiinland ****
U.Wolverhampton, UK *
UC Berkeley - IM&S-1, USA *******
UNED-LSI, Spain ******
Univ. West Bohemia, Czech rRp.*
Vienna Univ. Technology, Austria
Xerox XRCE, France *
CLEF 2007 Workshop, Budapest, Hungary
19-21 September 2007
CLEF: Trend in
Participation
CLEF 2000-2007 Participation
100
90
80
70
60
50
40
30
20
10
0
2000
Oceania
South America
North America
Asia
Europe
2001
2002
2003
2004
2005
2006
2007
Europe = 51(59.5); N. America = 14(4.5); Asia = 14(10), S. America = 1(4), Oceania = 1(2)
CLEF 2007 Workshop, Budapest, Hungary
19-21 September 2007
CLEF 2007 Tracks
 multilingual textual document retrieval on news
collections (Ad Hoc)
 mono- and cross-language information on structured
scientific data (Domain-Specific)
 multiple language question answering (QA@CLEF)
 cross-language retrieval in image collections
(ImageCLEF)
 cross-language spoken document retrieval (CL-SR)
 multilingual retrieval of Web documents (WebCLEF)
 cross-language geographical retrieval (GeoCLEF)
Plus: CLEF@SemEval and CLEF@MorphoChallenge
CLEF 2007 Workshop, Budapest, Hungary
19-21 September 2007
No. of Participants
per Track




Ad Hoc: 22(25)
Domain-Spec: 5(4)
iCLEF: 0(3)
QA@CLEF: 28(37)




ImageCLEF: 35(25)
CL-SR: 8(6)
WebCLEF: 4(8)
GeoCLEF: 13(17)
CLEF 2007 Workshop, Budapest, Hungary
19-21 September 2007
CLEF 2000 – 2007
Tracks
CLEF 2000-2007 Tracks
AdHoc
40
Participating Groups
35
DomSpec
30
iCLEF
25
CL-SR
20
15
QA@CLEF
10
ImageCLEF
5
WebClef
0
2000
2001
2002
2003
2004
2005
2006
Years
CLEF 2007 Workshop, Budapest, Hungary
19-21 September 2007
2007
GeoClef
CLEF 2007:
Test Collections
2000
 News documents in 4 languages
 GIRT German Social science database
2007
 CLEF multilingual comparable corpus of more than 3M news docs in 13
languages: CZ,DE,EN,ES,FI,FR,IT,NL,RU,SV,PT,BG and HU
 GIRT-4 social science database in EN and DE, Russian ISISS collection;
Cambridge Sociological Abstracts
 Malach collection of conversational speech derived from the Shoah
archives EN & CZ
 EuroGOV, a multilingual collection of approx 3M webpages crawled from
European governmental sites
 IAPR TC-12 photo database; PASCAL VOC 2006 training data
 ImageCLEFmed radiological database consisting of 6 distinct datasets;
 IRMA collection in EN and DE for automatic medical image
annotation:10,000 images
CLEF 2007 Workshop, Budapest, Hungary
19-21 September 2007
CLEF 2007: Highlights
 Slight fall in participation 81 groups in 2007 (90 in 2006); workshop
>115 Participants (130 in 2006)
 Expansion of test-suites
 Ad Hoc – mixed results – but good success of the non-European topic
languages task
 Domain-specific holds its own!
 Enormous success of ImageCLEF
 Confirmation of interest in QA@CLEF, GeoCLEF and CL-SR
 iCLEF -<didn’t happen
 WebCLEF – what happened???
 CLEF 2006 Proceedings ???
CLEF 2007 Workshop, Budapest, Hungary
19-21 September 2007
CLEF 2007: Highlights
 Slight fall in participation 81 groups in 2007 (90 in 2006); workshop
>115 Participants (130 in 2006)
 Expansion of test-suites
 Ad Hoc – mixed results – but good success of the non-European topic
languages task
 Domain-specific holds its own!
 Enormous success of ImageCLEF
 Confirmation of interest in QA@CLEF, GeoCLEF and CL-SR
 iCLEF -<didn’t happen
 WebCLEF – what happened???
 CLEF 2006 Proceedings – DID HAPPEN – A Miracle?
CLEF 2007 Workshop, Budapest, Hungary
19-21 September 2007
CLEF 2006 Proceedings
Evaluation of Multilingual and Multi-modal
Information Retrieval
7th Workshop of the Cross-Language Evaluation
Forum, CLEF 2006, Alicante, Spain, September,
2006, Revised Selected Papers
Lecture Notes in Computer Science, Vol. 4730
Peters, C.; Clough, P., Gey, F.C.; Karlgren, J.;
Magnini, B.; Oard, D.W.; de Rijke, M.: Stempfhuber
(Eds.) 2006
CLEF 2007 Workshop, Budapest, Hungary
19-21 September 2007
2006: Points for Discussion
 What new tasks/evaluation methodologies are
needed to address more advanced information
requirements?
 How can we best reduce the gap between research
and application communities?
 Who are the users?
Does CLEF have a future?
The challenge represented by i2010
CLEF 2007 Workshop, Budapest, Hungary
19-21 September 2007
Treble-CLEF
The CLEF research results have led to development of a
new generation of multilingual retrieval system prototypes
BUT lack of technology transfer
Treble-CLEF will extend the CLEF activity by:




continuing to promote MLIA R&D via evaluation campaigns;
providing a consistent training activity: tutorials, workshops,
summer school;
producing best practice guidelines for system implementation;
providing resources to encourage the multilingual system
development.
Treble-CLEF will begin activity with a brainstorming
workshop in January 2008
CLEF 2007 Workshop, Budapest, Hungary
19-21 September 2007
Fly UP