Each participating group should contact the organisers to obtain the collection used in the task, and must fill in the required forms as per the dataset instructions. Once the dataset has been downloaded, the participating team
can index the content in whichever form it prefers. There is no limitation on the type of data processing, enrichment or indexing process employed. Some participants may choose to index the provided metadata in a conventional
inverted index, while others may choose to enhance the provided metadata using automated or semi-automated means and then index the data according to their preference.
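As an illustration of the first option, the following is a minimal sketch of a conventional inverted index in Python. The metadata layout (a mapping from image ID to a free-text description) and the image IDs are hypothetical and are not the format of the provided collection; participants are free to use any indexing approach.

```python
from collections import defaultdict


def build_inverted_index(metadata):
    """Map each lower-cased term to the set of image IDs whose text contains it.

    `metadata` is a hypothetical dict of image ID -> free-text description.
    """
    index = defaultdict(set)
    for image_id, text in metadata.items():
        for term in text.lower().split():
            index[term].add(image_id)
    return index


def search(index, query):
    """Return the image IDs that contain every term of the query (boolean AND)."""
    terms = query.lower().split()
    if not terms:
        return set()
    postings = [index.get(term, set()) for term in terms]
    return set.intersection(*postings)


# Made-up metadata, for illustration only.
example_metadata = {
    "img_a": "office desk apple",
    "img_b": "kitchen apple",
}
example_index = build_inverted_index(example_metadata)
```

A query such as `search(example_index, "apple desk")` returns only the images whose text contains both terms.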
For every topic, each image considered relevant should have one line in the CSV file. For some topics there will be only one relevant item (one line in the submission), for others there will be many relevant items (many lines in the
submission), up to a maximum of 100. If no relevant items are found for a topic, the file should contain no entry for that topic.
The format of the CSV file for an automatic run would be as follows:
GROUP-ID, RUN-ID, TOPIC-ID, IMAGE-ID, SECONDS-ELAPSED, SCORE
...
DCU, DCULSAT01, 16001, u1_2016-08-15_112559, 0, 1.0
DCU, DCULSAT01, 16001, u1_2016-08-15_120354, 0, 1.0
...
The format of the CSV file for an interactive run would be as follows:
GROUP-ID, RUN-ID, TOPIC-ID, IMAGE-ID, SECONDS-ELAPSED, SCORE
...
DCU, DCULSAT01, 16001, u1_2016-08-15_112559, 33, 1.0
DCU, DCULSAT01, 16001, u1_2016-08-15_120354, 54, 1.0
DCU, DCULSAT01, 16001, u1_2016-08-15_120412, 243, 1.0
...
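The submission lines shown above could be produced with a short Python sketch such as the one below. The function name and the shape of the `results` argument are assumptions made for illustration. The sketch also assumes that no header row is written, since the task description does not say whether one is expected. Topics with no relevant items produce no lines, and each topic is capped at 100 lines.

```python
MAX_ITEMS_PER_TOPIC = 100  # the task allows up to 100 lines per topic


def format_submission_lines(group_id, run_id, results):
    """Build submission lines in the six-column format.

    `results` (hypothetical structure) maps a topic ID to a ranked list of
    (image_id, seconds_elapsed, score) tuples. For an automatic run,
    seconds_elapsed is 0, as in the example above.
    """
    lines = []
    for topic_id, items in results.items():
        # An empty list yields no lines, so the topic is absent from the file.
        for image_id, seconds, score in items[:MAX_ITEMS_PER_TOPIC]:
            fields = [group_id, run_id, str(topic_id), image_id,
                      str(seconds), str(score)]
            lines.append(", ".join(fields))
    return lines


def write_submission(path, group_id, run_id, results):
    """Write the formatted lines to a file, one line per relevant image."""
    with open(path, "w", encoding="utf-8") as handle:
        for line in format_submission_lines(group_id, run_id, results):
            handle.write(line + "\n")
```

For example, calling `format_submission_lines("DCU", "DCULSAT01", {16001: [("u1_2016-08-15_112559", 33, 1.0)]})` yields a single line in the same form as the interactive example.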
In total there are 41 topics for this lifelog LSAT task. They are available now for download. There are two types of topics, adhoc and knownitem:
- ADHOC - topics that may have many moments in the collection that are relevant. These topics are all new.
- KNOWNITEM - topics with one (or few) relevant moments in the collection.
The format of the topic is as follows:
- ID - A unique identifier of every topic
- Type - identifying each topic as being either adhoc, or knownitem.
- UID - a user identifier. Always u1 (user 1) for this collection.
- Title - a title of the query used for identification
- Description - a descriptive query that would represent the information need of the user
- Narrative - additional details about the information need that help to define what is correct and incorrect.
A sample topic would therefore be as follows:
<topic>
<id>16000</id>
<type>knownitem</type>
<uid>u1</uid>
<title>Eating fruit</title>
<description>Find the time when I was eating both mandarins
and apples at my work desk.</description>
<narrative>The lifelogger was at work and eating mandarin oranges and apples at his desk. Any moment showing such activities are considered relevant. Eating both
fruits in a location other than the lifelogger's desk is not considered to be relevant. </narrative>
</topic>
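A topic in this format can be read with Python's standard `xml.etree.ElementTree` module. The sketch below is one possible approach, not an official parser: the `Topic` class and `parse_topics` function are hypothetical names, and the sketch assumes the input is either a single `<topic>` element or a well-formed document whose root element wraps several `<topic>` elements. Whitespace inside each field is collapsed, because the description and narrative may span several lines.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass

FIELD_NAMES = ("id", "type", "uid", "title", "description", "narrative")


@dataclass
class Topic:
    id: str
    type: str          # "adhoc" or "knownitem"
    uid: str           # always "u1" for this collection
    title: str
    description: str
    narrative: str


def parse_topics(xml_text):
    """Parse every <topic> element found in the given XML text."""
    root = ET.fromstring(xml_text)
    topics = []
    # iter() also yields the root itself when the root is a <topic> element.
    for element in root.iter("topic"):
        fields = {
            # Collapse line breaks and repeated spaces; missing fields become "".
            name: " ".join((element.findtext(name) or "").split())
            for name in FIELD_NAMES
        }
        topics.append(Topic(**fields))
    return topics
```

Applied to the sample topic above, `parse_topics` returns one `Topic` whose `type` is `knownitem` and whose `title` is `Eating fruit`.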