• No results found

Topic identification challenge

N/A
N/A
Protected

Academic year: 2021

Share "Topic identification challenge"

Copied!
2
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

Topic identification challenge

Kevin Boyack1Wolfgang Gla¨nzel2Jochen Gla¨ser3 Frank Havemann4Andrea Scharnhorst5Bart Thijs2 Nees Jan van Eck6Theresa Velden3,7Ludo Waltmann6

Received: 6 June 2016 / Published online: 15 March 2017 Ó Akade´miai Kiado´, Budapest, Hungary 2017

Over the last two years, a group of researchers used a shared dataset in order to compare their approaches to the identification of thematic structures in a set of 111,616 papers on astronomy and astrophysics published in 59 journals between 2003 and 2010. The out- comes of this comparative exercise are published in a special issue of Scientometrics

& Jochen Gla¨ser

Jochen.Glaser@ztg.tu-berlin.de Kevin Boyack

kboyack@mapofscience.com Wolfgang Gla¨nzel

wolfgang.glanzel@kuleuven.be Frank Havemann

frank.havemann@ib.hu-berlin.de Andrea Scharnhorst

andrea.scharnhorst@dans.knaw.nl Nees Jan van Eck

ecknjpvan@cwts.leidenuniv.nl Theresa Velden

velden@ztg.tu-berlin.de Ludo Waltmann

waltmanlr@cwts.leidenuniv.nl

1 SciTech Strategies, Inc., Albuquerque, NM 87122, USA

2 ECOOM and Department of MSI, KU Leuven, Louvain, Belgium

3 ZTG, TU Berlin, HBS1, Hardenbergstr. 16-18, 10623 Berlin, Germany

4 Institut fu¨r Bibliotheks-und Informationswissenschaft, Humboldt-Universita¨t zu Berlin, Dorotheenstr. 26, 10099 Berlin, Germany

5 DANS-KNAW, Anna van Saksenlaan 51, The Hague, The Netherlands

6 Centre for Science and Technology Studies, Leiden University, Leiden, The Netherlands

7 University of Michigan, School of Information, Ann Arbor, MI 48109, USA

123

Scientometrics (2017) 111:1223–1224 DOI 10.1007/s11192-017-2307-0

(2)

(Gla¨ser et al. 2017). Now that Clarivate Analytics kindly agreed to make this dataset available to interested researchers in the bibliometrics community, we suggest to extend this comparative approach.

We challenge you to participate in the comparative topic identification exercise.

The challenge is not to develop the best partitioning of the dataset. We believe this to be impossible because there is not one single best solution. Instead, we challenge you to gain as much information as possible about your own approach and the reasons why it produced a particular solution, and how it compares to solutions produced by other approaches. We challenge you to comparatively discuss advantages and disadvantages of approaches to topic identification and thus to contribute to a cumulative body of knowledge on the suitability of data models and algorithms for the identification of topics.

Thanks to Clarivate Analytics, we are able to offer access to the dataset with an efficient license agreement.

While participating in the ‘‘Web of Science comparative topic identification exer- cise,’’ you will be provided with access to the Clarivate Analytics ‘‘Web of Science comparative topic identification exercise’’ dataset. You may access and use this dataset from March 1, 2017 through December 31, 2018 only for the exercise above, subject to the ‘‘Clarivate Analytics Terms’’, including the ‘‘Web of Science: Custom Data Set Product Terms’’ in the ‘‘Product/Service Terms’’, available on our Terms of Business site:http://clarivate.com/tob/. By accessing and/or using our data, you are legally bound by and hereby consent to these terms. If you do not agree to these terms, then you may not access or use our data. Any extension or further use of our data beyond December 31, 2018 is strictly prohibited unless you receive prior written permission from Clarivate Analytics.

The dataset can be obtained by sending an email to jason.rollins@thomsonreuters.com.

We will also offer access to a website where solutions can be deposited and downloaded for comparisons. The website, which also offers some tools for a comparative analysis of solutions and individual clusters, iswww.topic-challenge.info.

If there are enough participants, we will run sessions on the comparative exercise at the next ISSI conferences and dedicated workshops.

We hope that many of you will take up the challenge and thus contribute to cumulative progress in bibliometrics.

Reference

Gla¨ser, J., Gla¨nzel, W., & Scharnhorst, A. (2017). Same data—Different results? Towards a comparative approach to the identification of thematic structures in science. In J. Gla¨ser, A. Scharnhorst & W.

Gla¨nzel (Eds.), Same data—Different results? Towards a comparative approach to the identification of thematic structures in science. Special Issue of Scientometrics. doi:10.1007/s11192-017-2296-z.

1224 Scientometrics (2017) 111:1223–1224

123

Referenties

GERELATEERDE DOCUMENTEN

Note also that (since v0.991) this is a 1-parameter macro so doesn’t expand subsequent tokens until meets a ⟨balanced text⟩ but just takes first single token or ⟨text⟩..

From September 1992 to July 1993, the scheme brought together scholars with an intimate knowledge of the history of different countries (the Netherlands, Belgium, France,

1 Summary notification form relating to a draft decision of the commission of the Inde pendent Post and Telecommunications Authority (OPTA) in the Netherlands with respect to

• You may use results proved in the lecture or in the exercises, unless this makes the question trivial.. When doing so, clearly state the results that

• You may use results proved in the lecture or in the exercises, unless this makes the question trivial.. When doing so, clearly state the results that

• Notation: For any set S and any field k, we write khSi for the k-vector space of formal finite k-linear combinations of elements of S.. (18

Origins, Journeys And Returns: Social Justice In International Higher Education, 89-114. Retrieved

Uit de beschrijving van Simac en het Business Development proces komt naar voren dat een afdeling, of processen die samenhangen met, Competitive Intelligence een belangrijke