Cortext platform
At Cortext, our goal is to empower researchers in the social sciences and humanities by promoting advanced qualitative-quantitative mixed methods. Our primary focus is on studies about the dynamics of science, technology and innovation, and about the roles of knowledge and expertise in societies.
We understand the move towards digital humanities and computational methods not as addressing a technological gap for the social sciences, but rather as entailing entirely new assemblages between its disciplines and those of modern statistics and computer sciences. We work to tackle ever more complex research problems and deal with the profusion of new and diverse sources of information without losing sight of the situatedness and reflexivity required of studies of human societies.
Cortext is hosted by the LISIS research unit at Gustave Eiffel University, and was launched by French institutes IFRIS and INRAE, receiving their continued support.
Cortext Manager
Cortext Manager is our current main attraction, a publicly available web service providing data analysis methods curated and developed by our team of researchers and engineers.
You upload a textual corpus in order to analyse its discourse, names, categories, citations, places, dates etc, with methods for science/controversy/issue mapping, distant reading, document clustering, geo-spatial and network visualizations, and more.
You can jump straight to Cortext Manager and create an account, but we strongly suggest taking a look at the Documentation and Tutorials as you start your journey.
Latest journal articles employing our instruments
Miscellaneous
2020
Vanderfeesten, Maurice; Spielberg, Eike; Hasse, Linda
Text Analyses of Survey Data on "Mapping Research Output to the Sustainable Development Goals(SDGs)" Miscellaneous
2020, (Sustainable Development Goals SDG Classification model Search Queries SCOPUS Text indexingControlled vocabulary).
@misc{Vanderfeesten2020,
title = {Text Analyses of Survey Data on "Mapping Research Output to the Sustainable Development Goals(SDGs)"},
author = {Maurice Vanderfeesten and Eike Spielberg and Linda Hasse},
url = {https://doi.org/10.5281/zenodo.3832090},
doi = {10.5281/zenodo.3832090},
year = {2020},
date = {2020-05-01},
abstract = {This package contains data on five text analysis types (term extraction, contract analysis, topic modeling, network mapping), based on the survey data where researchers selected research output that are related to the 17 Sustainable Development Goals (SDGs). This is used as input to improve the current SDG classification model v4.0 to v5.0
Sustainable Development Goals are the 17 global challenges set by the United Nations. Within each of the goals specific targets and indicators are mentioned to monitor the progress of reaching those goals by 2030. In an effort to capture how research is contributing to move the needle on those challenges, we earlier have made an initial classification model than enables to quickly identify what research output is related to what SDG. (This Aurora SDG dashboard is the initial outcome as proof of practice.)
The initiative started from the Aurora Universities Network in 2017, in the working group "Societal Impact and Relevance of Research", to investigate and to make visible 1. what research is done that are relevant to topics or challenges that live in society (for the proof of practice this has been scoped down to the SDGs), and 2. what the effect or impact is of implementing those research outcomes to those societal challenges (this also have been scoped down to research output being cited in policy documents from national and local governments an NGO's).},
note = {Sustainable Development Goals SDG Classification model Search Queries SCOPUS Text indexingControlled vocabulary},
keywords = {},
pubstate = {published},
tppubtype = {misc}
}
Sustainable Development Goals are the 17 global challenges set by the United Nations. Within each of the goals specific targets and indicators are mentioned to monitor the progress of reaching those goals by 2030. In an effort to capture how research is contributing to move the needle on those challenges, we earlier have made an initial classification model than enables to quickly identify what research output is related to what SDG. (This Aurora SDG dashboard is the initial outcome as proof of practice.)
The initiative started from the Aurora Universities Network in 2017, in the working group "Societal Impact and Relevance of Research", to investigate and to make visible 1. what research is done that are relevant to topics or challenges that live in society (for the proof of practice this has been scoped down to the SDGs), and 2. what the effect or impact is of implementing those research outcomes to those societal challenges (this also have been scoped down to research output being cited in policy documents from national and local governments an NGO's).
Online
2020
Laurens, Patricia; Schoen, Antoine; Larédo, Philippe
2020.
@online{nokey,
title = {Policy Brief, Issue 6/International patents: the role of large multinational firms in building competitive metropolitan areas},
author = {Patricia Laurens and Antoine Schoen and Philippe Larédo},
url = {https://zenodo.org/records/4301797},
doi = {/10.5281/zenodo.4301796},
year = {2020},
date = {2020-12-02},
abstract = {Large multinational firms (LMF) play a crucial role in the dynamics of knowledge production worldwide.
The study conducted by Université Eiffel, using inventive activities as a central marker, highlights in particular four major results:
(i) Large groups represent 80% of worldwide international inventive activities and, contrary to many expectations, this role has increased over the last decade.
(ii) Though LMF are present in 60% of inventive metropolitan areas, the top 100 metropolitan areas worldwide concentrate 80% of LMF international patents.
(iii) Large metropolitan areas gather 90% of international patents in Asia, 70% in the US, but only 37% in Europe. Europe has thus a very different structure where inventive activities are more distributed with a central role of medium-size metropolitan areas.
(iv) ‘National’ LMF play a majority role in the overall production of metropolitan areas: over 90% in Asia, and 75% in the US. In Europe, this share is only 57%. This highlights the role of LMF from other European countries (23%) and from outside of Europe (20%).
These four results question research and innovation policies and call for an open debate about their policy-mix and their role in distributive and inclusion objectives.
The study has been conducted, using in an integrated way the three major resources developed within RISIS: CIB dataset on large firms, RISIS patent database on patents, and CORTEXT geolocation on metropolitan areas. },
keywords = {},
pubstate = {published},
tppubtype = {online}
}
The study conducted by Université Eiffel, using inventive activities as a central marker, highlights in particular four major results:
(i) Large groups represent 80% of worldwide international inventive activities and, contrary to many expectations, this role has increased over the last decade.
(ii) Though LMF are present in 60% of inventive metropolitan areas, the top 100 metropolitan areas worldwide concentrate 80% of LMF international patents.
(iii) Large metropolitan areas gather 90% of international patents in Asia, 70% in the US, but only 37% in Europe. Europe has thus a very different structure where inventive activities are more distributed with a central role of medium-size metropolitan areas.
(iv) ‘National’ LMF play a majority role in the overall production of metropolitan areas: over 90% in Asia, and 75% in the US. In Europe, this share is only 57%. This highlights the role of LMF from other European countries (23%) and from outside of Europe (20%).
These four results question research and innovation policies and call for an open debate about their policy-mix and their role in distributive and inclusion objectives.
The study has been conducted, using in an integrated way the three major resources developed within RISIS: CIB dataset on large firms, RISIS patent database on patents, and CORTEXT geolocation on metropolitan areas.
Baciu, Robert; Bersezio, Ludovic; Béchet, Nathalie; Boboc, Roxana Varvara; (Doris), Yujie Dong; Stirum, Roline Van Limburg; Macpherson, Ava; Oettle, Josephine; Yedema, Emma
(DMI), The Digital Methods Initiative (Ed.): 2020, visited: 30.01.2020.
@online{Baciu2020,
title = {Who is /ourguy/?: Studying political Internet subcultures through their identification with public figures},
author = {Robert Baciu and Ludovic Bersezio and Nathalie Béchet and Roxana Varvara Boboc and Yujie Dong (Doris) and Roline Van Limburg Stirum and Ava Macpherson and Josephine Oettle and Emma Yedema},
editor = {The Digital Methods Initiative (DMI)},
url = {https://digitalmethods.net/Dmi/WinterSchool2020OurguyReddit},
year = {2020},
date = {2020-01-30},
urldate = {2020-01-30},
abstract = {The primary outcome of our research has been an approach to analysing the renegotiation and contestation of public figures, or /ourguy/, through the use of a 5 step protocol. The second aim was to showcase our model’s potential by doing a case study on political subreddits and looking at which public figures their communities discuss, as well as how they do so. The protocol allows for cross-subreddit and single-subreddit analysis and can easily be replicated for the study of other subreddits. To compile the protocol, we repurposed subreddit metrics to create a toolkit that allows us to characterize web communities. First the appropriate subreddits are selected based on a set of criteria, next we extract the most mentioned names in those subreddits using natural language processing. Then we make a selection of the relevant public figures among these names. After this we employ a twofold method of using contrast analysis and network mapping the subreddits. Lastly we generate and analyse word trees for a deeper understanding of how negotiations surrounding these public figures happen. This protocol is a useful framework for future subreddit analysis and shows that meaningful social research can still be conducted post-API.
},
keywords = {},
pubstate = {published},
tppubtype = {online}
}
PhD Theses
2020
Gray, Daniel
Tweeting About Women: A Critical Discourse Analysis of International Women’s Day on Twitter PhD Thesis
School of Social Sciences, 2020.
@phdthesis{Gray2020,
title = {Tweeting About Women: A Critical Discourse Analysis of International Women’s Day on Twitter},
author = {Daniel Gray},
url = {https://orca.cardiff.ac.uk/id/eprint/137810/
https://orca.cardiff.ac.uk/id/eprint/137810/1/Thesis%20Daniel%20Gray%20Corrected%201-11-2020%282%29.pdf},
year = {2020},
date = {2020-11-01},
urldate = {2020-11-01},
address = {Cardiff University, Cardiff, CF10 3AT},
school = {School of Social Sciences},
abstract = {This thesis is a work of critical digital sociology, investigating discourse which occurred on International Women’s Day 2017 (IWD2017) on Twitter, a widely used social media network, using innovative methodology. The principle finding presented in this thesis is methodological. I demonstrate that it is possible and productive to bring together qualitative analysis and so-called ‘big data’, specifically a large quantity of tweets, via innovative and original methodology, while preserving the unique and valuable affordances of critical, qualitative, theory-informed analysis.
Alongside demonstrating this, I also present a range of analytic findings related to the discourse I have analysed. The analytic findings include the use of popular and ‘fringe’ hashtags in linking mainstream and right-wing/reactionary topics, the prominence of anti- feminism and anti-Islam sentiment in discourse associated with supporters of US president Donald Trump, the antifeminist discursive splitting of feminism and feminists into benign and maligned categories, and the ways women are constructed by Twitter accounts representing police and armed forces.
Methodologically, this thesis provides a detailed account of the practicalities, challenges and strategies involved in approaching big social media data as a critical researcher using qualitative analysis. In doing so I argue that big social media data may be a fruitful area for qualitative work, but that in approaching it we should not discard our previous theoretical, analytical and ethical frameworks.},
keywords = {},
pubstate = {published},
tppubtype = {phdthesis}
}
Alongside demonstrating this, I also present a range of analytic findings related to the discourse I have analysed. The analytic findings include the use of popular and ‘fringe’ hashtags in linking mainstream and right-wing/reactionary topics, the prominence of anti- feminism and anti-Islam sentiment in discourse associated with supporters of US president Donald Trump, the antifeminist discursive splitting of feminism and feminists into benign and maligned categories, and the ways women are constructed by Twitter accounts representing police and armed forces.
Methodologically, this thesis provides a detailed account of the practicalities, challenges and strategies involved in approaching big social media data as a critical researcher using qualitative analysis. In doing so I argue that big social media data may be a fruitful area for qualitative work, but that in approaching it we should not discard our previous theoretical, analytical and ethical frameworks.
NotesVIEW ALL
-
Long trends on twitter: intertemporal clusters combining hashtags and terms on Scientometrics, Altmetrics, Bibliometrics and Science Of Science
Long trends on twitter: inter-temporal clusters combining hashtags and terms, for all tweets on Scientometrics, Altmetrics, Bibliometrics and Science Of Science from Jan. 2017 to dec. 2021, on a semester base. Query used to extract tweets: lang:en (Scientometrics OR “ScienceOfScience” OR “Science Of Science” OR “Altmetrics” OR “altmetric” OR “bibliometrics” OR “bibliometric” OR “citation metrics” […]
-
Présenter CorTexT Manager en 2 minutes
Cortext Manager est une application web construite par des chercheurs et par des ingénieurs à destination de chercheurs en sciences humaines et sociales, au plus près des questions portées par les chercheurs qui nous entourent et par notre communauté d’utilisateurs. Cette application web peut produire un grand nombre d’analyses différentes qui ont trait aux champs […]
-
Analysis of the scientific production that mentioned the use of CorText Manager
There are two ways to understand what CorTexT Manager is. The first one is to look at what has been achieved in terms of methods, tools and therefore lines of code. The second one is studied below, by analyzing (here with CorTexT Manager) what academic users have published using… CorTexT Manager. Our study of the […]
-
10 years of CorText Manager v2
It took us more than 10 years to come with CorText Manager version 2 as it is now! Behind the scenes CorText Manager begun with a first version in 2009. More than thirty contributors has worked directly or indirectly on the two versions, year after year. All the ideas, inspirations, all this accumulation of pieces […]
-
RISIS Training: Thematic and spatial analysis of technologies using CorText Manager and RISIS patent database
One of the best CorText Manager training courses was organized and offered by the RISIS project. Here is the program of this training which lasted 3 days: Monday 08/11/21 14h-16h30: Session 1 Session 1a: Introduction on patent analysis (60’) Introductory lecture session • Welcoming introduction (Philippe Larédo) 5’ • Type of patents documents (Antoine Schoen) […]
-
Early 2021 CorText Manager training sessions
CorText organized a series of training workshops on CorText Manager and its methods in January 2021! These workshops were imagined as a staircase with three successive steps : Session 1: Introduction Session 2: Method comparisons Session 3: Research questions and work on user’s corpus For these sessions, the subject chosen for the demonstrations and exercises […]
-
Seminar and workshop during the Summer School of PPGCI IBICT UFRJ, Rio de Janeiro – 03/2020
In March 2020, the LabEx SITES post-doctoral researcher, Ale Abdo, traveled to Rio de Janeiro and São Paulo to organize two trainings on textual analysis and on a new method he developed and integrated at the CorText Infrastructure, as well as to participate in discussions on open and citizen science in Brazil, including the discussion […]
-
A CorText Manager distance training session in the framework of the nanocellulose project – Grenoble, June 2020
For complementing the RISIS access requested (to Leiden publications DB and RISIS patent DB) by the GAEL laboratory (UMR INRAE, CNRS, UGA, INPG), in the framework of a research project on nanocellulose, the CorText team has provided , in June and July 2020, an advanced training on the use of CorText. After setting up of […]
CorText Newsfeed
Want to stay up-to-date with the latest training sessions and developments in our methods and data? We invite you to subscribe to Cortext Newsfeed, our succint and researcher oriented quarterly newsletter.
Read the previous editions of our newsletter