CorTexT platform
The CorTexT Platform is the digital platform of LISIS Unit and a project launched and sustained by IFRIS and INRAE.
This platform aims at empowering open research and studies in humanities about the dynamic of science, technology, innovation and knowledge production.
After the emergence and generalization of network analysis in many disciplines and its valorization in on-line bibliometric tools and then in many social media infrastructure of digital business, quali-quantitative of data bases and the booming of @datas is creating a new space for research. Designing and engineering solution for the analysis and the visualization of datasets represents a scientific and technological challenge. Moreover, the next step towards digital humanities does not address only a technological gap for social sciences; it also means the development of epistemic bargain between disciplines of social sciences, artificial intelligence and computing sciences since the complexity of research problem is increasing in relation to the profusion of new data.
Latest newsVIEW ALL
Early 2021 CorText Manager training sessions
CorText organized a series of training workshops on CorText Manager and its methods in January 2021! These workshops were imagined as a staircase with three successive steps : Session 1: Introduction Session 2: Method comparisons Session 3: Research questions and work on user’s corpus For these sessions, the subject chosen for the demonstrations and exercises […]
Seminar and workshop during the Summer School of PPGCI IBICT UFRJ, Rio de Janeiro – 03/2020
In March 2020, the LabEx SITES post-doctoral researcher, Ale Abdo, traveled to Rio de Janeiro and São Paulo to organize two trainings on textual analysis and on a new method he developed and integrated at the CorText Infrastructure, as well as to participate in discussions on open and citizen science in Brazil, including the discussion […]
A CorText Manager distance training session in the framework of the nanocellulose project – Grenoble, June 2020
For complementing the RISIS access requested (to Leiden publications DB and RISIS patent DB) by the GAEL laboratory (UMR INRAE, CNRS, UGA, INPG), in the framework of a research project on nanocellulose, the CorText team has provided , in June and July 2020, an advanced training on the use of CorText. After setting up of […]
Covid-19: Pandemic and online social movements
A Covid-19 Data Sprint was organized by the D2SN Master of UGE. On June 30th, 2020 was presented an analysis of how, during the lockdown, people continued to express their dissatisfaction through online social movements. This analysis is based on the study of Twitter hashtags during this period. The study focuses on the evolution of […]
A digital enquiry of the agroecological turn in Costa Rica
This project has been developed by Bertha Brenes in LISIS laboratory with Nicola Ricci and Marc Barbier. The objective of the project is to drive a digital enquiry of the agroecological turn in Costa Rica, more largely in Central America through the setup of consistent and appropriate datasets in order to analyze the production and […]
CorTexT introductory course in México – 16th October 2019
On Wednesday 16th October, will be held at the Universidad Autónoma Metropolitana – Azcapotzalco the workshop : ‘Methods for digital humanities. Introduction to the automated text analysis with CorTexT platform‘ This meeting will be held on the request of a group of interested researchers with the aim to explore potential uses of CorTexT platform in […]
“CORTEXT MANAGER” Training – 15th to 17th April 2019
CorText : La Plateforme Digitale du LISIS Formation‐Atelier aux usages du CorTexT Manager Le 15-16-17 Avril 2019 – Formation “CORTEXT MANAGER“ Formation ouverte à tous les membres de l’IFRIS Contact: Marc.barbier@inra.fr Inscription : lynda.silva@u-pem.fr Adresse : LISIS & IFRIS – Bâtiment A. Camus, 2 allée Jean Renoir, Noisy-le-Grand Nous vous attendons le Lundi 15 Avril 2019 à partir […]
Introduction to Pytheas
In this article we will present what is Pytheas and how you can access it. Available here : https://pytheas.cortext.net
Latest scientific works using CorText Manager

2020 |
Journal Articles |
Ubando, Aristotle T; Rosario, Aaron Jules Del R; Chen, Wei-Hsin; Culaba, Alvin B A state-of-the-art review of biowaste biorefinery Journal Article Environmental Pollution, 2020. @article{Ubando2020b, title = {A state-of-the-art review of biowaste biorefinery}, author = {Aristotle T. Ubando and Aaron Jules R. Del Rosario and Wei-Hsin Chen and Alvin B. Culaba}, url = {https://www-sciencedirect-com.inshs.bib.cnrs.fr/science/article/pii/S026974912036838X}, doi = {https://doi.org/10.1016/j.envpol.2020.116149}, year = {2020}, date = {2020-11-20}, journal = {Environmental Pollution}, abstract = {Biorefineries provide a platform for different industries to produce multiple bio-products enhancing the economic value of the system. The production of these biorefineries has led to an increase in the gen- eration of biowaste. To minimize the risk of environmental pollution, numerous studies have focused on a variety of strategies to mitigate these concerns reflected in the vast amount of literature written on this topic. This paper aims to systematically analyze and review the enormous body of scientific literature in the biowaste and biorefinery field for establishing an understanding and providing a direction for future works. A bibliometric analysis is first performed using the CorTexT Manager platform on a corpus of 1488 articles written on the topic of biowaste. Popular and emerging topics are determined using a terms extraction algorithm. A contingency matrix is then created to study the correlation of scientific journals and key topics from this field. Then, the connection and evolution of these terms were analyzed using network mapping, to determine relationships among key terms and analyze notable trends in this research field. Finally, a critical review of articles was presented across three main categories of biowaste management such as mitigation, sustainable utilization, and cleaner disposal from the perspective of the biorefinery concept. Operational and technological challenges are identified for the integration of anaerobic digestion in biorefineries, especially in developing nations. Moreover, logistical challenges in the biorefinery supply-chain are established based on the economics and collection aspect of handling biowaste.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Biorefineries provide a platform for different industries to produce multiple bio-products enhancing the economic value of the system. The production of these biorefineries has led to an increase in the gen- eration of biowaste. To minimize the risk of environmental pollution, numerous studies have focused on a variety of strategies to mitigate these concerns reflected in the vast amount of literature written on this topic. This paper aims to systematically analyze and review the enormous body of scientific literature in the biowaste and biorefinery field for establishing an understanding and providing a direction for future works. A bibliometric analysis is first performed using the CorTexT Manager platform on a corpus of 1488 articles written on the topic of biowaste. Popular and emerging topics are determined using a terms extraction algorithm. A contingency matrix is then created to study the correlation of scientific journals and key topics from this field. Then, the connection and evolution of these terms were analyzed using network mapping, to determine relationships among key terms and analyze notable trends in this research field. Finally, a critical review of articles was presented across three main categories of biowaste management such as mitigation, sustainable utilization, and cleaner disposal from the perspective of the biorefinery concept. Operational and technological challenges are identified for the integration of anaerobic digestion in biorefineries, especially in developing nations. Moreover, logistical challenges in the biorefinery supply-chain are established based on the economics and collection aspect of handling biowaste. |
Bai, Yang; Li, Hongxiu; Liu, Yong Visualizing research trends and research theme evolution in E‐learning field: 1999–2018 Journal Article Scientometrics, 2020. @article{Bai2020, title = {Visualizing research trends and research theme evolution in E‐learning field: 1999–2018}, author = {Yang Bai and Hongxiu Li and Yong Liu}, url = {https://link.springer.com/article/10.1007/s11192-020-03760-7}, doi = {https://doi.org/10.1007/s11192-020-03760-7}, year = {2020}, date = {2020-11-19}, journal = {Scientometrics}, abstract = {This paper aims to provide a comprehensive understanding of the evolution of major research themes and trends in e-learning research. A co-word analysis is applied for the analysis of the 21,656 keywords collected from 7214 articles published in 10 journals in the field of e-learning from the years 1999 to 2018. Specifically, a cluster analysis, social network analysis, strategic diagram, and graph theory were applied in the analysis for two time periods: 1999–2008 and 2009–2018. The study detects the bridging, popular, and core topics in e-learning research for the two periods. The research results indicate that e-learning research has undergone a health evolution over the past two decades. There is a temporal continuity of e-learning research because some research topics have kept their continuity over the studied 20 years. Meanwhile, the research traditions in the e-learning field are also continuously evolving with the development of new technologies. The results also offer useful hints on the future direction of how the field may evolve.}, keywords = {}, pubstate = {published}, tppubtype = {article} } This paper aims to provide a comprehensive understanding of the evolution of major research themes and trends in e-learning research. A co-word analysis is applied for the analysis of the 21,656 keywords collected from 7214 articles published in 10 journals in the field of e-learning from the years 1999 to 2018. Specifically, a cluster analysis, social network analysis, strategic diagram, and graph theory were applied in the analysis for two time periods: 1999–2008 and 2009–2018. The study detects the bridging, popular, and core topics in e-learning research for the two periods. The research results indicate that e-learning research has undergone a health evolution over the past two decades. There is a temporal continuity of e-learning research because some research topics have kept their continuity over the studied 20 years. Meanwhile, the research traditions in the e-learning field are also continuously evolving with the development of new technologies. The results also offer useful hints on the future direction of how the field may evolve. |
Gaulda, C; Micoulaud-Franchi, J -A Analyse en réseau par fouille de données textuelles systématique du concept de psychiatrie personnalisée et de précision Journal Article L'Encéphale, 2020, ISSN: 0013-7006. @article{Gaulda2020, title = {Analyse en réseau par fouille de données textuelles systématique du concept de psychiatrie personnalisée et de précision}, author = {C. Gaulda and J.-A. Micoulaud-Franchi}, url = {http://www.sciencedirect.com/science/article/pii/S0013700620302360}, doi = { https://doi.org/10.1016/j.encep.2020.08.008}, issn = {0013-7006}, year = {2020}, date = {2020-11-12}, journal = {L'Encéphale}, abstract = {Objectifs. – La médecine personnalisée et de précision nécessite une clarification des concepts qui y sont rattachés. À notre connaissance, il n’existe pas d’exploration systématique de la littérature portant sur les dimensions et les concepts de la psychiatrie personnalisée et de précision et sur leurs usages dans les domaines neuroscientifiques et génétiques. Cet article propose donc d’explorer les dimensions et les concepts de la psychiatrie personnalisée et de précision. Méthodes. – Une analyse en réseau par fouille de données textuelles systématique issue d’une revue exhaustive de la littérature internationale autour des termes de “precision psychiatry” et de “personalized psychiatry” a été réalisée. Cette fouille de données textuelles a été représentée sous forme d’un réseau permettant d’analyser les dimensions et les concepts de la psychiatrie personnalisée et de précision. Résultats. – La psychiatrie personnalisée et de précision renvoie à six dimensions retrouvées au sein de l’analyse du réseau textuel. Ces six dimensions correspondent aux domaines scientifiques qui étu- dient la psychiatrie personnalisée et de précision, à savoir : la génétique, la pharmacogénétique, les approches computationnelles, le raffinement des essais thérapeutiques, les biomarqueurs et la stadifica- tion. L’analyse des termes renvoie à un ensemble de concepts hétérogènes. Conclusions. – L’hétérogénéité retrouvée dans la littérature sur la psychiatrie personnalisée et de précision peut témoigner d’un manque d’un cadre théorique pluraliste et intégratif. Ce cadre de travail pourrait être basé sur un formalisme naturalisant mais non réducteur, conscient des enjeux sociétaux des sciences et de leur implémentation dans les dispositifs de recherche et cliniques de la psychiatrie. Objectives The current challenges of psychiatric nosology and semiology are part of an interdisciplinary and integrative framework. The paradigm of the personalized and precision psychiatry proposes to study this discipline according to new approaches and methodologies. Personalized and precision psychiatry therefore requires clarification of its concepts. To our knowledge, there is no systematic exploration of the literature on the application of the concepts of personalized and precision medicine in the field of psychiatry. This article proposes thus to explore the framework of personalized and precision medicine applied to psychiatry. Methods We explored the framework of personalized and precision medicine applied to psychiatry by a textual network analysis. Firstly, we performed a systematic text-mining (Natural Language Processing) from an exhaustive review of the international literature with the terms “precision psychiatry” and “personalized psychiatry”. Secondly, this analysis of textual data allowed us to build a textual network which made it possible to visualize the most proximal terms (the most frequently associated in the literature). Finally, we extracted from the network the main dimensions explored in the scientific literature, and we studied the relative importance of each term by analyzing the network centrality. In addition, a brief bibliometric analysis was conducted. Results We show that personalized and precision psychiatry refers to six dimensions found in the textual network analysis which correspond to the scientific fields which study personalized and precision psychiatry: genetics, pharmacogenetics, artificial intelligence, therapeutic trials, biomarkers and staging. We explore how each dimension relates to the mechanization of psychiatric disorders. However, precision and personalized psychiatry, which tries to refine the levels of mechanistic explanations for psychiatry, suffers from a conceptual heterogeneity. Indeed, textual analysis also allows us to find terms referring to a set of heterogeneous concepts. Many methodological fields and epistemological concepts are invoked in this literature, without standardization. Conclusions The paradox of personalized and precision psychiatry is to associate a strong conceptual heterogeneity with a well-defined mechanistic component. Heterogeneity found in literature on personalized and precision psychiatry testifies to the lack of a pluralist and integrative theoretical framework. This framework could be based on a naturalizing but non-reducing formalism, aware of the societal challenges of the sciences and their implementation in the research and clinical systems of psychiatry.}, keywords = {}, pubstate = {published}, tppubtype = {article} } Objectifs. – La médecine personnalisée et de précision nécessite une clarification des concepts qui y sont rattachés. À notre connaissance, il n’existe pas d’exploration systématique de la littérature portant sur les dimensions et les concepts de la psychiatrie personnalisée et de précision et sur leurs usages dans les domaines neuroscientifiques et génétiques. Cet article propose donc d’explorer les dimensions et les concepts de la psychiatrie personnalisée et de précision. Méthodes. – Une analyse en réseau par fouille de données textuelles systématique issue d’une revue exhaustive de la littérature internationale autour des termes de “precision psychiatry” et de “personalized psychiatry” a été réalisée. Cette fouille de données textuelles a été représentée sous forme d’un réseau permettant d’analyser les dimensions et les concepts de la psychiatrie personnalisée et de précision. Résultats. – La psychiatrie personnalisée et de précision renvoie à six dimensions retrouvées au sein de l’analyse du réseau textuel. Ces six dimensions correspondent aux domaines scientifiques qui étu- dient la psychiatrie personnalisée et de précision, à savoir : la génétique, la pharmacogénétique, les approches computationnelles, le raffinement des essais thérapeutiques, les biomarqueurs et la stadifica- tion. L’analyse des termes renvoie à un ensemble de concepts hétérogènes. Conclusions. – L’hétérogénéité retrouvée dans la littérature sur la psychiatrie personnalisée et de précision peut témoigner d’un manque d’un cadre théorique pluraliste et intégratif. Ce cadre de travail pourrait être basé sur un formalisme naturalisant mais non réducteur, conscient des enjeux sociétaux des sciences et de leur implémentation dans les dispositifs de recherche et cliniques de la psychiatrie. Objectives The current challenges of psychiatric nosology and semiology are part of an interdisciplinary and integrative framework. The paradigm of the personalized and precision psychiatry proposes to study this discipline according to new approaches and methodologies. Personalized and precision psychiatry therefore requires clarification of its concepts. To our knowledge, there is no systematic exploration of the literature on the application of the concepts of personalized and precision medicine in the field of psychiatry. This article proposes thus to explore the framework of personalized and precision medicine applied to psychiatry. Methods We explored the framework of personalized and precision medicine applied to psychiatry by a textual network analysis. Firstly, we performed a systematic text-mining (Natural Language Processing) from an exhaustive review of the international literature with the terms “precision psychiatry” and “personalized psychiatry”. Secondly, this analysis of textual data allowed us to build a textual network which made it possible to visualize the most proximal terms (the most frequently associated in the literature). Finally, we extracted from the network the main dimensions explored in the scientific literature, and we studied the relative importance of each term by analyzing the network centrality. In addition, a brief bibliometric analysis was conducted. Results We show that personalized and precision psychiatry refers to six dimensions found in the textual network analysis which correspond to the scientific fields which study personalized and precision psychiatry: genetics, pharmacogenetics, artificial intelligence, therapeutic trials, biomarkers and staging. We explore how each dimension relates to the mechanization of psychiatric disorders. However, precision and personalized psychiatry, which tries to refine the levels of mechanistic explanations for psychiatry, suffers from a conceptual heterogeneity. Indeed, textual analysis also allows us to find terms referring to a set of heterogeneous concepts. Many methodological fields and epistemological concepts are invoked in this literature, without standardization. Conclusions The paradox of personalized and precision psychiatry is to associate a strong conceptual heterogeneity with a well-defined mechanistic component. Heterogeneity found in literature on personalized and precision psychiatry testifies to the lack of a pluralist and integrative theoretical framework. This framework could be based on a naturalizing but non-reducing formalism, aware of the societal challenges of the sciences and their implementation in the research and clinical systems of psychiatry. |
Stefanija, Ana Pop; Pierson, Jo Practical AI Transparency: Revealing Datafication and Algorithmic Identities Journal Article 2 (3), 2020. @article{Stefanija2020, title = {Practical AI Transparency: Revealing Datafication and Algorithmic Identities}, author = {Ana Pop Stefanija and Jo Pierson}, url = {https://www.jdsr.io/articles/2020/11/8/practical-ai-transparency-revealing-datafication-and-algorithmic-identities}, doi = {10.33621/jdsr.v2i3.32}, year = {2020}, date = {2020-11-09}, volume = {2}, number = {3}, abstract = {How does one do research on algorithms and their outputs when confronted with the inherent algorithmic opacity and black box-ness as well as with the limitations of API-based research and the data access gaps imposed by platforms’ gate-keeping practices? This article outlines the methodological steps we undertook to manoeuvre around the above-mentioned obstacles. It is a “byproduct” of our investigation into datafication and the way how algorithmic identities are being produced for personalisation, ad delivery and recommendation. Following Paßmann and Boersma’s (2017) suggestion for pursuing “practical transparency” and focusing on particular actors, we experiment with different avenues of research. We develop and employ an approach of letting the platforms speak and making the platforms speak. In doing so, we also use non-traditional research tools, such as transparency and regulatory tools, and repurpose them as objects of/for study. Empirically testing the applicability of this integrated approach, we elaborate on the possibilities it offers for the study of algorithmic systems, while being aware and cognizant of its limitations and shortcomings.}, keywords = {}, pubstate = {published}, tppubtype = {article} } How does one do research on algorithms and their outputs when confronted with the inherent algorithmic opacity and black box-ness as well as with the limitations of API-based research and the data access gaps imposed by platforms’ gate-keeping practices? This article outlines the methodological steps we undertook to manoeuvre around the above-mentioned obstacles. It is a “byproduct” of our investigation into datafication and the way how algorithmic identities are being produced for personalisation, ad delivery and recommendation. Following Paßmann and Boersma’s (2017) suggestion for pursuing “practical transparency” and focusing on particular actors, we experiment with different avenues of research. We develop and employ an approach of letting the platforms speak and making the platforms speak. In doing so, we also use non-traditional research tools, such as transparency and regulatory tools, and repurpose them as objects of/for study. Empirically testing the applicability of this integrated approach, we elaborate on the possibilities it offers for the study of algorithmic systems, while being aware and cognizant of its limitations and shortcomings. |
CorText Newsfeed

Providing useful tools, data, methods or algorithms, has been one of the main goals of CorText Team. Therefore, CorText Newsfeed is there to put emphasis on some of our recent activities. We want it to be simple and fast reading so you would be able to pick relevant information for your own work.
Join our team
