

{"id":64,"date":"2019-06-20T14:08:39","date_gmt":"2019-06-20T12:08:39","guid":{"rendered":"https:\/\/project.inria.fr\/bigclin\/?page_id=64"},"modified":"2019-10-31T10:46:38","modified_gmt":"2019-10-31T09:46:38","slug":"results","status":"publish","type":"page","link":"https:\/\/project.inria.fr\/bigclin\/results\/","title":{"rendered":"Results"},"content":{"rendered":"<p><\/p>\n<h2 id=\"aui_3_4_0_1_382\"><a href=\"https:\/\/project.inria.fr\/bigclin\/files\/2019\/10\/BigClin-pre-final-report1.pdf\">Our pre-Final REPORT<\/a> is available<\/h2>\n<h3><u>Achievements<\/u><\/h3>\n<ul>\n<li>IRISA\/LinkMedia and CNRS\/STL have develop NLP techniques to extract numerical information from clinical record or clinical trial protocols (what is the concept measured, what is the unit, what is the value). Our approach is good in terms of precision and recall and is able to handle variations in the way units are expressed. The different aspects of this work has been published in several NLP conferences.<\/li>\n<li>IRISA\/LinkMedia and CNRS\/STL\u00a0have develop a NLP tool based on deep learning to detect negation in biomedical texts (the negation marker and the concerned negated concept). This tool achieve state-of-the-art results on English and the best known results for French and Portguese. It is available as a web-service on allgo.inria.fr . This work has been publsihed in several NLP conferences, and is under adaptation to other languages.<\/li>\n<li>IRISA\/LinkMedia and CNRS\/STL\u00a0have developed a Part-of-Speech tagger and lemmatizer based on Conditional Random Fields and example-based machine learning. It is freely available as a web-service on <a href=\"https:\/\/allgo.inria.fr\">Allgo<\/a>.<\/li>\n<li>IRISA\/Cidre have developed a new distributed algorithms to detect the k most frequent elements in a set. It is a one-pass, time-decaying, resource efficient and distributed approach. It is especially suited for massive and highly dynamic data streams such as medical events mined in the EHR of several hospitals. This work has been published in two top-tier conferences: DSN 2018 and NCA 2018.<\/li>\n<li>IRISA\/LinkMedia and CNRS\/STL, with colleagues from LIMSI, France, have proposed a text-mining competition, DeFT2019, dedicated to information extraction in clinical texts. A workshop will be held during PFIA in Toulouse, Fr. See the <a href=\"https:\/\/deft.limsi.fr\/2019\/\">challenge website<\/a>.<\/li>\n<\/ul>\n<h3><u>On going work<\/u><\/h3>\n<ul>\n<li>we are still evaluating distributed algorithms to detect the k most frequent elements in a set (IRISA\/Cidre)<\/li>\n<li>we are developping text classification tools based on deep learning to automatically assign ICD10 codes (PMSI) to clinical records (IRISA\/LinkMedia)<\/li>\n<\/ul>\n<h3><u>Latest news<\/u><\/h3>\n<h4 id=\"aui_3_4_0_1_427\">July 2019: BigClin at TALN\/PFIA<\/h4>\n<p>We have presented the CAS corpus (clinical cases and their fine grain annotations) at the conference TALN. The corpus is being made freely available, on demand, for research purposes.<\/p>\n<h4>July 2019: workshop of the Text-mining Challenge DeFT2019<\/h4>\n<p>STL and Linkmedia, with colleagues from LIMSI, proposed a text-mining competition about information extraction in clinical texts. Nine teams participated (from academia and companies). The results and the participants&#8217; approaches have been presented in a workshop held during PFIA in Toulouse, in July.<\/p>\n<h4>May 2019: Visit from Claudia Moro, PUCPR in Rennes<\/h4>\n<p>Claudia Moro visited LinkMedia, IRISA during a week. During her stay, we finished to adapt the information extraction tools developed for French to Brazilian. We began to write reports for the Brazilian funding agency. Plans for a co-advised PhD were also discussed.<\/p>\n<h4>January 2019: TagEx, a new NLP API is online<\/h4>\n<p>The Part-of-Speech system developed in the framework of the project is now available online as an API. Test it on <a href=\"http:\/\/allgo.inria.fr\">Allgo<\/a>\u00a0.<\/p>\n<h4>November 2018: BigClin at NCA 2018<\/h4>\n<p>Vasile Cazacu is presenting his work &#8220;On the fly detection of top-k elements in the sliding-window model.&#8221; during the 17th IEEE International Symposium on Network\u00a0Computing and Applications (NCA 2018), Boston, MA (CORE ranking: A).<\/p>\n<h4>November 2018: Visit to PUCPR, Curitiba, Brazil<\/h4>\n<p>Natalia Grabar (STL) and Vincent Claveau (IRISA) went to PUCPR to follow up the common work with our brazilian partners about numerical information extraction, negation detection and Part-of-Speech tagging. Several publications were also initiated during the stay.<\/p>\n<h4>November 2018: BigClin meeting<\/h4>\n<p>One full day dedicated to the project. All the participants have presented their latest results, with a special emphasis on the work of our two PhD students, Vasile Cazacu and Cl\u00e9ment Dalloux. We had two invited researchers presenting their work in biomedical NLP :<\/p>\n<ul>\n<li>Cyril Grouin (LIMSI) about Pharmacovigilance et r\u00e9seaux sociaux<\/li>\n<li>Anne-Lyse Minard, about her participation to the SMM4H challenge about drug identification, and adverse reaction<\/li>\n<\/ul>\n<h4>October 2018: BigClin at Louhi<\/h4>\n<p>Cl\u00e9ment Dalloux will prensent his work about the development of annotated datasets of French clinical texts at the Louhi workshop held in Brussels, Belgium.<\/p>\n<h4>June 2018: BigClin at DSN 2018<\/h4>\n<p>Vasile Cazacu is presenting his work &#8220;Finding Top-k Most Frequent Items\u00a0in Distributed Streams in the Time-Sliding Window Model&#8221; at the 48th International\u00a0Conference on Dependable Systems and Networks (DSN 2018), Luxembourg City,\u00a0Luxembourg (CORE ranking: A).<\/p>\n<h4>March 2018: BigClin at n2c2 challenge<\/h4>\n<p>This year, our team has participated to the n2c2 challenge on clinical trial recruitment. Although it addresses a real-life scenario (patient recruitment for clinical trials), the small amount of training data discarded our prefered approach based on machine learning (deep learning and others). Yet, rule-based techniques were also developed and gave good results. This work is going on.<\/p>\n<h4>February 2018: Visit to PUCPR, Curitiba, Brazil<\/h4>\n<p>Marc Cuggia and Pascal VanHill (LTSI) are on their way to spend 10 days in Curitiba with our brazilian partners. They will present their clinical record warehousing system.<\/p>\n<p>&nbsp;<\/p>\n<h4>December 2017: Visit to PUCPR, Curitiba, Brazil<\/h4>\n<p>Following our collaboration with the members of\u00a0the medical informatics dept of the\u00a0Pontif\u00edcia Universidade Cat\u00f3lica do Paran\u00e1, C. Dalloux spent one month in Curitiba to work on text mining in clinical texts in French, Portuguese and English.<\/p>\n<p>&nbsp;<\/p>\n<h4>November 2017: BigClin at SIIM 2017<\/h4>\n<ol>\n<li>Dalloux (IRISA\/LinkMedia) will present his work about negation detection in biomedical texts at the\u00a0<em>Symposium sur l&#8217;Ing\u00e9nierie de l&#8217;Information M\u00e9dicale.<\/em><\/li>\n<\/ol>\n<p>&nbsp;<\/p>\n<h4>November 2017: NegDetect, a new NLP API is online<\/h4>\n<p>The negation detection system developed in the framework of the PhD thesis of C. Dalloux is now available online as an API. Test it on <a href=\"http:\/\/allgo.inria.fr\">Allgo<\/a>\u00a0.<\/p>\n<p>&nbsp;<\/p>\n<h4>June 2017: BigClin at TALN2017<\/h4>\n<p>We will present our work on extracting numerical eligibility criteria in clinical trial protocaols at the conference Traitement Automatique des Langues Naturelles in Orl\u00e9ans.<\/p>\n<p>&nbsp;<\/p>\n<h4>June 2017: BigClin at AIME2017<\/h4>\n<p>We will present our work on extracting numerical eligibility criteria in clinical trial protocaols at the 16th conference in Artificial Intelligence in Medecine in Vienna.<\/p>\n<p>&nbsp;<\/p>\n<h4>April 2017: BigClin at CICLing 2017<\/h4>\n<p>Vincent Claveau and Ewa Kijak (IRISA\/LinkMedia) will present their work on active learning for text mining at the\u00a0<em>18th International Conference on Computational Linguistics and Intelligent Text Processing\u00a0<\/em>in Budapest.<\/p>\n<h4>February 2017: a new PhD student join the team!<\/h4>\n<p>Vasile Cazacu is hired as a PhD student. He will work with E. Anceaume and Yann Busnel on distributed computing.<\/p>\n<p>&nbsp;<\/p>\n<h4>December 2016: Visit to PUCPR, Curitiba, Brazil<\/h4>\n<p>In december 2016, BigClin members visited the medical informatics dept of the\u00a0Pontif\u00edcia Universidade Cat\u00f3lica do Paran\u00e1 &#8211; PUCPR, in the research group of Prof. Claudia Moro.<\/p>\n<p>A seminar was organized, gathering researchers from computer science, medical informatics and medecine. We presented rearch activities in the frame of the BigClin project:<\/p>\n<ul>\n<li>Vincent Claveau, Biomedical NLP at IRISA<\/li>\n<li>Natalia Grabar, Managing the complexity of medical texts<\/li>\n<li>Guillaume Bouzill\u00e9, Big Data in Health: the example of CHU Rennes<\/li>\n<\/ul>\n<p>Many common research avenues were identified and a long term collaboration is foreseen with the team of Prof. Moro.<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<h4>December 2016: a new PhD student join the team!<\/h4>\n<p>Cl\u00e9ment Dalloux is hired as PhD student in the Linkmedia team. He will study\u00a0 biomedical NLP (text mining, information retrieval).<\/p>\n<p>&nbsp;<\/p>\n<h4>October 2016: Kickoff<\/h4>\n<p>A seminar was organized in LTSI Rennes for the kickoff of BigClin. Many invited personalities presented their work and \u00a0about 30 participants attended.<\/p>\n<p>&nbsp;<\/p>\n<p>Program<\/p>\n<p>9h00 &#8211; 9h30 : Welcome<\/p>\n<p>9h30 &#8211; 9h45 : Opening and presentation of the project (15\u2019) Marc Cuggia &amp; Vincent Claveau<\/p>\n<p>9h45-11h00 Guest presentations (each 20\u2019+5\u2019)<\/p>\n<ul>\n<li>Frantz Thiessard \/ Diallo Gayo (ISPED\/CHU de Bordeaux) &#8211; Title :Contribution of NLP techniques in three research projects<\/li>\n<li>Bastien Rance (HEGP) &#8211; Title :Secondary use of data at HEGP: The new challenges of reusing clinical narratives.<\/li>\n<li>Douglas Teodoro (SIB Swiss Institute of Bioinformatics) : Title: Making sense of text data: from enterprise to web content<\/li>\n<\/ul>\n<p>11h15 &#8211; 11h30 : Pause<\/p>\n<p>11h30 &#8211; 12h20 : Guest presentations (each 20\u2019+5\u2019)<\/p>\n<ul>\n<li>Claudia Moro (University of Parana) : Exploiting PLN methods to identifying diagnosis and continue of care information from Brazilian Portuguese Clinical Narratives &#8211; System Evaluation and EHR integration<\/li>\n<li>Pierre Antoine Gourraud (CHU and University of Nantes) : &#8220;Precision Medicine for Multiple Sclerosis: From research cohort datasets through EMR data, The MS Bioscreen application&#8221;<\/li>\n<\/ul>\n<p>12h20 &#8211; 13h50 Lunch and discussion (70\u2019)<\/p>\n<p>13h50 &#8211; 14h30 (40\u2019\u2019): Demos and related scientific works<\/p>\n<ul>\n<li>eHOP (Guillaume Bouzille)<\/li>\n<\/ul>\n<p>14h30 &#8211; 15h15 \u00a0Teams presentations (45\u2019) :<\/p>\n<p>10&#8217;+5&#8242; \u00a0\u00a0\u00a0Linkmedia (IRISA) \u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Vincent Claveau<\/p>\n<p>10&#8217;+5&#8242; \u00a0\u00a0\u00a0STL (Lille) \u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Natalia Grabar<\/p>\n<p>10&#8217;+5&#8242; \u00a0\u00a0\u00a0Cidre (IRISA) \u00a0\u00a0 \u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 Yann Busnel \/ Emmanuelle<\/p>\n<p>15h15 &#8211; 16h15 (75\u2019\u2019) Discussions of topics<\/p>\n<ul>\n<li>Use cases and applications for research<\/li>\n<li>Pre-requisite for data processing (de-identification, data protection and regulation, project organisation, IT)<\/li>\n<li>Extraction, indexing, Machine learning methods and tools<\/li>\n<li>Benchmarks\/Evaluation\/Challenges<\/li>\n<li>PhD or MSD Students \u00a0exchange<\/li>\n<\/ul>\n<p>16h15 &#8211; 16h30 \u00a0Pause<\/p>\n<p>16h30 &#8211; 17h45 \u00a0(75\u2019) Action plan, networking, collaborations, next scientific meetings<\/p>\n<p>17h45 &#8211; 17h55 \u00a0(10\u2019) Wrap up and Cloture<\/p>","protected":false},"excerpt":{"rendered":"<p>Our pre-Final REPORT is available Achievements IRISA\/LinkMedia and CNRS\/STL have develop NLP techniques to extract numerical information from clinical record or clinical trial protocols (what\u2026<\/p>\n<p> <a class=\"continue-reading-link\" href=\"https:\/\/project.inria.fr\/bigclin\/results\/\"><span>Continue reading<\/span><i class=\"crycon-right-dir\"><\/i><\/a> <\/p>\n","protected":false},"author":1611,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-64","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/project.inria.fr\/bigclin\/wp-json\/wp\/v2\/pages\/64","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/project.inria.fr\/bigclin\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/project.inria.fr\/bigclin\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/project.inria.fr\/bigclin\/wp-json\/wp\/v2\/users\/1611"}],"replies":[{"embeddable":true,"href":"https:\/\/project.inria.fr\/bigclin\/wp-json\/wp\/v2\/comments?post=64"}],"version-history":[{"count":3,"href":"https:\/\/project.inria.fr\/bigclin\/wp-json\/wp\/v2\/pages\/64\/revisions"}],"predecessor-version":[{"id":83,"href":"https:\/\/project.inria.fr\/bigclin\/wp-json\/wp\/v2\/pages\/64\/revisions\/83"}],"wp:attachment":[{"href":"https:\/\/project.inria.fr\/bigclin\/wp-json\/wp\/v2\/media?parent=64"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}