MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task

•

0 likes•165 views

Presenter: Maigrot Cédric MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task In Working Notes Proceedings of the MediaEval 2016 Workshop, Hilversum, Netherlands, October 20-21, CEUR-WS.org (2016) by Cédric Maigrot, Vincent Claveau, Ewa Kijak, Ronan Sicre Paper: http://ceur-ws.org/Vol-1739/MediaEval_2016_paper_45.pdf Video: https://youtu.be/ay1zWydnijY Abstract: This paper presents a multi-modal hoax detection system composed of text, source, and image analysis. As hoax can be very diverse, we want to analyze several modalities to better detect them. This system is applied in the context of the Verifying Multimedia Use task of MediaEval 2016. Experiments show the performance of each separated modality as well as their combination.

Science

Linkmedia team participation: a multimodal system for
the Verifying Multimedia Use task
C´edric Maigrot, Vincent Claveau, Ewa Kijak, Ronan Sicre
October 20, 2016
Maigrot, Claveau, Kijak, Sicre Linkmedia team participation: a multimodal system for the Verifying Multimedia Use taskOctober 20, 2016 1 / 3

A multimodal system for the Verifying Multimedia Use task
Text-based
Is the message style-wise
similar to known hoax?
Maigrot, Claveau, Kijak, Sicre Linkmedia team participation: a multimodal system for the Verifying Multimedia Use taskOctober 20, 2016 2 / 3

A multimodal system for the Verifying Multimedia Use task
Text-based
Is the message style-wise
similar to known hoax?
Source-based
Presence of a trustworthy
source in the text content?
Maigrot, Claveau, Kijak, Sicre Linkmedia team participation: a multimodal system for the Verifying Multimedia Use taskOctober 20, 2016 2 / 3

A multimodal system for the Verifying Multimedia Use task
See you in 10 mn!
MediaEval 2016: A multimodal system for the
Verifying Multimedia Use task
C´edric Maigrot Vincent Claveau Ewa Kijak Ronan Sicre
{firstname}.{lastname}@irisa.fr
MediaEval 2016: A multimodal system for the
Verifying Multimedia Use task
C´edric Maigrot Vincent Claveau Ewa Kijak Ronan Sicre
{firstname}.{lastname}@irisa.fr
Why use a multimodal system ? Because there are several types of hoax !
» False information present in the text content » Forged image » Image reused for an other event
Global Hypotheses
» Prediction is ﬁrst made at the image-level, then propagated to the tweets that contain the image
» Translation if the detected language is diﬀerent than english
Text-based approach
(run-T)
Detect if the message is style-wise
similar to known hoax
§ Capture similar comments between an unknown
image and an image from the training set (e.g.
It’s photoshopped) and similar genres of com-
ments (e.g. presence of smileys)
§ Prediction made by a k-Nearest-Neighbor ap-
proach (in this case k = 1)
Source-based approach
(run-S)
Detect if the message is related to a
trustworthy source
§ 2 type of sources searched: news-related organ-
isms (e.g. press agencies) and explicit citations
of the source of the image (e.g. the pattern pho-
tographed by + Name)
§ Predict real if a trustworthy source is detected,
fake else
Example Image-based approach
(run-I)
Detect a known image
» Compare an unknown image to an image
database of 8 000 known images (7 500 fake and
500 real images)
» Database images extracted from 5 specialized
websites
» Description of an image by a deep CNN layer
output (4096-dimensional descriptor)
» Predict real (resp. fake) if a real (resp. fake)
similar image is found in the database, uncertain
else
Combination approach (run-C)
Combine the three previous predictions
» Late fusion: learn the best combination
» Boosting algorithm (adaboost.MH, parameters of the machine learning algorithm are set by cross-validation on the training data)
Results
run-T run-I run-S run-C
92.23%
34.07%
94.63%
91.22%
63.98%
49.18%
90.3%
75.25%
75.57%
40.25%
92.42%
82.47%
Approaches
Scorein%
» 2 228 messages to classify, corresponding to 130 images
» 86 % to the test tweets are associated with one or more images (the rest is associated
with video)
Conclusion
» Text-based approach: competes with the source-based approach in terms of recall but
tends to classify every tweet as fake
» Image-based approach: low precision compared with estimations on the training set.
This may be due to: (1) small and unbalanced reference database; (2) original image and
forged ones are sometimes very similar; (3) presence of stamps
» Combination-based approach: does not oﬀer any gain due to overﬁtting
Acknowledgements
This work is partly supported by the Direction G´en´erale de l’Armement, France (DGA).
Maigrot, Claveau, Kijak, Sicre Linkmedia team participation: a multimodal system for the Verifying Multimedia Use taskOctober 20, 2016 3 / 3

Viewers also liked

MediaEval 2016 - Simula Team @ Context of Experience Taskmultimediaeval

MediaEval 2015 - CERTH at MediaEval 2015 Synchronization of Multi-User Event ...multimediaeval

MediaEval 2016 - Emotion in Music Task: Lessons Learnedmultimediaeval

MediaEval 2015 - JRS at Synchronization of Multi-user Event Media Taskmultimediaeval

MediaEval 2015 - GTM-UVigo Systems for Person Discovery Task at MediaEval 2015multimediaeval

The InVID Plug-in: Web Video Verification on the BrowserInVID Project

MediaEval 2016 - LAPI @ 2016 Retrieving Diverse Social Images Task: A Pseudo-...multimediaeval

MediaEval 2016 - HUCVL Predicting Interesting Key Frames with Deep Modelsmultimediaeval

MediaEval 2016 - Verifying Multimedia Use Task Overviewmultimediaeval

MediaEval 2016 - BUT Zero-Cost Speech Recognitionmultimediaeval

MediaEval 2015 - Verifying Multimedia Use at MediaEval 2015multimediaeval

MediaEval 2016 - TUD-MMC Predicting media Interestingness Taskmultimediaeval

MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...multimediaeval

Video Retrieval for Multimedia Verification of Breaking News on Social NetworksInVID Project

MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...multimediaeval

MediaEval 2016 - IR Evaluation: Putting the User Back in the Loopmultimediaeval

Viewers also liked (16)

MediaEval 2016 - Simula Team @ Context of Experience Task

MediaEval 2015 - CERTH at MediaEval 2015 Synchronization of Multi-User Event ...

MediaEval 2016 - Emotion in Music Task: Lessons Learned

MediaEval 2015 - JRS at Synchronization of Multi-user Event Media Task

MediaEval 2015 - GTM-UVigo Systems for Person Discovery Task at MediaEval 2015

The InVID Plug-in: Web Video Verification on the Browser

MediaEval 2016 - LAPI @ 2016 Retrieving Diverse Social Images Task: A Pseudo-...

MediaEval 2016 - HUCVL Predicting Interesting Key Frames with Deep Models

MediaEval 2016 - Verifying Multimedia Use Task Overview

MediaEval 2016 - BUT Zero-Cost Speech Recognition

MediaEval 2015 - Verifying Multimedia Use at MediaEval 2015

MediaEval 2016 - TUD-MMC Predicting media Interestingness Task

MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...

Video Retrieval for Multimedia Verification of Breaking News on Social Networks

MediaEval 2016 - COSMIR and the OpenMIC Challenge: A Plan for Sustainable Mus...

MediaEval 2016 - IR Evaluation: Putting the User Back in the Loop

Similar to MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task

MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Taskmultimediaeval

Improving Question Answering by Bridging Linguistic Structures with Statistic...Jinho Choi

How do we detect malware? A step-by-step guideMarcus Botacin

Bayesian Network 을 활용한 예측 분석datasciencekorea

Lies, Damned Lies and Software Analytics: Why Big Data Needs Rich DataMargaret-Anne Storey

Computational Verification Challenges in Social MediaSymeon Papadopoulos

Alexandros Papanikolaou PROmisIgnite_Athens

Aggregating and Analyzing the Context of Social Media ContentSymeon Papadopoulos

Master defence 2020 - Andrew Kurochkin - Meme Generation for Social Media Aud...Lviv Data Science Summer School

Enhancing Social Network Security through Smart CredentialsIJCSIS Research Publications

AMSWMC MV NPD.pptxAna Canhoto

IntroductionKh Ravy

m16y - How to make your media accessible for all usersMaya Shavin

A Hybrid Approach For Phishing Website Detection Using Machine Learning.vivatechijri

Keith J. Jones, Ph.D. - MALGAZER: AN AUTOMATED MALWARE CLASSIFIER WITH RUNNIN...Keith Jones, PhD

Science Big, Science ConnectedDeepak Singh

Robust Expert Finding in Web-Based Community Information SystemsRalf Klamma

Faking Sandy: Characterizing and Identifying Fake Images on Twitter during Hu...IIIT Hyderabad

Modified Apriori Algorithm for Frequent Pattern MiningPritish Yuvraj

Similar to MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task (20)

MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task

Improving Question Answering by Bridging Linguistic Structures with Statistic...

How do we detect malware? A step-by-step guide

Bayesian Network 을 활용한 예측 분석

Lies, Damned Lies and Software Analytics: Why Big Data Needs Rich Data

Computational Verification Challenges in Social Media

Alexandros Papanikolaou PROmis

Aggregating and Analyzing the Context of Social Media Content

Master defence 2020 - Andrew Kurochkin - Meme Generation for Social Media Aud...

Enhancing Social Network Security through Smart Credentials

AMSWMC MV NPD.pptx

Introduction

m16y - How to make your media accessible for all users

A Hybrid Approach For Phishing Website Detection Using Machine Learning.

Keith J. Jones, Ph.D. - MALGAZER: AN AUTOMATED MALWARE CLASSIFIER WITH RUNNIN...

Science Big, Science Connected

Robust Expert Finding in Web-Based Community Information Systems

Faking Sandy: Characterizing and Identifying Fake Images on Twitter during Hu...

Modified Apriori Algorithm for Frequent Pattern Mining

Recently uploaded

Environmental Biotechnology Topic:- Microbial Biosensorsonawaneprad

Pests of Bengal gram_Identification_Dr.UPR.pdfPirithiRaju

REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...Universidade Federal de Sergipe - UFS

Pests of castor_Binomics_Identification_Dr.UPR.pdfPirithiRaju

Davis plaque method.pptx recombinant DNA technologycaarthichand2003

Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPirithiRaju

《Queensland毕业文凭-昆士兰大学毕业证成绩单》rnrncn29

User Guide: Magellan MX™ Weather StationColumbia Weather Systems

Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPirithiRaju

STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxMurugaveni B

RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptxFarihaAbdulRasheed

Microteaching on terms used in filtration .Pharmaceutical EngineeringPrajakta Shinde

Citronella presentation SlideShare mani upadhyayupadhyaymani499

REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...Universidade Federal de Sergipe - UFS

Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuinethapagita

Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh

(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)riyaescorts54

Functional group interconversions(oxidation reduction)itwameryclare

Pests of safflower_Binomics_Identification_Dr.UPR.pdfPirithiRaju

Neurodevelopmental disorders according to the dsm 5 trssuser06f238

Recently uploaded (20)

Environmental Biotechnology Topic:- Microbial Biosensor

Pests of Bengal gram_Identification_Dr.UPR.pdf

REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...

Pests of castor_Binomics_Identification_Dr.UPR.pdf

Davis plaque method.pptx recombinant DNA technology

Pests of jatropha_Bionomics_identification_Dr.UPR.pdf

《Queensland毕业文凭-昆士兰大学毕业证成绩单》

User Guide: Magellan MX™ Weather Station

Pests of soyabean_Binomics_IdentificationDr.UPR.pdf

STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx

RESPIRATORY ADAPTATIONS TO HYPOXIA IN HUMNAS.pptx

Microteaching on terms used in filtration .Pharmaceutical Engineering

Citronella presentation SlideShare mani upadhyay

REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...

Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine

Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝

(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)

Functional group interconversions(oxidation reduction)

Pests of safflower_Binomics_Identification_Dr.UPR.pdf

Neurodevelopmental disorders according to the dsm 5 tr

MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task

1. Linkmedia team participation: a multimodal system for the Verifying Multimedia Use task C´edric Maigrot, Vincent Claveau, Ewa Kijak, Ronan Sicre October 20, 2016 Maigrot, Claveau, Kijak, Sicre Linkmedia team participation: a multimodal system for the Verifying Multimedia Use taskOctober 20, 2016 1 / 3

2. A multimodal system for the Verifying Multimedia Use task Text-based Is the message style-wise similar to known hoax? Maigrot, Claveau, Kijak, Sicre Linkmedia team participation: a multimodal system for the Verifying Multimedia Use taskOctober 20, 2016 2 / 3

3. A multimodal system for the Verifying Multimedia Use task Text-based Is the message style-wise similar to known hoax? Source-based Presence of a trustworthy source in the text content? Maigrot, Claveau, Kijak, Sicre Linkmedia team participation: a multimodal system for the Verifying Multimedia Use taskOctober 20, 2016 2 / 3

4. A multimodal system for the Verifying Multimedia Use task Text-based Is the message style-wise similar to known hoax? Source-based Presence of a trustworthy source in the text content? Image-based The image is already known? Maigrot, Claveau, Kijak, Sicre Linkmedia team participation: a multimodal system for the Verifying Multimedia Use taskOctober 20, 2016 2 / 3

5. A multimodal system for the Verifying Multimedia Use task Text-based Is the message style-wise similar to known hoax? Source-based Presence of a trustworthy source in the text content? ↓ Image-based The image is already known? → Combinaison approach Can the three previous predictions help? Maigrot, Claveau, Kijak, Sicre Linkmedia team participation: a multimodal system for the Verifying Multimedia Use taskOctober 20, 2016 2 / 3

6. A multimodal system for the Verifying Multimedia Use task See you in 10 mn! MediaEval 2016: A multimodal system for the Verifying Multimedia Use task Cédric Maigrot Vincent Claveau Ewa Kijak Ronan Sicre {firstname}.{lastname}@irisa.fr MediaEval 2016: A multimodal system for the Verifying Multimedia Use task Cédric Maigrot Vincent Claveau Ewa Kijak Ronan Sicre {firstname}.{lastname}@irisa.fr Why use a multimodal system ? Because there are several types of hoax ! » False information present in the text content » Forged image » Image reused for an other event Global Hypotheses » Prediction is first made at the image-level, then propagated to the tweets that contain the image » Translation if the detected language is different than english Text-based approach (run-T) Detect if the message is style-wise similar to known hoax § Capture similar comments between an unknown image and an image from the training set (e.g. It’s photoshopped) and similar genres of comments (e.g. presence of smileys) § Prediction made by a k-Nearest-Neighbor approach (in this case k = 1) Source-based approach (run-S) Detect if the message is related to a trustworthy source § 2 type of sources searched: news-related organ- isms (e.g. press agencies) and explicit citations of the source of the image (e.g. the pattern pho- tographed by + Name) § Predict real if a trustworthy source is detected, fake else Example Image-based approach (run-I) Detect a known image » Compare an unknown image to an image database of 8 000 known images (7 500 fake and 500 real images) » Database images extracted from 5 specialized websites » Description of an image by a deep CNN layer output (4096-dimensional descriptor) » Predict real (resp. fake) if a real (resp. fake) similar image is found in the database, uncertain else Combination approach (run-C) Combine the three previous predictions » Late fusion: learn the best combination » Boosting algorithm (adaboost.MH, parameters of the machine learning algorithm are set by cross-validation on the training data) Results run-T run-I run-S run-C 92.23% 34.07% 94.63% 91.22% 63.98% 49.18% 90.3% 75.25% 75.57% 40.25% 92.42% 82.47% Approaches Scorein% » 2 228 messages to classify, corresponding to 130 images » 86 % to the test tweets are associated with one or more images (the rest is associated with video) Conclusion » Text-based approach: competes with the source-based approach in terms of recall but tends to classify every tweet as fake » Image-based approach: low precision compared with estimations on the training set. This may be due to: (1) small and unbalanced reference database; (2) original image and forged ones are sometimes very similar; (3) presence of stamps » Combination-based approach: does not offer any gain due to overfitting Acknowledgements This work is partly supported by the Direction Générale de l’Armement, France (DGA). Maigrot, Claveau, Kijak, Sicre Linkmedia team participation: a multimodal system for the Verifying Multimedia Use taskOctober 20, 2016 3 / 3

MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (16)

Similar to MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task

Similar to MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task (20)

More from multimediaeval

More from multimediaeval (20)

Recently uploaded

Recently uploaded (20)

MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task