Loading…

Cross-Modal Analysis of Audio-Visual Film Montage

A stylistic device frequently employed by filmmakers is the synchronous montage (composition) of audio and visual elements. Synchronous montage helps to increase tension and tempo in a scene and highlights important events in the story. Sequences with synchronous montage usually contain rich semanti...

Full description

Saved in:
Bibliographic Details
Main Authors: Zeppelzauer, M., Mitrovic, D., Breiteneder, C.
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 6
container_issue
container_start_page 1
container_title
container_volume
creator Zeppelzauer, M.
Mitrovic, D.
Breiteneder, C.
description A stylistic device frequently employed by filmmakers is the synchronous montage (composition) of audio and visual elements. Synchronous montage helps to increase tension and tempo in a scene and highlights important events in the story. Sequences with synchronous montage usually contain rich semantics which is relevant for understanding a movie. This property is currently not exploited in automated indexing, annotation, and summarization of movies. We propose a cross-modal approach that extracts sequences from a movie with synchronous audio-visual montage. Experiments confirm that the extracted sequences have high semantic relevance. Consequently, they represent a useful basis for different high-level movie abstraction tasks such as automated movie annotation and movie summarization.
doi_str_mv 10.1109/ICCCN.2011.6005782
format conference_proceeding
fullrecord <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_6005782</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6005782</ieee_id><sourcerecordid>6005782</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-2976633ae0d37ae9d0bf6c2170369b4bad3996fc7b1ff8a5a82b913683ff8de83</originalsourceid><addsrcrecordid>eNo1j8tOwzAURM1LIpT8AGzyAw62b3xtLyOLQqUWNsC2cmobGaUNittF_55ILbMZjY50pCHkgbOac2aeFtbat1owzmtkTCotLkhplOaNVIohIFySQiAoahpgV-TuHyh1TYrJIKlgUt6SMucfNgVRawMF4XYccqarwbu-aneuP-aUqyFW7cGngX6lfJjAPPXbajXs9u473JOb6PocynPPyOf8-cO-0uX7y8K2S5q4knsqjEIEcIF5UC4Yz7qIG8EVAzRd0zkPxmDcqI7HqJ10WnSGA2qYpg8aZuTx5E0hhPXvmLZuPK7P7-EPneBIOg</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Cross-Modal Analysis of Audio-Visual Film Montage</title><source>IEEE Xplore All Conference Series</source><creator>Zeppelzauer, M. ; Mitrovic, D. ; Breiteneder, C.</creator><creatorcontrib>Zeppelzauer, M. ; Mitrovic, D. ; Breiteneder, C.</creatorcontrib><description>A stylistic device frequently employed by filmmakers is the synchronous montage (composition) of audio and visual elements. Synchronous montage helps to increase tension and tempo in a scene and highlights important events in the story. Sequences with synchronous montage usually contain rich semantics which is relevant for understanding a movie. This property is currently not exploited in automated indexing, annotation, and summarization of movies. We propose a cross-modal approach that extracts sequences from a movie with synchronous audio-visual montage. Experiments confirm that the extracted sequences have high semantic relevance. Consequently, they represent a useful basis for different high-level movie abstraction tasks such as automated movie annotation and movie summarization.</description><identifier>ISSN: 1095-2055</identifier><identifier>ISBN: 1457706377</identifier><identifier>ISBN: 9781457706370</identifier><identifier>EISSN: 2637-9430</identifier><identifier>EISBN: 9781457706363</identifier><identifier>EISBN: 1457706385</identifier><identifier>EISBN: 9781457706387</identifier><identifier>EISBN: 1457706369</identifier><identifier>DOI: 10.1109/ICCCN.2011.6005782</identifier><language>eng</language><publisher>IEEE</publisher><subject>Correlation ; Estimation ; Feature extraction ; Humans ; Motion pictures ; Semantics ; Visualization</subject><ispartof>2011 Proceedings of 20th International Conference on Computer Communications and Networks (ICCCN), 2011, p.1-6</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6005782$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2057,27924,54554,54919,54931</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6005782$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Zeppelzauer, M.</creatorcontrib><creatorcontrib>Mitrovic, D.</creatorcontrib><creatorcontrib>Breiteneder, C.</creatorcontrib><title>Cross-Modal Analysis of Audio-Visual Film Montage</title><title>2011 Proceedings of 20th International Conference on Computer Communications and Networks (ICCCN)</title><addtitle>ICCCN</addtitle><description>A stylistic device frequently employed by filmmakers is the synchronous montage (composition) of audio and visual elements. Synchronous montage helps to increase tension and tempo in a scene and highlights important events in the story. Sequences with synchronous montage usually contain rich semantics which is relevant for understanding a movie. This property is currently not exploited in automated indexing, annotation, and summarization of movies. We propose a cross-modal approach that extracts sequences from a movie with synchronous audio-visual montage. Experiments confirm that the extracted sequences have high semantic relevance. Consequently, they represent a useful basis for different high-level movie abstraction tasks such as automated movie annotation and movie summarization.</description><subject>Correlation</subject><subject>Estimation</subject><subject>Feature extraction</subject><subject>Humans</subject><subject>Motion pictures</subject><subject>Semantics</subject><subject>Visualization</subject><issn>1095-2055</issn><issn>2637-9430</issn><isbn>1457706377</isbn><isbn>9781457706370</isbn><isbn>9781457706363</isbn><isbn>1457706385</isbn><isbn>9781457706387</isbn><isbn>1457706369</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2011</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNo1j8tOwzAURM1LIpT8AGzyAw62b3xtLyOLQqUWNsC2cmobGaUNittF_55ILbMZjY50pCHkgbOac2aeFtbat1owzmtkTCotLkhplOaNVIohIFySQiAoahpgV-TuHyh1TYrJIKlgUt6SMucfNgVRawMF4XYccqarwbu-aneuP-aUqyFW7cGngX6lfJjAPPXbajXs9u473JOb6PocynPPyOf8-cO-0uX7y8K2S5q4knsqjEIEcIF5UC4Yz7qIG8EVAzRd0zkPxmDcqI7HqJ10WnSGA2qYpg8aZuTx5E0hhPXvmLZuPK7P7-EPneBIOg</recordid><startdate>201107</startdate><enddate>201107</enddate><creator>Zeppelzauer, M.</creator><creator>Mitrovic, D.</creator><creator>Breiteneder, C.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>201107</creationdate><title>Cross-Modal Analysis of Audio-Visual Film Montage</title><author>Zeppelzauer, M. ; Mitrovic, D. ; Breiteneder, C.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-2976633ae0d37ae9d0bf6c2170369b4bad3996fc7b1ff8a5a82b913683ff8de83</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Correlation</topic><topic>Estimation</topic><topic>Feature extraction</topic><topic>Humans</topic><topic>Motion pictures</topic><topic>Semantics</topic><topic>Visualization</topic><toplevel>online_resources</toplevel><creatorcontrib>Zeppelzauer, M.</creatorcontrib><creatorcontrib>Mitrovic, D.</creatorcontrib><creatorcontrib>Breiteneder, C.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Xplore</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Zeppelzauer, M.</au><au>Mitrovic, D.</au><au>Breiteneder, C.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Cross-Modal Analysis of Audio-Visual Film Montage</atitle><btitle>2011 Proceedings of 20th International Conference on Computer Communications and Networks (ICCCN)</btitle><stitle>ICCCN</stitle><date>2011-07</date><risdate>2011</risdate><spage>1</spage><epage>6</epage><pages>1-6</pages><issn>1095-2055</issn><eissn>2637-9430</eissn><isbn>1457706377</isbn><isbn>9781457706370</isbn><eisbn>9781457706363</eisbn><eisbn>1457706385</eisbn><eisbn>9781457706387</eisbn><eisbn>1457706369</eisbn><abstract>A stylistic device frequently employed by filmmakers is the synchronous montage (composition) of audio and visual elements. Synchronous montage helps to increase tension and tempo in a scene and highlights important events in the story. Sequences with synchronous montage usually contain rich semantics which is relevant for understanding a movie. This property is currently not exploited in automated indexing, annotation, and summarization of movies. We propose a cross-modal approach that extracts sequences from a movie with synchronous audio-visual montage. Experiments confirm that the extracted sequences have high semantic relevance. Consequently, they represent a useful basis for different high-level movie abstraction tasks such as automated movie annotation and movie summarization.</abstract><pub>IEEE</pub><doi>10.1109/ICCCN.2011.6005782</doi><tpages>6</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1095-2055
ispartof 2011 Proceedings of 20th International Conference on Computer Communications and Networks (ICCCN), 2011, p.1-6
issn 1095-2055
2637-9430
language eng
recordid cdi_ieee_primary_6005782
source IEEE Xplore All Conference Series
subjects Correlation
Estimation
Feature extraction
Humans
Motion pictures
Semantics
Visualization
title Cross-Modal Analysis of Audio-Visual Film Montage
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T16%3A02%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Cross-Modal%20Analysis%20of%20Audio-Visual%20Film%20Montage&rft.btitle=2011%20Proceedings%20of%2020th%20International%20Conference%20on%20Computer%20Communications%20and%20Networks%20(ICCCN)&rft.au=Zeppelzauer,%20M.&rft.date=2011-07&rft.spage=1&rft.epage=6&rft.pages=1-6&rft.issn=1095-2055&rft.eissn=2637-9430&rft.isbn=1457706377&rft.isbn_list=9781457706370&rft_id=info:doi/10.1109/ICCCN.2011.6005782&rft.eisbn=9781457706363&rft.eisbn_list=1457706385&rft.eisbn_list=9781457706387&rft.eisbn_list=1457706369&rft_dat=%3Cieee_CHZPO%3E6005782%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i175t-2976633ae0d37ae9d0bf6c2170369b4bad3996fc7b1ff8a5a82b913683ff8de83%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6005782&rfr_iscdi=true