List of Accepted Papers

Following is the list of accepted SLT 2012 papers, sorted by paper title. You can use the search feature of your web browser to find your paper number. Notifications to all authors have also been sent by email. If you have not received your notification of the results by email, please contact us at papers@slt2012.org.

1130A COMPARISON-BASED APPROACH TO MISPRONUNCIATION DETECTION
1136A CRITICAL ANALYSIS OF TWO STATISTICAL SPOKEN DIALOG SYSTEMS IN PUBLIC USE
1088A GRAPHEME-BASED METHOD FOR AUTOMATIC ALIGNMENT OF SPEECH AND TEXT DATA
1047A NOISE-ROBUST SPEECH RECOGNITION METHOD COMPOSED OF WEAK NOISE SUPPRESSION AND WEAK VECTOR TAYLOR SERIES ADAPTATION
1044A NONPARAMETRIC BAYESIAN APPROACH TO LEARNING MULTIMODAL INTERACTION MANAGEMENT
1127A RERANKING APPROACH FOR RECOGNITION AND CLASSIFICATION OF SPEECH INPUT IN CONVERSATIONAL DIALOGUE SYSTEMS
1113ACOUSTIC MODELING FOR UNDER-RESOURCED LANGUAGES BASED ON VECTORIAL HMM-STATES REPRESENTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS
1174ACTIVE LEARNING FOR ACCENT ADAPTATION IN AUTOMATIC SPEECH RECOGNITION
1178ADAPTATION OF CONTEXT-DEPENDENT DEEP NEURAL NETWORKS FOR AUTOMATIC SPEECH RECOGNITION
1123AFFECTIVE EVALUATION OF A MOBILE MULTIMODAL DIALOGUE SYSTEM USING BRAIN SIGNALS
1064AMERICAN SIGN LANGUAGE FINGERSPELLING RECOGNITION WITH PHONOLOGICAL FEATURE-BASED TANDEM MODELS
1068AN AUTOMATIC PITCH ACCENT FEEDBACK SYSTEM FOR ENGLISH LEARNERS WITH ADAPTATION OF AN ENGLISH CORPUS SPOKEN BY KOREANS
1125ANALYSIS OF SPEECH TRANSCRIPTS TO PREDICT WINNERS OF U.S. PRESIDENTIAL AND VICE-PRESIDENTIAL DEBATES
1086AUDIO-VISUAL FEATURE INTEGRATION BASED ON PIECEWISE LINEAR TRANSFORMATION FOR NOISE ROBUST AUTOMATIC SPEECH RECOGNITION
1167AUTOMATIC CHINESE PRONUNCIATION ERROR DETECTION USING SVM WITH STRUCTURAL FEATURES
1139AUTOMATIC CLASSIFICATION OF UNEQUAL LEXICAL STRESS PATTERNS USING MACHINE LEARNING ALGORITHMS
1015AUTOMATIC DETECTION AND CORRECTION OF SYNTAX-BASED PROSODY ANNOTATION ERRORS
1156AUTOMATIC TRANSCRIPTION OF ACADEMIC LECTURES FROM DIVERSE DISCIPLINES
1036CLASS-BASED SPEECH RECOGNITION USING A MAXIMUM DISSIMILARITY CRITERION AND A TOLERANCE CLASSIFICATION MARGIN
1069COMBINING CEPSTRAL NORMALIZATION AND COCHLEAR IMPLANT-LIKE SPEECH PROCESSING FOR MICROPHONE ARRAY-BASED SPEECH RECOGNITION
1033COMBINING CRITERIA FOR THE DETECTION OF INCORRECT ENTRIES OF NON-NATIVE SPEECH IN THE CONTEXT OF FOREIGN LANGUAGE LEARNING
1076COMBINING MULTIPLE TRANSLATION SYSTEMS FOR SPOKEN LANGUAGE UNDERSTANDING PORTABILITY
1071COMPARISON OF ADAPTATION METHODS FOR GMM-SVM BASED SPEECH EMOTION RECOGNITION
1176CONTEXT DEPENDENT RECURRENT NEURAL NETWORK LANGUAGE MODEL
1083CONTEXT-DEPENDENT DEEP NEURAL NETWORKS FOR AUDIO INDEXING OF REAL-LIFE DATA
1161CROWDSOURCING THE ACQUISITION OF NATURAL LANGUAGE CORPORA: METHODS AND OBSERVATIONS
1180DEEP-LEVEL ACOUSTIC-TO-ARTICULATORY MAPPING FOR DBN-HMM BASED PHONE RECOGNITION
1035DISCRIMINATIVE SPOKEN LANGUAGE UNDERSTANDING USING WORD CONFUSION NETWORKS
1158ECOLOGICAL VALIDITY AND THE EVALUATION OF SPEECH SUMMARIZATION QUALITY
1065EFFICIENT PRIOR AND INCREMENTAL BEAM WIDTH CONTROL TO SUPPRESS EXCESSIVE SPEECH RECOGNITION TIME BASED ON SCORE RANGE ESTIMATION
1155EMPLOYING BOOSTING TO COMPARE CUES TO VERBAL FEEDBACK IN MULTI-LINGUAL DIALOG
1185EVALUATING THE EFFECT OF NORMALIZING INFORMAL TEXT ON TTS OUTPUT
1150EXEMPLAR-BASED VOICE CONVERSION IN NOISY ENVIRONMENT
1175EXPLOITING LOUDNESS DYNAMICS IN STOCHASTIC MODELS OF TURN-TAKING
1151EXPLOITING THE SEMANTIC WEB FOR UNSUPERVISED SPOKEN LANGUAGE UNDERSTANDING
1141FRAME-BASED PHONOTACTIC LANGUAGE IDENTIFICATION
1112GENERATING GRAMMAR QUESTIONS USING CORPUS DATA IN L2 LEARNING
1037IMPROVED SEMANTIC RETRIEVAL OF SPOKEN CONTENT BY LANGUAGE MODELS ENHANCED WITH ACOUSTIC SIMILARITY GRAPH
1054IMPROVING LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION BY COMBINING GMM-BASED AND RESERVOIR-BASED ACOUSTIC MODELING
1066IMPROVING WIDEBAND SPEECH RECOGNITION USING MIXED-BANDWIDTH TRAINING DATA IN CD-DNN-HMM
1090INCORPORATING SYLLABLE DURATION INTO LINE-DETECTION-BASED SPOKEN TERM DETECTION
1131INTENT TRANSFER IN SPEECH-TO-SPEECH MACHINE TRANSLATION
1078JOINT LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING
1009LEXICAL ENTRAINMENT AND SUCCESS IN STUDENT ENGINEERING GROUPS
1085LOCALIZED DETECTION OF SPEECH RECOGNITION ERRORS
1060MEDIAPARL: BILINGUAL MIXED LANGUAGE ACCENTED SPEECH DATABASE
1183MODELING INTENSITY CONTOURS AND THE INTERACTION OF PITCH AND INTENSITY TO IMPROVE AUTOMATIC PROSODIC EVENT DETECTION AND CLASSIFICATION
1128MODELING MULTIWORD PHRASES WITH CONSTRAINED PHRASE TREES FOR IMPROVED TOPIC MODELING OF CONVERSATIONAL SPEECH
1120N-BEST ERROR SIMULATION FOR TRAINING SPOKEN DIALOGUE SYSTEMS
1144NOISY CHANNEL ADAPTATION IN LANGUAGE IDENTIFICATION
1042ON THE GENERALIZATION OF SHANNON ENTROPY FOR SPEECH RECOGNITION
1077ON THE USE OF PHONE LOG-LIKELIHOOD RATIOS AS FEATURES IN SPOKEN LANGUAGE RECOGNITION
1159OPTIMIZATION OF THE DET CURVE IN SPEAKER VERIFICATION
1049PERFORMANCE IMPROVEMENT OF AUTOMATIC PRONUNCIATION ASSESSMENT IN A NOISY CLASSROOM
1058PERSONALIZED LANGUAGE MODELING BY CROWD SOURCING WITH SOCIAL NETWORK DATA FOR VOICE ACCESS OF CLOUD APPLICATIONS
1087POLICY OPTIMISATION OF POMDP-BASED DIALOGUE SYSTEMS WITHOUT STATE SPACE COMPRESSION
1149POMDP-BASED LET'S GO SYSTEM FOR SPOKEN DIALOG CHALLENGE
1034REACTIVE AND CONTINUOUS CONTROL OF HMM-BASED SPEECH SYNTHESIS
1063REALISTIC ANSWER VERIFICATION: AN ANALYSIS OF USER ERRORS IN A SENTENCE-REPETITION TASK
1061RECOGNITION RATE ESTIMATION BASED ON WORD ALIGNMENT NETWORK AND DISCRIMINATIVE ERROR TYPE CLASSIFICATION
1137RECOVERY OF ACRONYMS, OUT-OF-LATTICE WORDS AND PRONUNCIATIONS FROM PARALLEL MULTILINGUAL SPEECH
1032REINFORCEMENT LEARNING FOR SPOKEN DIALOGUE SYSTEMS USING OFF-POLICY NATURAL GRADIENT METHOD
1101ROBUST DETECTION OF VOICED SEGMENTS IN SAMPLES OF EVERYDAY CONVERSATIONS USING UNSUPERVISED HMMS
1013SIMULTANEOUS FEATURE SELECTION AND PARAMETER OPTIMIZATION FOR TRAINING OF DIALOG POLICY BY REINFORCEMENT LEARNING
1079SPEAKER DIARIZATION AND LINKING OF LARGE CORPORA
1146SPEECH-BASED EMOTION CLASSIFICATION USING MULTICLASS SVM WITH HYBRID KERNEL AND THRESHOLDING FUSION
1091STATISTICAL METHODS FOR VARYING THE DEGREE OF ARTICULATION IN NEW HMM-BASED VOICES
1104STATISTICAL SEMANTIC INTERPRETATION MODELING FOR SPOKEN LANGUAGE UNDERSTANDING WITH ENRICHED SEMANTIC FEATURES
1038SYLLABLE-BASED PROSODIC ANALYSIS OF AMHARIC READ SPEECH
1133SYNTHESIZING EXPRESSIVE SPEECH FROM AMATEUR AUDIOBOOK RECORDINGS
1160THE BAVIECA OPEN-SOURCE SPEECH RECOGNITION TOOLKIT
1142THE FAU VIDEO LECTURE BROWSER SYSTEM
1114THE LANGUAGE-INDEPENDENT BOTTLENECK FEATURES
1019TOPIC N-GRAM COUNT LANGUAGE MODEL ADAPTATION FOR SPEECH RECOGNITION
1121TOWARDS A NEW SPEECH EVENT DETECTION APPROACH FOR LANDMARK-BASED SPEECH RECOGNITION
1016TRAIN&ALIGN: A NEW ONLINE TOOL FOR AUTOMATIC PHONETIC ALIGNMENT
1105TRANSCRIPTION OF MULTI-GENRE MEDIA ARCHIVES USING OUT-OF-DOMAIN DATA
1147TWO-LAYER MUTUALLY REINFORCED RANDOM WALK FOR IMPROVED MULTI-PARTY MEETING SUMMARIZATION
1109UNSUPERVISED CROSS-LINGUAL KNOWLEDGE TRANSFER IN DNN-BASED LVCSR
1102USE OF KERNEL DEEP CONVEX NETWORKS AND END-TO-END LEARNING FOR SPOKEN LANGUAGE UNDERSTANDING
1023USING RHYTHMIC FEATURES FOR JAPANESE SPOKEN TERM DETECTION
1164USING SYNTACTIC AND CONFUSION NETWORK STRUCTURE FOR OUT-OF-VOCABULARY WORD DETECTION
1028WHAT MAKES THIS VOICE SOUND SO BAD? A MULTIDIMENSIONAL ANALYSIS OF STATE-OF-THE-ART TEXT-TO-SPEECH SYSTEMS
1027WORD SEGMENTATION THROUGH CROSS-LINGUAL WORD-TO-PHONEME ALIGNMENT