List of Accepted Papers
The following is the list of accepted SLT 2012 papers, sorted by paper title. You can use your web browser's search feature to find your paper number. Notifications have also been sent to all authors by email. If you have not received your notification, please contact us at papers@slt2012.org.
| Paper number | Title |
| --- | --- |
| 1130 | A COMPARISON-BASED APPROACH TO MISPRONUNCIATION DETECTION |
| 1136 | A CRITICAL ANALYSIS OF TWO STATISTICAL SPOKEN DIALOG SYSTEMS IN PUBLIC USE |
| 1088 | A GRAPHEME-BASED METHOD FOR AUTOMATIC ALIGNMENT OF SPEECH AND TEXT DATA |
| 1047 | A NOISE-ROBUST SPEECH RECOGNITION METHOD COMPOSED OF WEAK NOISE SUPPRESSION AND WEAK VECTOR TAYLOR SERIES ADAPTATION |
| 1044 | A NONPARAMETRIC BAYESIAN APPROACH TO LEARNING MULTIMODAL INTERACTION MANAGEMENT |
| 1127 | A RERANKING APPROACH FOR RECOGNITION AND CLASSIFICATION OF SPEECH INPUT IN CONVERSATIONAL DIALOGUE SYSTEMS |
| 1113 | ACOUSTIC MODELING FOR UNDER-RESOURCED LANGUAGES BASED ON VECTORIAL HMM-STATES REPRESENTATION USING SUBSPACE GAUSSIAN MIXTURE MODELS |
| 1174 | ACTIVE LEARNING FOR ACCENT ADAPTATION IN AUTOMATIC SPEECH RECOGNITION |
| 1178 | ADAPTATION OF CONTEXT-DEPENDENT DEEP NEURAL NETWORKS FOR AUTOMATIC SPEECH RECOGNITION |
| 1123 | AFFECTIVE EVALUATION OF A MOBILE MULTIMODAL DIALOGUE SYSTEM USING BRAIN SIGNALS |
| 1064 | AMERICAN SIGN LANGUAGE FINGERSPELLING RECOGNITION WITH PHONOLOGICAL FEATURE-BASED TANDEM MODELS |
| 1068 | AN AUTOMATIC PITCH ACCENT FEEDBACK SYSTEM FOR ENGLISH LEARNERS WITH ADAPTATION OF AN ENGLISH CORPUS SPOKEN BY KOREANS |
| 1125 | ANALYSIS OF SPEECH TRANSCRIPTS TO PREDICT WINNERS OF U.S. PRESIDENTIAL AND VICE-PRESIDENTIAL DEBATES |
| 1086 | AUDIO-VISUAL FEATURE INTEGRATION BASED ON PIECEWISE LINEAR TRANSFORMATION FOR NOISE ROBUST AUTOMATIC SPEECH RECOGNITION |
| 1167 | AUTOMATIC CHINESE PRONUNCIATION ERROR DETECTION USING SVM WITH STRUCTURAL FEATURES |
| 1139 | AUTOMATIC CLASSIFICATION OF UNEQUAL LEXICAL STRESS PATTERNS USING MACHINE LEARNING ALGORITHMS |
| 1015 | AUTOMATIC DETECTION AND CORRECTION OF SYNTAX-BASED PROSODY ANNOTATION ERRORS |
| 1156 | AUTOMATIC TRANSCRIPTION OF ACADEMIC LECTURES FROM DIVERSE DISCIPLINES |
| 1036 | CLASS-BASED SPEECH RECOGNITION USING A MAXIMUM DISSIMILARITY CRITERION AND A TOLERANCE CLASSIFICATION MARGIN |
| 1069 | COMBINING CEPSTRAL NORMALIZATION AND COCHLEAR IMPLANT-LIKE SPEECH PROCESSING FOR MICROPHONE ARRAY-BASED SPEECH RECOGNITION |
| 1033 | COMBINING CRITERIA FOR THE DETECTION OF INCORRECT ENTRIES OF NON-NATIVE SPEECH IN THE CONTEXT OF FOREIGN LANGUAGE LEARNING |
| 1076 | COMBINING MULTIPLE TRANSLATION SYSTEMS FOR SPOKEN LANGUAGE UNDERSTANDING PORTABILITY |
| 1071 | COMPARISON OF ADAPTATION METHODS FOR GMM-SVM BASED SPEECH EMOTION RECOGNITION |
| 1176 | CONTEXT DEPENDENT RECURRENT NEURAL NETWORK LANGUAGE MODEL |
| 1083 | CONTEXT-DEPENDENT DEEP NEURAL NETWORKS FOR AUDIO INDEXING OF REAL-LIFE DATA |
| 1161 | CROWDSOURCING THE ACQUISITION OF NATURAL LANGUAGE CORPORA: METHODS AND OBSERVATIONS |
| 1180 | DEEP-LEVEL ACOUSTIC-TO-ARTICULATORY MAPPING FOR DBN-HMM BASED PHONE RECOGNITION |
| 1035 | DISCRIMINATIVE SPOKEN LANGUAGE UNDERSTANDING USING WORD CONFUSION NETWORKS |
| 1158 | ECOLOGICAL VALIDITY AND THE EVALUATION OF SPEECH SUMMARIZATION QUALITY |
| 1065 | EFFICIENT PRIOR AND INCREMENTAL BEAM WIDTH CONTROL TO SUPPRESS EXCESSIVE SPEECH RECOGNITION TIME BASED ON SCORE RANGE ESTIMATION |
| 1155 | EMPLOYING BOOSTING TO COMPARE CUES TO VERBAL FEEDBACK IN MULTI-LINGUAL DIALOG |
| 1185 | EVALUATING THE EFFECT OF NORMALIZING INFORMAL TEXT ON TTS OUTPUT |
| 1150 | EXEMPLAR-BASED VOICE CONVERSION IN NOISY ENVIRONMENT |
| 1175 | EXPLOITING LOUDNESS DYNAMICS IN STOCHASTIC MODELS OF TURN-TAKING |
| 1151 | EXPLOITING THE SEMANTIC WEB FOR UNSUPERVISED SPOKEN LANGUAGE UNDERSTANDING |
| 1141 | FRAME-BASED PHONOTACTIC LANGUAGE IDENTIFICATION |
| 1112 | GENERATING GRAMMAR QUESTIONS USING CORPUS DATA IN L2 LEARNING |
| 1037 | IMPROVED SEMANTIC RETRIEVAL OF SPOKEN CONTENT BY LANGUAGE MODELS ENHANCED WITH ACOUSTIC SIMILARITY GRAPH |
| 1054 | IMPROVING LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION BY COMBINING GMM-BASED AND RESERVOIR-BASED ACOUSTIC MODELING |
| 1066 | IMPROVING WIDEBAND SPEECH RECOGNITION USING MIXED-BANDWIDTH TRAINING DATA IN CD-DNN-HMM |
| 1090 | INCORPORATING SYLLABLE DURATION INTO LINE-DETECTION-BASED SPOKEN TERM DETECTION |
| 1131 | INTENT TRANSFER IN SPEECH-TO-SPEECH MACHINE TRANSLATION |
| 1078 | JOINT LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING |
| 1009 | LEXICAL ENTRAINMENT AND SUCCESS IN STUDENT ENGINEERING GROUPS |
| 1085 | LOCALIZED DETECTION OF SPEECH RECOGNITION ERRORS |
| 1060 | MEDIAPARL: BILINGUAL MIXED LANGUAGE ACCENTED SPEECH DATABASE |
| 1183 | MODELING INTENSITY CONTOURS AND THE INTERACTION OF PITCH AND INTENSITY TO IMPROVE AUTOMATIC PROSODIC EVENT DETECTION AND CLASSIFICATION |
| 1128 | MODELING MULTIWORD PHRASES WITH CONSTRAINED PHRASE TREES FOR IMPROVED TOPIC MODELING OF CONVERSATIONAL SPEECH |
| 1120 | N-BEST ERROR SIMULATION FOR TRAINING SPOKEN DIALOGUE SYSTEMS |
| 1144 | NOISY CHANNEL ADAPTATION IN LANGUAGE IDENTIFICATION |
| 1042 | ON THE GENERALIZATION OF SHANNON ENTROPY FOR SPEECH RECOGNITION |
| 1077 | ON THE USE OF PHONE LOG-LIKELIHOOD RATIOS AS FEATURES IN SPOKEN LANGUAGE RECOGNITION |
| 1159 | OPTIMIZATION OF THE DET CURVE IN SPEAKER VERIFICATION |
| 1049 | PERFORMANCE IMPROVEMENT OF AUTOMATIC PRONUNCIATION ASSESSMENT IN A NOISY CLASSROOM |
| 1058 | PERSONALIZED LANGUAGE MODELING BY CROWD SOURCING WITH SOCIAL NETWORK DATA FOR VOICE ACCESS OF CLOUD APPLICATIONS |
| 1087 | POLICY OPTIMISATION OF POMDP-BASED DIALOGUE SYSTEMS WITHOUT STATE SPACE COMPRESSION |
| 1149 | POMDP-BASED LET'S GO SYSTEM FOR SPOKEN DIALOG CHALLENGE |
| 1034 | REACTIVE AND CONTINUOUS CONTROL OF HMM-BASED SPEECH SYNTHESIS |
| 1063 | REALISTIC ANSWER VERIFICATION: AN ANALYSIS OF USER ERRORS IN A SENTENCE-REPETITION TASK |
| 1061 | RECOGNITION RATE ESTIMATION BASED ON WORD ALIGNMENT NETWORK AND DISCRIMINATIVE ERROR TYPE CLASSIFICATION |
| 1137 | RECOVERY OF ACRONYMS, OUT-OF-LATTICE WORDS AND PRONUNCIATIONS FROM PARALLEL MULTILINGUAL SPEECH |
| 1032 | REINFORCEMENT LEARNING FOR SPOKEN DIALOGUE SYSTEMS USING OFF-POLICY NATURAL GRADIENT METHOD |
| 1101 | ROBUST DETECTION OF VOICED SEGMENTS IN SAMPLES OF EVERYDAY CONVERSATIONS USING UNSUPERVISED HMMS |
| 1013 | SIMULTANEOUS FEATURE SELECTION AND PARAMETER OPTIMIZATION FOR TRAINING OF DIALOG POLICY BY REINFORCEMENT LEARNING |
| 1079 | SPEAKER DIARIZATION AND LINKING OF LARGE CORPORA |
| 1146 | SPEECH-BASED EMOTION CLASSIFICATION USING MULTICLASS SVM WITH HYBRID KERNEL AND THRESHOLDING FUSION |
| 1091 | STATISTICAL METHODS FOR VARYING THE DEGREE OF ARTICULATION IN NEW HMM-BASED VOICES |
| 1104 | STATISTICAL SEMANTIC INTERPRETATION MODELING FOR SPOKEN LANGUAGE UNDERSTANDING WITH ENRICHED SEMANTIC FEATURES |
| 1038 | SYLLABLE-BASED PROSODIC ANALYSIS OF AMHARIC READ SPEECH |
| 1133 | SYNTHESIZING EXPRESSIVE SPEECH FROM AMATEUR AUDIOBOOK RECORDINGS |
| 1160 | THE BAVIECA OPEN-SOURCE SPEECH RECOGNITION TOOLKIT |
| 1142 | THE FAU VIDEO LECTURE BROWSER SYSTEM |
| 1114 | THE LANGUAGE-INDEPENDENT BOTTLENECK FEATURES |
| 1019 | TOPIC N-GRAM COUNT LANGUAGE MODEL ADAPTATION FOR SPEECH RECOGNITION |
| 1121 | TOWARDS A NEW SPEECH EVENT DETECTION APPROACH FOR LANDMARK-BASED SPEECH RECOGNITION |
| 1016 | TRAIN&ALIGN: A NEW ONLINE TOOL FOR AUTOMATIC PHONETIC ALIGNMENT |
| 1105 | TRANSCRIPTION OF MULTI-GENRE MEDIA ARCHIVES USING OUT-OF-DOMAIN DATA |
| 1147 | TWO-LAYER MUTUALLY REINFORCED RANDOM WALK FOR IMPROVED MULTI-PARTY MEETING SUMMARIZATION |
| 1109 | UNSUPERVISED CROSS-LINGUAL KNOWLEDGE TRANSFER IN DNN-BASED LVCSR |
| 1102 | USE OF KERNEL DEEP CONVEX NETWORKS AND END-TO-END LEARNING FOR SPOKEN LANGUAGE UNDERSTANDING |
| 1023 | USING RHYTHMIC FEATURES FOR JAPANESE SPOKEN TERM DETECTION |
| 1164 | USING SYNTACTIC AND CONFUSION NETWORK STRUCTURE FOR OUT-OF-VOCABULARY WORD DETECTION |
| 1028 | WHAT MAKES THIS VOICE SOUND SO BAD? A MULTIDIMENSIONAL ANALYSIS OF STATE-OF-THE-ART TEXT-TO-SPEECH SYSTEMS |
| 1027 | WORD SEGMENTATION THROUGH CROSS-LINGUAL WORD-TO-PHONEME ALIGNMENT |