Weixiong Zhang
Department of Computer Science, Washington University in St. Louis


"Build a Dictionary, Learn a Grammar, Discover Hidden Motifs, and Decipher a Stego Script"

4/23/2004: 10:30am - 12:00pm
10th Floor Conference Room

Abstract: Steganography, or information hiding, conceals the very existence of messages in order to protect their confidentiality. We consider the problem of deciphering a stego script, a text in which secret messages are embedded within a covertext, and of identifying the vocabulary used in the messages, with no prior knowledge of the vocabulary or grammar in which the script was written. Our research was motivated by the problem of identifying conserved non-coding functional elements in genome sequences, which we hypothesize to be constructed by nature using a dictionary of words (motifs) and a set of grammatical rules for constructing long words from short ones. Our approach identifies significant motifs, which form a dictionary, while learning a stochastic grammar from a stego script, and then applies the dictionary and grammar to identify the embedded secret messages. We apply our algorithm, called WordSpy, to recover the most probable text of the first ten chapters of a novel embedded in a stego script, and to identify transcription factor binding motifs in the upstream regions of ~800 genes in budding yeast. This is joint work with Guandong Wang.
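
As a rough illustration of the final step described in the abstract (applying a dictionary to pick out embedded words), the following toy sketch finds the highest-scoring split of a script into dictionary words and single background characters by dynamic programming. It is not the WordSpy algorithm: the function name `decode`, the `word_probs` table, and the `background_prob` parameter are assumptions made up for this example, the scores are illustrative rather than a normalized probability model, and the dictionary is given in advance instead of being learned from the script.

```python
import math


def decode(script, word_probs, background_prob=0.02):
    """Return (start, word) pairs on the best-scoring segmentation.

    Hypothetical toy model: each position is covered either by a dictionary
    word (scored by word_probs[word]) or by one background character
    (scored by background_prob).
    """
    n = len(script)
    best = [-math.inf] * (n + 1)   # best[i]: best log-score of script[:i]
    back = [None] * (n + 1)        # back[i]: (start, word or None) used to reach i
    best[0] = 0.0
    max_len = max(map(len, word_probs), default=0)

    for end in range(1, n + 1):
        # Option 1: the last character is background (covertext) noise.
        best[end] = best[end - 1] + math.log(background_prob)
        back[end] = (end - 1, None)
        # Option 2: a dictionary word ends at this position.
        for length in range(1, min(max_len, end) + 1):
            start = end - length
            prob = word_probs.get(script[start:end])
            if prob is not None:
                score = best[start] + math.log(prob)
                if score > best[end]:
                    best[end] = score
                    back[end] = (start, script[start:end])

    # Trace back from the end, collecting only the dictionary words.
    found = []
    pos = n
    while pos > 0:
        start, word = back[pos]
        if word is not None:
            found.append((start, word))
        pos = start
    found.reverse()
    return found
```

For example, `decode("xqthezzcatkk", {"the": 0.3, "cat": 0.2, "at": 0.1})` reports "the" at offset 2 and "cat" at offset 7, preferring "cat" over a background "c" followed by "at" because the single word scores higher.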

About Weixiong Zhang: Dr. Weixiong Zhang is an Associate Professor of computer science and genetics at Washington University in St. Louis. He received his B.S. and M.S. in computer engineering from Tsinghua University, Beijing, China, and his Ph.D. in computer science from UCLA. From 1994 to 2000, he was a Senior Scientist at the Information Sciences Institute of USC and a research assistant professor at USC. Dr. Zhang's research areas include artificial intelligence (e.g., heuristic search and distributed multiagent systems), combinatorial optimization (e.g., TSP and Boolean satisfiability), and computational biology (e.g., RNA folding, targeted gene finding, transcription factor binding motif finding, noncoding small RNA genes, and gene regulatory networks).


Last updated: Mon Jun 19 17:44:06 2006