| Title |
A
Metapattern-Based Automated Discovery Loop for Integrated Data Mining |
| Type |
Award |
| NSF
Org |
IRI |
| Latest
Amendment |
|
| Date |
May
7, 1996 |
| File |
a9529615 |
| Award
Number |
9529615 |
| Award
Instr |
Continuing
Grant |
| Prgm
Manager |
Maria
Zemankova |
| Start
Date |
June
1, 1996 |
| Expires
Expected |
May
31, 1999 (Estimated) |
| Total
Amt. |
:
$279,253 (Estimated) |
| Investigator |
Wei-Min
Shen shen@isi.edu |
| Sponsor |
U
of Southern California,University Park,Los Angeles, CA 900891147
213/740-2934 |
| NSF
Program |
6855
DATABASE & EXPERT SYSTEMS |
| Fld
Applictn |
0104000
Information Systems |
| Abstract
: |
This
research is developing a metapattern-based discovery loop for integrated
data mining. Metapatterns (also known as metaqueries) are second-order,
declarative expressions that specify the types of patterns to be
discovered and assist humans in focusing on more fruitful search
directions. The discovery loop is a search engine that integrates
deduction, induction, and external guidance from humans, as well
as internal guidance of inter-component dependencies. Given a completely
newdatabase, the system first generates an initial set of the most
general metapatterns based on the meta-information of the database,
and then executes these metapatterns against the database to discover
actual patterns. Based on the results, new metapatterns are dynamically
generated by adding more constraints to the more plausible metapatterns.
In this iterative process, human discovers can analyze, create,
select, and execute metapatterns, or instruct the system to pursue
metapatterns on its own. This ability not only makes the process
of data mining moreefficient and productive (the more expert users
can use the system for inspiration of better metapatterns, and the
less expert users can learn how to perform data mining in a particular
domain by observation), but also provides a new method for unsupervised
learning of probabilistic,relation-based patterns. |
|
 |