Artificial intelligence applications and innovations by brammer

Acknowledgments Paper Sessions Applications 1 Artificial Intelligence Systems in Micromechanics FELIPE LARA-ROSANO, ERNST KUSSUL, TATIANA BAIDYK, LEOPOLDO RUIZ, ALBERTO CABALLERO AND GRA

Trang 2

AND INNOVATIONS

Trang 3

IFIP was founded in 1960 under the auspices of UNESCO, following the First World Computer Congress held in Paris the previous year An umbrella organization for societies working in information processing, IFIP’s aim is two-fold: to support information processing within its member countries and to encourage technology transfer to developing nations As its mission statement clearly states,

IFIP’s mission is to be the leading, truly international, apolitical

organization which encourages and assists in the development, exploitation

and application of information technology for the benefit of all people.

IFIP is a non-profitmaking organization, run almost solely by 2500 volunteers It operates through a number of technical committees, which organize events and publications IFIP’s events range from an international congress to local seminars, but the most important are:

The IFIP World Computer Congress, held every second year;

Open conferences;

Working conferences.

The flagship event is the IFIP World Computer Congress, at which both invited and contributed papers are presented Contributed papers are rigorously refereed and the rejection rate is high.

As with the Congress, participation in the open conferences is open to all and papers may be invited or submitted Again, submitted papers are stringently refereed.

The working conferences are structured differently They are usually run by a working group and attendance is small and by invitation only Their purpose is to create an atmosphere conducive to innovation and development Refereeing is less rigorous and papers are subjected to extensive group discussion.

Publications arising from IFIP events vary The papers presented at the IFIP World Computer Congress and at open conferences are published as conference proceedings, while the results

of the working conferences are often published as collections of selected and edited papers Any national society whose primary activity is in information may apply to become a full member of IFIP, although full membership is restricted to one society per country Full members are entitled to vote at the annual General Assembly, National societies preferring a less committed involvement may apply for associate or corresponding membership Associate members enjoy the same benefits as full members, but without voting rights Corresponding members are not represented in IFIP bodies Affiliated membership is open to non-national societies, and individual and honorary membership schemes are also offered.

Trang 4

ARTIFICIAL INTELLIGENCE APPLICATIONS AND

INNOVATIONS

IFIP 18th World Computer Congress

TC12 First International Conference on

Artificial Intelligence Applications and Innovations (AIAI-2004) 22–27 August 2004

University of Belgrade, Serbia and Montenegro

KLUWER ACADEMIC PUBLISHERS

NEW YORK, BOSTON, DORDRECHT, LONDON, MOSCOW

Trang 5

Print ISBN: 1-4020-8150-2

No part of this eBook may be reproduced or transmitted in any form or by any means, electronic, mechanical, recording, or otherwise, without written consent from the Publisher

Created in the United States of America

Boston

Visit Springer's eBookstore at: http://www.ebooks.kluweronline.com

and the Springer Global Website Online at: http://www.springeronline.com

Trang 6

Acknowledgments

Paper Sessions

Applications 1

Artificial Intelligence Systems in Micromechanics

FELIPE LARA-ROSANO, ERNST KUSSUL, TATIANA BAIDYK,

LEOPOLDO RUIZ, ALBERTO CABALLERO AND GRACIELA VELASCO

Integrating Two Artificial Intelligence Theories in a Medical

Diagnosis Application

H ADRIAN P ETER AND W AYNE G OODRIDGE

Artificial Intelligence and Law

H UGO C H OESCHL AND V ÂNIA B ARCELLOS

Virtual Market Environment for Trade

P AUL B OGG AND P ETER D ALMARIS

xixiii

1

11

25

35

Trang 7

Neural Networks and Fuzzy Systems

An Artificial Neural Networks Approach to the Estimation of

Physical Stellar Parameters

A.R ODRIGUEZ, I.C ARRICAJO, C.D AFONTE, B.A RCAY AND

M.M ANTEIGA

Evolutionary Robot Behaviors Based on Natural Selection and

Neural Network

J INGAN Y ANG, Y ANBIN Z HUANG AND H ONGYAN W ANG

Control of Overhead Crane by Fuzzy-PID with Genetic Optimisation

A S OUKKOU, A . K HELLAF AND S LEULMI

Creative Design of Fuzzy Logic Controller

L OTFI H AMROUNI AND A DEL M A LIMI

On-Line Extraction of Fuzzy Rules in a Wastewater Treatment Plant

J V ICTOR RAMOS, C G ONÇALVES AND A D OURADO

Agents

An Autonomous Intelligent Agent Architecture Based on

Constructivist AI

F ILIPO S TUDZINSKI P EROTTO, R OSA V ICARI

AND L UÍS O TÁVIO A LVARES

Finding Manufacturing Expertise Using Ontologies and

Cooperative Agents

O LGA N ABUCO, M AURO K OYAMA, F RANCISCO P EREIRA

AND K HALIL D RIRA

Using Agents in the Exchange of Product Data

U DO K ANNENGIESSER AND J OHN S G ERO

Applications 2

A Pervasive Identification and Adaptation System for the

Smart House

P AULO F F R OSA, S ANDRO S L IMA, W AGNER T B OTELHO,

A LEXANDRE F N ASCIMENTO AND M AX S ILVA A LALUNA

Trang 8

Deductive Diagnosis of Digital Circuits

J J A LFERES, F A ZEVEDO, P B ARAHONA, C V D AMÁSIO

AND T S WIFT

Verification of Nasa Emergent Systems

C HRISTOPHER R OUFF, A MY V ANDERBILT,

W ALT T RUSZKOWSKI, J AMES R ASH AND M IKE H INCHEY

V AGAN T ERZIYAN AND O LEKSANDRA V ITKO

Using Organisational Structures Emergence for Maintaining

Functional Integrity In Embedded Systems Networks

J EAN- P AUL J AMONT AND M ICHEL O CCELLO

Efficient Attribute Reduction Algorithm

Z HONGZHI S HI, S HAOHUI L IU AND Z HENG Z HENG

Using Relative Logic for Pattern Recognition

J ULIUSZ L K ULIKOWSKI

Intelligent Tutoring and Collaboration

MathTutor: A Multi-Agent Intelligent Tutoring System

J ANETTE C ARDOSO, G UILHERME B ITTENCOURT,

L UCIANA F RIGO, E LIANE P OZZEBON AND A DRIANA P OSTAL

Analysis and Intelligent Support of Learning Communities in

Semi-structured Discussion Environments

A NDREAS H ARRER

An Adaptive Assessment System to Evaluate Student Ability Level

A NTONELLA C ARBONARO, G IORGIO C ASADEI AND

Trang 9

Forming the Optimal Team of Experts for Collaborative Work

A CHIM K ARDUCK AND A MADOU S IENOU

Internet

Impact on Performance of Hypertext Classification of Selective

Rich HTML Capture

H OUDA B ENBRAHIM AND M AX B RAMER

Introducing a Star Topology into Latent Class Models for

Collaborative Filtering

AND

Dialoguing with an Online Assistant in a Financial Domain:

The VIP-Advisor Approach

J OSEFA Z H ERNANDEZ, A NA G ARCIA- S ERRANO AND

J AVIER C ALLE

An Agency for Semantic-Based Automatic Discovery of

Web Services

S IMONA C OLUCCI, T OMMASO D I N OIA,

E UGENIO D I S CIASCIO, F RANCESCO M.D ONINI,

M ARINA M ONGIELLO, G IACOMO P ISCITELLI AND

G IANVITO R OSSI

Genetic Algorithms

GESOS: A Multi-Objective Genetic Tool for Project Management

Considering Technical and Non-Technical Constraints

C LAUDE B ARON, S AMUEL R OCHET AND D ANIEL E STEVE

Using Genetic Algorithms and Tabu Search Parallel Models to

Solve the Scheduling Problem

P EDRO P INACHO, M AURICIO S OLAR, M ARIO I NOSTROZA

AND R OSA M UÑOZ

Modelling Document Categories by Evolutionary Learning of

Trang 10

Ontologies and Data Mining

ODEVAL: A Tool for Evaluating RDF(S), DAML+OIL

and OWL Concept Taxonomies

Ó SCAR C ORCHO, A SUNCIÓN G ÓMEZ- P ÉREZ,

R AFAEL G ONZÁLEZ- C ABERO AND M C ARMEN S UÁREZ- F IGUEROA

AIR - A Platform for Intelligent Systems

D RAGAN D JURIC, D RAGAN G ASEVIC AND V IOLETA D AMJANOVIC

SwissAnalyst

O P OVEL AND C G IRAUD- C ARRIER

Data Mining by MOUCLAS Algorithm for Petroleum Reservoir

Characterization from Well Logging Data

Y ALEI H AO, M ARKUS S TUMPTNER, G ERALD Q UIRCHMAYR AND

Q ING H E

Reasoning and Scheduling

Verification of Procedural Reasoning Systems (PRS) Programs

Using Coloured Petri Nets (CPN)

R ICARDO W AGNER D E A RAÚJO AND A DELARDO A DELINO

D E M EDEIROS

Online Possibilistic Diagnosis Based on Expert Knowledge for

Engine Dyno Test Benches

O DE M OUZON, X G UÉRANDEL, D D UBOIS, H.P RADE

AND S B OVERIE

CBR and Micro-Architecture Anti-Patterns Based Software

Design Improvement

T IE F ENG, J IACHEN Z HANG, H ONGYUAN W ANG AND X IAN W ANG

A Decision Support System (DSS) for the Railway Scheduling

Problem

L I NGOLOTTI, P T ORMOS, A L OVA, F B ARBER, M.A S ALIDO

AND M.A BRIL

An Interactive Multicriteria Optimisation Approach to Scheduling

M ARTIN J OSEF G EIGER AND S ANJA P ETROVIC

Trang 12

The papers in this volume comprise the refereed proceedings of the FirstInternational Conference on Artificial Intelligence Applications andInnovations (AIAI-2004), which formed part of the 18th World ComputerCongress of IFIP, the International Federation for Information Processing(WCC-2004), in Toulouse, France in August 2004.

The conference is organised by the IFIP Technical Committee on ArtificialIntelligence (Technical Committee 12) and its Working Group 12.5(Artificial Intelligence Applications) Further information about both can befound on the website at http://www.ifiptc12.org

A very promising sign of the growing importance of Artificial Intelligencetechniques in practical applications is the large number of submissionsreceived this time - more than twice the number for the ArtificialIntelligence stream of the last World Computer Congress two years ago Allpapers were reviewed by at least three members of the ProgrammeCommittee The best 40 were selected for the conference and are included inthis volume The international nature of IFIP is amply reflected in the largenumber of countries represented here

The conference also featured an invited talk by Eunika Mercier-Laurent and

a Symposium on Professional Practice in Artificial Intelligence, which ranalongside the refereed papers

I should like to thank the joint conference chairs, Professor John Debenhamand Dr Eunika Mercier-Laurent and my co-program chair Dr Vladan

Trang 13

Devedzic for all their efforts in organising the conference and the members

of our programme committee for reviewing an unexpectedly large number ofpapers to a tight deadline I should also like to thank my wife Dawn for herhelp in editing this volume of proceedings

This is the first in a new series of conferences dedicated to real-worldapplications of AI around the world The wide range and importance of theseapplications is clearly indicated by the papers in this volume Both are likely

to increase still further as time goes by and we intend to reflect thesedevelopments in our future conferences

Max Bramer

Chair, IFIP Technical Committee on Artificial Intelligence

Trang 14

Conference Organising Committee

Conference General Chairs

John Debenham (University of Technology, Sydney, Australia)

Eunika Mercier-Laurent (Association Francaise pour l’IntelligenceArtificielle, France)

Conference Program Chairs

Max Bramer (University of Portsmouth, United Kingdom)

Vladan Devedzic (University of Belgrade, Serbia and Montenegro)

Programme Committee

Agnar Aamodt (Norway)

Luigia Carlucci Aiello (Italy)

Adel Alimi (Tunisia)

Lora Aroyo (The Netherlands)

Max Bramer (United Kingdom)

Zdzislaw Bubnicki (Poland)

Weiqin Chen (Norway)

Monica Crubezy (USA)

John Debenham (Australia)

Trang 15

Yves Demazeau (France)

Vladan Devedzic (Yugoslavia)

Rose Dieng (France)

Henrik Eriksson (Sweden)

Ana Garcia-Serrano (Spain)

Nicola Guarino (Italy)

Andreas Harrer (Germany)

Jean-Paul Haton (France)

Timo Honkela (Finland)

Kostas Karpouzis (Greece)

Dusko Katic (Serbia and Montenegro)Ray Kemp (New Zealand)

Kinshuk (New Zealand)

Piet Kommers (The Netherlands)

Jasna Kuljis (United Kingdom)

Ilias Maglogiannis (Greece)

Eunika Mercier-Laurent (France)

Antonija Mitrovic (New Zealand)

Riichiro Mizoguchi (Japan)

Enrico Motta (United Kingdom)

Wolfgang Nejdl (Germany)

Erich Neuhold (Germany)

Bernd Neumann (Germany)

Natasha Noy (USA)

Zeljko Obrenovic (Serbia and Montenegro)Mihaela Oprea (Romania)

Petra Perner (Germany)

Alun Preece (United Kingdom)

Abdel-Badeeh Salem (Egypt)

Demetrios Sampson (Greece)

Pierre-Yves Schobbens (Belgium)

Yuval Shahar (Israel)

Stuart Shapiro (USA)

Derek Sleeman (United Kingdom)

Constantine Spyropoulos (Greece)

Steffen Staab (Germany)

Mia Stern (USA)

Gerd Stumme (Germany)

Valentina Tamma (United Kingdom)Vagan Terziyan (Finland)

Trang 16

Felipe Lara-Rosano, Ernst Kussul, Tatiana Baidyk, Leopoldo Ruiz, AlbertoCaballero, Graciela Velasco

CCADET, UNAM

Abstract: Some of the artificial intelligence (AI) methods could be used to improve the

automation system performance in manufacturing processes However, the implementation of these AI methods in the industry is rather slow, because of the high cost of the experiments with the conventional manufacturing and AI systems To lower the experiment cost in this field, we have developed a special micromechanical equipment, similar to conventional mechanical equipment, but of much smaller size and therefore of lower cost This equipment could be used for evaluation of different AI methods in an easy and inexpensive way The proved methods could be transferred to the industry through appropriate scaling In this paper we describe the prototypes of low cost microequipment for manufacturing processes and some AI method implementations to increase its precision, like computer vision systems based

on neural networks for microdevice assembly, and genetic algorithms for microequipment characterization and microequipment precision increase.

Key words: artificial intelligence, micromechanics, computer vision, genetic algorithms

The development of AI technologies opens an opportunity to use themnot only for conventional applications (expert systems, intelligent data bases[1], technical diagnostics [2,3] etc.), but also for total automation ofmechanical manufacturing Such AI methods as adaptive critic design [4,5],adaptive fuzzy Petri networks [6,7], neural network based computer visionsystems [8-12], etc could be used to solve the automation problems Tocheck this opportunity up, it is necessary to create an experimental factory

Trang 17

with fully automated manufacturing processes This is a very difficult andexpensive task.

To make a very small mechanical microequipment, a new technologywas proposed [13,14] This technology is based on micromachine tools andmicroassembly devices, which can be produced as sequential generations ofmicroequipment Each generation should include equipment (machine-tools,manipulators, assembly devices, measuring instruments, etc.) sufficient formanufacturing an identical equipment set of smaller size Each subsequentequipment generation could be produced by the preceding one Theequipment size of each subsequent generation is smaller than the overall size

of preceding generation

The first-generation microequipment can be produced by conventionallarge-scale equipment Using microequipment of this first generation, asecond microequipment generation having smaller overall sizes can beproduced

We call this approach to mechanical microdevices manufacturingMicroEquipment Technology (MET) [15]

The proposed MET technology has many advantages:

(1) The equipment miniaturization leads to decreasing the occupied space

as well as energy consumption, and, therefore, the cost of the products.(2) The labor costs are bound to decrease due to the reduction ofmaintenance costs and a higher level of automation expected in MET

(3) Miniaturization of equipment by MET results in a decrease of its cost.This is a consequence of the fact that microequipment itself becomes theobject of MET The realization of universal microequipment that is capable

of extended reproduction of itself will allow the manufacture of low-costmicroequipment in a few reproductive acts because of the lowerconsumption of materials, energy, labor, and space in MET Thus theminiaturization of equipment opens the way to a drastic decrease in the unitcost of individual processing

At a lower unit cost of individual micromachining, the most natural way

to achieve high throughput is to parallelize the processes of individualmachining by concurrent use of a great quantity of microequipment of thesame kind Exploitation of that great number of microsized machine-tools isonly feasible with their automatic operation and a highly automated control

of the microfactory as a whole We expect that many useful and provedconcepts, ideas and techniques of automation can be borrowed frommechanical engineering They vary from the principles of factory automation

Trang 18

(FMS and CAM) to the ideas of unified containers and clamping devices andtechniques of numerical control However automation ofmicromanufacturing has peculiarities that will require the special methods ofartificial intelligence.

MICROMECHANICAL FACTORY

Let us consider a general hierarchical structure of the automatic controlsystem for a micromechanical factory The lowest (first) level of the systemcontrols the micromechanical equipment (the micro machine-tools andassembly manipulators), provides the simplest microequipment diagnosticsand the final measurement and testing of production The second level of thecontrol system controls the devices that transport workpieces, tools, parts,and the whole equipment items; coordinates the operation of the lowest leveldevices; provides the intermediate quality inspection of production and themore advanced diagnostics of equipment condition The third control levelcontains the system for the automatic choice of process modes and routes forparts machining The top (fourth) level of the control system performsdetecting of non-standard and alarm situations and decision making,including communication with the operator

We proceed from the assumption that no more than one operator willmanage the microfactory It means that almost all the problems arising atany control level during the production process should be solvedautomatically and that operator must solve only a few problems, that are toocomplex or unusual to be solved automatically

Since any production process is affected by various disturbances, thecontrol system should be an adaptive one Moreover, it should be self-learning, because it is impossible to foresee all kinds of disturbances inadvance AI that is able to construct the self-learning algorithms and tominimize the participation of operator, seem to be especially useful for thistask AI includes different methods for creating autonomous control systems.The neural classifiers will be particularly useful at the lowest level of thecontrol system They could be used for the selection of treatment modes,checking of cutting tool conditions, control of the assembly processes, etc.They allow to make the control system more flexible The system willautomatically compensate for small deviations of production conditions,such as the change of cutting tool shape or external environment parameters,variations in the structure of workpiece materials, etc AI will permit todesign self-learning classifiers and should provide the opportunity to excludethe participation of human operator at this level of control

Trang 19

At the second control level, the AI system should detect all deviationsfrom the normal production process and make decisions about how tomodify the process to compensate for the deviation The compensationshould be made by tuning the parameters of the lower level control systems.The examples of such deviations are the deviations from the productionschedule, failures in some devices, off-standard production, etc At this levelthe AI system should contain the structures in which the interrelations ofproduction process constituents are represented As in the previous case, it isdesirable to have the algorithms working without the supervisor.

The third control level is connected basically with the change ofnomenclature or volume of the production manufactured by the factory It isconvenient to develop such a system so that the set-up costs for a newproduction or the costs to change the production volume should be minimal.The self-learning AI structures formed at the lowest level could provide thebasis for such changes of set-up by selection of the process parameters, thechoice of equipment configuration for machining and assembly, etc At thethird control level the AI structures should detect the similarity of newproducts with the products which were manufactured in the past On thebasis of this similarity, the proposals about the manufacturing schedule,process modes, routing, etc will be automatically formed Then they will bechecked up by the usual computational methods of computer aidedmanufacturing (CAM) The results of the check, as well as the subsequentinformation about the efficiency of decisions made at this level, may be usedfor improving the AI system

The most complicated AI structures should be applied at the top controllevel This AI system level must have the ability to reveal the recent unusualfeatures in the production process, to make the evaluation of possibleinfluence of these new features on the production process, and to makedecisions for changing the control system parameters at the varioushierarchical levels or for calling for the operator’s help At this level, thecontrol system should contain the intelligence knowledge base, which can becreated using the results of the operation of the lower level control systemsand the expert knowledge At the beginning, the expert knowledge ofmacromechanics may be used

At present many methods of AI are successfully used in the industry[16,17] They could be used also for micromechanics But the problems offully automated microfactory creation can not be investigated experimentally

in conventional industry because of the high cost of the experiments Here

we propose to develop low cost micromechanical test bed to solve theseproblems

The prototypes of the first generation microequipment are designed andexamined in the Laboratory of Micromechanics and Mechatronics,

Trang 20

CCADET, UNAM The prototypes use adaptive algorithms of the lowestlevel At present more sophisticated algorithms based on neural networksand genetic algorithms are being developed Below we describe ourexperiments in the area of such algorithms development and applications.

Figure 1 The developed second prototype of the first generation of micromachine tool.

This prototype of the micromachine tool has the size

and is controlled by a PC The axes X and Z have 20 mm of displacement and the Y -axis has 35 mm of displacement; all have the same configuration.

The resolution is per motor step

4.2 Micromanipulators

At present, in the Laboratory of Micromechanics and Mechatronics,CCADET, UNAM the principles, designs and methods of manufacture ofmicromachine tools and micromanipulators corresponding to the first

Trang 21

microequipment generation are developed All these works are accompaniedwith of the prototypes development (Fig.2).

Figure 2 Sequential micromanipulator prototype

4.3 Computer vision system

To obtain a low cost microequipment it is necessary to use low costcomponents Low cost components do not permit us to obtain high absoluteaccuracy of the assembly devices To avoid this drawback we havedeveloped an adaptive algorithm for microassembly using a technical visionsystem (Fig 3)

Figure 3 The prototype of visual controlled assembly system

The main idea of this approach is to replace the stereovision system,which demands two video cameras, for the system with one TV camera forteleconferences, with a cost of 40 dollars, and four light sources Theshadows from the light sources permit us to obtain the 3-D position of the

Trang 22

needle with the microring relative to the hole The microring is to be insertedinto the hole We use a neural classifier to recognize the relative position.The problem of automatic microdevices assembly is very important inmechatronics and micromechanics To obtain the high precision, it isnecessary to use adaptive algorithms on the base of technical vision systems.

We proposed an approach, that permits us to develop the adaptive algorithmsbased on neural networks We consider the conventional pin-hole task It isnecessary to insert the pin into the hole using a low cost technical visionsystem

For this purpose it is necessary to know the displacements (dx, dy, dz) of

the pin tip relative to the hole It is possible to evaluate these displacementswith a stereovision system, which resolves 3D problems The stereovisionsystem demands two TV cameras To simplify the control system wepropose the transformation of 3D into 2D images preserving all theinformation about mutual location of the pin and the hole This approachmakes it possible to use only one TV camera

Four light sources are used to obtain pin shadows Mutual location ofthese shadows and the hole contains all the information about thedisplacements of the pin relative to the hole The displacements in the

horizontal plane (dx, dy) could be obtained directly by displacements of

shadows center points relative to the hole center Vertical displacement ofthe pin may be obtained from the distance between the shadows Tocalculate the displacements it is necessary to have all the shadows in oneimage We capture four images corresponding to each light sourcesequentially, and then we extract contours and superpose four contourimages We use the resulting image to recognize the position of the pinrelative to the hole We developed the neural network system which permits

us to recognize the pin-hole displacements with errors less than 1 pixel[11,12]

4.4 Adaptive Algorithm of the Lowest Level

To compensate for the machine tool errors we have developed a specialalgorithm for the workpiece diameter measurement using the electricalcontact of the workpiece with the measurement disk (Fig 4) Thismeasurement allows us to develop the algorithm for brass needle cutting Weobtained a brass needle with a diameter of and a length of

(Fig 5) almost equal to the Japanese needle [18]

Trang 23

Figure 4 The workpiece with measurement disk

Figure 5 The brass needle with diameter

4.5 Genetic Algorithm for Micromachine Tool

Characterization

To improve the micromachine tool precision it is necessary to correct itserrors To obtain the information about the micromachine tools errors, weuse a two balls scheme for machine tool parameters measurement One ball

is fixed on the special tool support, which is inserted to the chuck Thesecond ball is fixed on the machine tool carriage (Fig 6)

By moving the carriage with the second ball up to the contact with thefirst ball in different positions it is possible to obtain all the neededinformation about the geometrical properties of the machine tool But thegeometrical parameters depend on the contact positions in a verycomplicated manner To resolve the system of nonlinear equations whichrepresent the mentioned dependence we use a genetic algorithm Thisapproach permits us to reduce to one third the micromachine tools errors

Trang 24

5 CONCLUSIONS

AI algorithms could be used to increase the level of manufacturingprocesses automatization The experiments with AI algorithms in realindustry factories are too expensive In this article a low cost test bed for AImethod examinations is proposed This test bed is composed of themicromechanical models of conventional industry devices The prototypes

of micromachine tools and micromanipulators were developed and examinedwith some AI algorithms The test bed examination results show that AIsystems could be proved with low expenses

Figure 6 Ball location in the micromachine tool

Trang 25

Awadallah, M., Morcos, M.: Application of AI tools in fault diagnosis of electrical

machines and drives-an overview IEEE Transactions on Energy Conversion 18, Issue 2

(2003) 245-251

Werbos, P.: Advanced Forecasting Methods for Global Crisis Warning and Models of

Intelligence In: General Systems Yearbook 22 (1977) 25-38

Prokhorov, D., Wunsch, D.: Adaptive Critic Designs IEEE Trans on Neural Networks 8,

N 5 (1997) 997-1007

Xiaoou Li, Lara-Rosano, F.: A weighted fuzzy Petri net model for knowledge learning and

reasoning International Joint Conference on Neural Networks, IJCNN ’99 4 2368 –2372

Xiaoou Li, Wen Yu, Lara-Rosano, F.: Dynamic Knowledge Inference and Learning under Adaptive Fuzzy Petri Net Framework IEEE Transactions on Systems, Man, and

Cybernetics 39, N4 (2000) 442-450

Bottou, L., Cortes, C., Denker, J., Drucker, H., Guyon L., Jackel L., LeCun J., Muller U., Sackinger E., Simard P., Vapnik V.: Comparison of Classifier Methods: a Case Study in Handwritten Digit Recognition In: Proceedings of IAPR International Conference on

Pattern Recognition 2 (1994) 77-82

Fukushima, K Neocognitron: A hierarchical neural network capable of visual pattern

recognition Neural Networks 1 (1988) 119-130

Roska, T., Rodriguez-Vazquez, A.: Toward visual microprocessors Proceedings of the

IEEE 90, Issue 7 (July 2002) 1244-1257

Baidyk, T.: Application of Flat Image Recognition Technique for Automation of Micro Device Production Proceedings of the International Conference on Advanced Intelligent Mechatronics “AIM’01”, Italy (2001) 488-494

Baidyk, T., Kussul, E., Makeyev, O., Caballero, A., Ruiz, L., Carrera, G., Velasco, G.: Flat image recognition in the process of microdevice assembly Pattern Recognition

Kussul, E., Baidyk, T., Ruiz-Huerta, L., Caballero, A., Velasco, G., Kasatkina, L.: Development of Micromachine Tool Prototypes for Microfactories Journal of

Micromechanics and Microengineering 12 (2002) 795-812

Wenxin Liu, Venayagamoorthy, G., Wunsch, D.: A heuristic dynamic programming based power system stabilizer for a turbogenerator in a single machine power system.

IAS Annual Meeting on Industry Applications 1 (2003) 270-276

Croce, F., Delfino, B., et al.: Operations and management of the electric system for industrial plants: an expert system prototype for load-shedding operator assistance IEEE

Transactions on Industry Applications 37, Issue 3 (2001) 701-708

Okazaki, Yu., Kitahara T.: Micro-Lathe Equipped with Closed-Loop Numerical Control Proceedings of the 2-nd International Workshop on Microfactories, Switzerland (2000) 87- 90

Trang 26

INTELLIGENCE THEORIES IN A MEDICAL DIAGNOSIS APPLICATION

Hadrian Peter1, Wayne Goodridge2

1

University of the West Indies, Barbados; 2 Dalhousie University, Canada

Abstract: Reasoning Systems (Inference Mechanisms) and Neural Networks are two

major areas of Artificial Intelligence (AI) The use of case-based reasoning in Artificial Intelligence systems is well known Similarly, the AI literature is replete with papers on neural networks However, there is relatively little research in which the theories of case-based reasoning and neural networks are combined In this paper we integrate the two theories and show how the resulting model is used in a medical diagnosis application An implementation

of our model provides a valuable prototype for medical experts and medical students alike.

Key words: Medical diagnosis, neural networks, case-based reasoning, reasoning system.

Research in Artificial Intelligence (AI) in medicine has relied on progress

in research in knowledge bases and reasoning systems (inferencemechanisms) Over the years many medical diagnosis systems – MYCIN,Iliad, DXplain, CADIAG-II, INTERNIST, QMR, and MDDB, to name afew – of which MYCIN [1, 2] is arguably the most popular, have beendeveloped The predominant form of knowledge in MYCIN is represented as

a set of rules, and the reasoning system used is backward chaining Iliad and

DXplain [3] both use Bayesian reasoning to calculate probabilities of variousdiagnoses CADIAG-II [4] – Computer-Assisted DIAGnosis – is acomputer-assisted consultation system to support the differential diagnosticprocess in internal medicine CADIAG-II uses fuzzy-based reasoning,however, the underlying knowledge base used is not explicitly described in

Trang 27

the literature The knowledge base of INTERNIST, and the strategy used byINTERNIST to address the diagnosis of patients, are described in [5, 6,7, 8].QMR (Quick Medical Reference) [9, 10, 11], a reengineering ofINTERNIST, is an in-depth information resource that helps physicians todiagnose adult disease However, again, the underlying reasoning andknowledge systems employed in this diagnosis system are not readilyavailable in the literature Although MDDB [12] uses case-based reasoning,

it uses simple lists as its knowledge base A disadvantage of most of thesemethods is that, although they exhibit the capability of making differentialdiagnoses1, they do not offer definitive medical consultation2 A few earlierattempts at combining the theories of neural networks and case-basedreasoning are found in [13,14,15]

In this paper we attempt to correct the shortcomings of the abovemethods by presenting a new approach to medical diagnosis in which wecombine a knowledge base, whose underlying structure is the neural network[16, 17,18], with a Case-Based Reasoning system [19,20,21, 22] We begin

by reviewing case-based reasoning (CBR), we identify problems with suchreasoning when used in the medical domain, and provide the motivation forour approach We then examine neural networks, in particular the

mathematical underpinnings of heteroassociative memory neural networks

[23], and how they are incorporated in our model The architecture of ourmodel – the Case-based Memory Network (CBMN) – is introduced in thenext section We then present the medical diagnosis process in our model,followed by the operational model, a short simulation, and a consultationsession The paper ends with a brief evaluation of the model

2.1 Case-Based Reasoning

Case-based Reasoning (CBR) [19] is an inference mechanism that hasfound increasing use in expert systems It consists of the following fourstages: retrieve the most similar case or cases; reuse the retrieved case orcases to solve the problem by analogical reasoning; revise the proposed

Trang 28

solution; retain the parts of this experience which are likely to be useful forfuture problem solving.

When CBR is applied to medical diagnosis systems, the followingproblems are usually identified:

There is a concentration on reference rather than on diagnosis.There is a lack of intelligent dialog This may result in “missinginformation” and therefore a decrease of the accuracy of thediagnosis

Inability of most similarity algorithms to handle attributes whose

values are unknown

If the case base contains cases with attributes that take onmultiple (rather than just binary) values, then the case base will

be quite complex – requiring large numbers of predicates,relations, constraints, and operators [24]

Updating (revision) of the case base requires complex algorithmsand/or highly skilled users

To overcome these problems, therefore, we developed a variation to theCBR technique called the Case-Based Memory Network (CBMN) model[25] It was primarily developed to solve medical diagnostic problems andnot “pure” classification problems [22] To simulate the CBMN model wehave also designed and implemented an expert system prototype called

CaseB-Pro - an interactive system that accepts observed findings, generates

appropriate questions, and makes conclusions based on the observedfindings

2.2 The role of Neural Networks

The attraction of neural networks in our model is that they have theability to tolerate noisy inputs and to learn – features which are verydesirable in a medical diagnosis system The CBMN uses a special type of

neural network called a heteroassociative neural network [23] This neural

network provides a mechanism for learning, recording what has been learnt,and identifying stored knowledge The network stores disease patternsassociated with cases, and also recalls cases from memory based on thesimilarity of those cases to the symptoms of the current case This technique

is different from the similarity measure and retrieval techniques such as trees and Case Retrieval Nets (CRNs) [22] employed in CBR Relatedclassical works in the field of associative memories are found in [26, 27]

Trang 29

kd-Let the findings associated with a case be represented by a vector s(p),

where p = 1,2, .,P Each vector s(p) is an n-tuple Let the case associatedwith findings, s(p), be represented by a vector t(p) Each t(p) is an m-tuple

In our model we store (findings, case) pairs – that is, p =1, ,P Here, a “case” is an actual patient, and a “finding” is a symptom,sign, or an investigation “P” is the maximum number of cases in thedatabase, where

We also define a weight matrix where

The heteroassociative neural network can be described as a discretenetwork where the input and output nodes take values from the set {-1, 0,1}

We interpret the values as follows: -1 represents the findings that are absent,

0 represents the unknown findings, and 1 represents the findings that arepresent Now (observed findings) can be represented as an n-tupleinput vector, say k Vector k will then be mapped to the domain by thematrix – findings will be mapped onto cases That is,

Whenever new findings are presented to the current case, k is changedand, when multiplied with the weight matrix the vector t(p) isdetermined This value of t(p) is then used to determine a set of actual casesfrom the case base that matches the observed findings

3 The term training set is sometimes used to describe the vector t(p).

and

Trang 30

If a node in vector t has a positive value, then this node represents a case

in which the disorder associated with that case matches the current observedfindings For example, in map 1 the disorder associated with case 1 is apossible candidate

If a mapped vector t contains nodes with varying positive values, then thenode with the largest positive value is most likely to be the case that has themost likely associated disorder for the observed findings For example, if t =(3,1,-1) then the disorders associated with cases 1 and 2 are likely However,the disorder associated with case 1 is the more likely candidate

A disorder, say k, is a part of a definitive diagnosis only if the availablefindings that will lead to a diagnosis of k exceed the findings that are known.This serves as the point at which we stop posing questions to the system

If then k can be a part of the definitive diagnosis

In its simplest form the CBMN structure consists of input information entity(IE) nodes and output (case) nodes The design goal of the CBMN model is

to ensure that a knowledge base, and learning and reasoning mechanisms can

be incorporated in the same data structure and be used for diagnosticproblem solving In diagnosing a patient the physician utilizes informationfrom past typical or known exceptional cases that are usually described by alist of symptoms

To design a medical case base used for diagnostic purposes it isnecessary to have two types of cases in the case base [28]:

1

2

Case Generalizations called prototypes (pure cases) - these are

the “classical” (textbook) cases as viewed by the medical expert.General domain cases – these are actual cases

The features of disorders – in the input layer of the network - aremapped onto case prototypes (in the hidden layer) which represent the “textbook” view of disorders in terms of its identifying features A case –representing the output layer of the network - is an instance of a prototype,

in the same way that an object is an instance of a class in the object orientedprogramming paradigm [29, 30] The arrows in the diagram denote weightedlinks from the input nodes to the case nodes, and the calculation, andadjustment, of these weights is known as training the network

Cases are actual examples of patients and, in the CBMN model, cannotexist without prototypes, which are the physician’s representation of

Trang 31

disorders That is, every case in the case-base must be associated with oneand only one known classical medical case (prototype).

Figure 1 The CBMN Architecture with Prototypes

2.4 The Medical Diagnostic Process in the CBMN

A medical consultation consists of the following stages:

Recording of symptoms and patient history

Elicitation / Identification of signs

Formulation of notion of diagnosis [31] leading to a hypothesis anddifferential diagnosis

Investigations to narrow down or confirm the diagnosis

Medical diagnosis depends heavily on known facts about the case inquestion The facts are used to form a notion of diagnosis [31], which results

in a hypothesis This hypothesis is strengthened or weakened by discoveringmore facts about the current case, which in turn invokes a different notion ofdiagnosis This process is continued until a definitive diagnosis is found So,

Trang 32

again, making a definitive diagnosis is one of the essential differencesbetween CBMN and many extant medical diagnosis systems.

The new approach to the CBMN model includes a certainty factor4 [18]

and a prevalence factor for each prototype in the case base A certainty

factor is a number ranging from 0 to 10 that represents the physician’simpression of the significance of the presence of a feature with respect to theprototype A prevalence factor is a number ranging from 0 to 10 thatexpresses the physician’s impression that a patient will have the disorderassociated with a given prototype The certainty factor and prevalence factormay or may not be a scientific measurement since it represents only thephysician's notion of the disorder

The presence or absence of features will affect the physician’s belief ordisbelief in his hypothesis Hence the concept of the belief factor is used inthe CBMN to “balance” a physician’s belief and disbelief in a hypothesis

We now present the algorithm to find the next best question to be asked at

a given stage of the medical consultation process The system cannot reachany definitive conclusions until it has exhausted each stage

If the certainty factor of f is greater than a question threshold valueset by the experimenter), then the system moves to the nextconsultation stage until the investigation stage is reached

When a diagnostic stage is finished the system lists all the prototypeswith a confidence measure greater that a threshold value as candidatesfor the diagnosis of the presented features

Repeat steps 1-5 above until the investigation stage is completed

The main objective of this algorithm is to find the optimum diagnostic paththat will: (a) Get the correct diagnosis by asking the minimum number ofquestions and (b) exhaust each diagnostic stage before moving on to thenext

Trang 33

the following two steps: training the system to identify new cases and usingthe case base to gain a computer-generated diagnosis.

The two main design goals of the CaseB-Pro prototype - an expertsystem that combines the theories of neural networks and case-basedreasoning – are to implement and test the CBMN model and to develop acomputer system that can assist medical students and physicians with thediagnosing of patients In section 4 we provide an evaluation of the modeland an assessment of its “success”

Figure 2 Creating Prototype for CML

The CBMN model uses three types of data structure to representknowledge These include: feature information entity data structures,prototype information entity data structures and case information entity datastructures

The training of the model involves adding prototypes to the case base andthen, if desired, adding sub-prototypes and actual cases associated with thoseprototypes Training also involves the assignment of symptoms, signs,investigations, and exclusions Training is conducted by using data fromclassical and actual cases of the disorder in question The network is trainedeach time a new case is added to the database, or an existing case ismodified The neural network is used to store and recall cases Each

Trang 34

prototype, sub-prototype, and case, of the case base, has in common a nodeidentification number that uniquely identifies the network node in question.Figure 2 illustrates how the prototype for the Chronic MyeloidLeukaemia (CML) disorder can be added to the system Other prototypescan be added in a similar manner.

2.4.2 Interacting with the System

Figure 3 provides the interface through which users – namely, medicalstudents, physicians, or other medical experts - interact with the CaseB-Prosystem A typical session (consultation) is invoked when a user types the

“consult” command A list of symptoms is then shown where the user canselect major presenting symptoms in the case under consideration Thesystem then allows the user to enter examination results of the case inquestion When the examination stage is exhausted the consultation entersinto the investigation stage An example consultation is provided forillustration

Figure 3 Example of part of consultation session

Trang 35

3 RESULTS

For purposes of testing the system ten cases of Haematologicalconditions and related disorders were added to the case base In order tomake a preliminary evaluation of CaseB-Pro’s diagnostic capabilities, twomedical experts who specialize in Haemoncological disorders independentlysimulated 18 classical Haematological cases within the scope of the system

Key of symbols/abbreviations used in the table:

DD = Differential Diagnosis

+++ = DD Relevant (Related Disorders)

++ = DD Relevant (Unrelated Disorders)

+ = DD Possibly Relevant

- = DD Irrelevant

ALL = Acute Lymphoblastic Leukaemia

AML = Acute Myeloid Leukamia

AA = Aplastic Anemia

MM = Multiple Myeloma NHL = Non Hodgkins Lymphoma CML = Chronic Myeloid Leukaemia

MF = Myelofibrosis PRV = Polycythaemia Rubra Vera

TB = Tuberculosis

Table 1 shows the results of five out of 18 randomly selected simulatedinteractions It is, however, important to note that although only 5 of the 18cases are included in the table, in none of the 18 cases did the medical expertand CaseB-Pro arrive at totally different diagnoses More specifically, in 9

of the 18 (50%) of the cases the diagnoses were the same at the end of stage

2 (see section 2.4) of the medical consultation Of the 9 cases in which thediagnoses did not match exactly at the end of stage 2, 7 (77.8%) of themresulted in a match after the differential diagnosis (third) stage Theremaining 2 cases, for which concrete results were not possible at the end ofstage 3, produced results in concurrence with the medical expert’s diagnosesafter further investigations were conducted (in stage 4)

As indicated in section 2.4.1 CaseB-Pro was also used as a teaching tool

To test this feature of CaseB-Pro, two students in the advanced stage of theirmedical studies were encouraged to interact with the system Initially thestudents complained that they felt intimidated by the system and feared that

Trang 36

the system would expose their lack of knowledge However, after beingpersuaded by their medical instructor to use the system as a classroomexercise, the students were noticeably more relaxed Under the guidance oftheir medical professor, the students were allowed to conduct an extensiveinteraction with the system in an effort to diagnose MM, MF, PRV, ALL,and AA (please see the table below) In 80% of the cases the students wereable to arrive at the correct diagnoses largely due, they reported, to the easewith which they were able to follow the “trend of thought” used by CaseB-Pro.

Many medical diagnosis systems have been designed and are currently inuse The originality of our approach, however, is that we have designed andimplemented a system that combines case-based reasoning and artificialneural networks Because of the restriction placed on the length of our paper,

we were unable to provide a more detailed comparison with otherapproaches Consistent with our goals we were able to (a) implement andtest our model, and (b) to develop a computer system that can assist medicalstudents and physicians with the diagnosing of patients We have been able

to develop a prototype, the CaseB-Pro, based on our new approach, whoseauthenticity medical experts and medical students were able to test

It may be too early to make strong pronouncements about the success ofour model because it was tested on a small domain Therefore more researchusing our approach should be conducted using larger domains and differentevaluation strategies Thus far the feedback from persons who haveinteracted with our prototype has been encouraging, and therefore we areconfident that with further development and testing our prototype can evolveinto a useful, full-fledged system

Shortliffe, B.G Computer-Based Medical Consultants: MYCIN Elsevier/North-Holland, Amsterdam (1976)

Bouhaddou, O., Sorenson, D.: Iliad V4.4 Journal of Medical Systems, 15(1) (1991) 93-110

Trang 37

Adlassnig, K –P., Kolarz, G : Representation and semiautomatic acquisition of medical

knowledge in CADIAG-1 and CADIAG-2 Computers and Biomedical Research, 19

(1986) 63-79

Miller, R A., Pople, Jr., H E., and Myers, J D INTERNIST-I, an Experimental

Computer-Based Diagnostic Consultant for General Internal Medicine, New England J Med., 307

(August 1982) 468-476.

Pople, Jr., H E The Formation of Composite Hypotheses in Diagnostic Problem Solving:

An Exercise in Synthetic Reasoning, IJCAI 5 (1977)

Pople, Jr., H E Heuristic Methods for Imposing Structure on Ill-Structured Problems: The Structuring of Medical Diagnostics, in P Szolovits (ed.), Artificial Intelligence in Medicine, Boulder, CO (1982) 119-185

Masserie, F E., Miller, R.A., and Myers, J.D INTERNIST-I properties: Representing common sense and good medical practice in a computerized medical knowledge base.

Computers and Biomedical Research, 18 (1985) 458-479

Arene, I., et al : Evaluation of quick medical reference (QMR) as a teaching tool MD

Comput., 15 (1998) 323-326

Aliferis, C.F., et al: A temporal analysis of QMR J Am Med Inform Assoc., 3 (1996)

79-91

Miller, R.A., Masarie, F.E.,Jr : Use of the Quick Medical Reference (QMR) program as a

tool for medical education, Methods Inf Med., 28 (1989) 340-345

Gierl, L., Stengel-Rutkowski, S Integrating Consultation and Semi-automatic Knowledge Acquisition in a Prototype-based Architecture: Experiences with Dysmorphic Syndromes.

Artificial Intelligence in Med., 6 (1994) 29-49

Myllymaki, P., Tirri, H Bayesian case-based reasoning with neural networks In Proceedings of the IEEE International Conference on Neural Networks, San Francisco,

Gallant, S.I.: Neural Network Learning and Expert Systems MIT, Mass (1993)

Goodridge, W : The Case-Based Memory Network Model and its Use in Medical Expert Systems: MPhil Thesis, University of the West Indies, Barbados (2000)

Russell, S.J., Norvig, P : Artificial Intelligence : A Modern Approach, ed Pearson Education, Inc., New Jersey (2003)

Kolodner, J.L Case-Based Reasoning Morgan Kaufmann, New York (1993)

Koton, P Evaluating Case-Based Problem Solving In: Hammond, K.J (Ed.) Proceedings

of the DARPA Workshop on Case-Based Reasoning, Morgan Kaufmann, CA (1989)

260-17022 Lenz, M., et al, (eds.) Case-Based Reasoning Technology: from foundations to

Trang 38

Bergmann, R., Wilke, W Case-Based Reasoning: Foundations Issues, Methodological

Variations, and Systems Approaches AI Communications, 3 (1995) 53-118

Goodridge, W., Peter, H., Abayomi, A The Case-Based Neural Network Model and Its Use in Medical Expert Systems In: Lecture Notes in Artificial Intelligence, Vol.1620 Springer-Verlag, Denmark (1999)

Anderson, J A A simple neural network generating an interactive memory Mathematical

Schach, S.R Object-Oriented and Classical Software Engineering, ed McGraw-Hill, New York (2002)

Booch, G Object-Oriented Analysis and Design with Applications Addison-Wesley Publishing Company, California (1994)

Lucas, P A Theory of Medical Diagnosis as Hypothesis Refinement Conference on Artificial Intelligence, Springer-Verlag, New York (1997) 169-180

Trang 40

Hugo C Hoeschl;Vânia Barcellos

E-Gov, Juridical Intelligence and Systems Institute – Ijuris

Keywords: Artificial Intelligence (AI); Natural Intelligence (NI), Law; Information of

Technology

Abstract: This article intends to make an analysis of the intersection between Artificial

Intelligence (AI) and Natural Intelligence (NI) and its application in the scope of Right The impact caused by the Information Technology, methodologies and techniques used by the main systems developed in the last years and which the elements for the development of intelligent applications in the legal domain, with the aim of demonstrating the capacity to manipulate the knowledge properly and, being so, systemizing its relations, clarifying its bonds, to evaluate the results and applications There is a real need of new tools that conciliate the best of AI and NI techniques, generating methods and techniques of storage and manipulation of information, what will reflect on law and justice.

According to Carnelutti [7] “to discover the rule of legal constructing,science does not have, of course, other ways beyond the senses andintelligence Intelligence is nothing but the capacity to learn, to apprehend or

to understand, to interpret and, mainly to adapt the factual situations On theone hand, we have all this systematization of Law, using NI, its evolution, itstechnical, historical and social conditioning; on the other, we have thevertiginous evolution of the technology of the computer sciences, whichhave an search field dedicated of to the reproduction of human abilities,handicrafts as well as intellectual capacities, which is AI

Định dạng
Số trang	499
Dung lượng	20,98 MB