Docstoc

Medical-Template-PSU Research Proposal

Document Sample
Medical-Template-PSU Research Proposal Powered By Docstoc
					PSU Research Proposal


    Title: Health Informatics-
    Use of Medical Data Mining to
    Enhance Service, Diagnosis, and
    Reduce Costs




    Department: Computer
    Science



    PI Name: Ahmed Sameh

    Duration: 1 Year

    Budget Est.: SR 55,000

    Date: 12/20/2010




                  0
                                               I - PROPOSAL
I-1: PROPOSAL TITLE (Provide a short descriptive title, give prominence to keywords)


 Health Informatics- Use of Medical Data Mining to Enhance Service, Diagnosis, and
 Reduce Costs



I-2: COMMERCIAL POTENTIAL

 Could this project have commercial potential? (Select one)                                  Yes         No
  If yes, briefly elaborate on the commercial potential

 Healthcare is significantly affected by technological advancements, as technology both shapes and changes
 healthcare systems. As areas of computer science, information technology, and healthcare merge, it is
 important to understand the current and future implications of health informatics. The area of data mining is a
 new advancement in empirical research findings. Its application to healthcare informatics will reveal
 implications and consequences both positive and negative of health informatics for ones and the society’s
 health. As such, we can create commercial strategies for health based services/products to support new lines
 of health aware businesses. For example, after producing negative correlation between the over use of
 wireless mobile phones and health hazards on the brain, we can set up a commercial strategy that promotes
 the product based on such hazard negative correlation (another example Skin creams and skin cancer). We
 can also build a set of tools to be sold to Healthcare providers to reduce healthcare costs by analyzing
 individuals’ healthcare data and generate deviation reports that describes access spending. A methodology
 that was developed by one of the investigators has been enhanced and adapted to the Saudi environment.




I-3: CHECK-LIST

        Have you checked to ensure all questions in the application form have been answered?
        Have you checked to ensure you have included the correct costs in your budget?
        The principal investigator and all co-principal investigators should sign.



I-4: PERSONNEL AND AUTHORIZATION

PRINCIPAL INVESTIGATOR [PI]

 Academic Rank:           Professor                   Full Name: Ahmed Sameh
 College:       CIS                          Department: Computer Science
 Telephone:           494-8524                        Ext:      X8524

 Mobile:      0544299846                                    E-Mail: asameh@cis.psu.edu.sa
 Signature:                                                                         Date: 12/20/2010

CO- INVESTIGATOR(S) [CIs]            (non-PSU CIs permitted)



                                                        1
1)     Full Name:         Mohamed El-Affendi
Academic Rank:         Professor                        E-Mail:
College:       CIS                     Department: Computer Science
Telephone:                                 Mobile:
Signature:                                                             Date:   /    /

2)     Full Name:        Gregory Shapiro       (University of Massachusetts, Lowell, USA)
Academic Rank: Associate Professor                  E-Mail:
College:     Science & Engineering                   Department: Computer Science
Telephone:                                 Mobile:
Signature:                                                             Date:   /    /

3)     Full Name: Mohamed Tunsi
Academic Rank: Associate Professor                                E-Mail:
College: CIS                           Department: Computer Science
Telephone:                                 Mobile:
Signature:                                                             Date:   /    /

4)     Full Name: Ayman kassem (King Fahd University)
Academic Rank: Associate Professor                                E-Mail:
College:                             Department: Computer Science
Telephone:                                 Mobile:
Signature:                                                             Date:   /    /

5)     Full Name:
Academic Rank:                               E-Mail:
College:                             Department:
Telephone:                                 Mobile:
Signature:                                                             Date:   /    /

6)     Full Name:
Academic Rank:                               E-Mail:
College:                             Department:
Telephone:                                 Mobile:


                                                2
 Signature:                                                                         Date:      /      /

                                             II - DESCRIPTION
II-1: ABSTRACT (Provide a statement of the project - maximum 200 words)


 Health informatics (also called health care informatics, healthcare informatics, or medical informatics) is a
 discipline at the intersection of information science, computer science, and health care. It deals with resources
 and methods required to optimize the acquisition, storage, retrieval, and use of information in health and
 medical research. It is applied to the areas of medical research, clinical care, dentistry, pharmacy, nursing,
 and public health.

 In the area of medical research: With analytical data mining the screening, diagnosis and detection of
 diseases may get more efficient by reducing both time and costs for the corresponding procedures. It can also
 be used to improve individuals’ medication by using patients’ medication history to promote specific drugs
 directly to certain patients. It can be effective in providing low-cost screening using disease models that
 require easily-obtained attributes from historical cases. It can perform automated analysis of pathological
 signals (ECG, EEG, EMG), and medical images (MRI, CT, X-ray, and ultrasound). Data mining can also
 produce more accurate results in the field of empirical medical research. For example, classification of
 patterns of kidney stones in urine clustering,

 Data mining can be used in improving clinical care: For example it can be used in healthcare management.
 For example, time series analysis data mining algorithms can be used to predict (based on historical data)
 patient volume per month, patient volume per medical specialization, length of stay for incoming patients per
 medical department, ambulance run volume per month, and clinical decision support systems and information
 workflows.

 Data mining can be used in dentistry: It can produce dental and anatomical models for dentists. It can also be
 used to improve dental management. It can be used effectively in dental marketing, and teledentistry
 consultaion services. It can classify full crown and bridge plus all implants systems and cosmetic restorations.
 Lastly, it can be used for analysis of X-ray of head and neck region. It can improve infection control, and
 pharmaceuticals for dental use.

 Data mining can be used in pharmacy: Classification and clustering algorithms can be used for supplements,
 vitamins, and nutritional products grouping and recommendation. Association algorithms can be used to
 discover relationships between medications. Data mining techniques can be used to enhance alternative
 medicine, acupuncture and Chinese medicine, herbs by discovering correlative effects between these
 alternative medicines and their corresponding chemical ones.

 Data mining can be used in nursing, and public health care. It can discover better work needs for nursing
 specialization. It can be used to study epidemics and the way they spread in poor communities.

 Data mining can also be used to provide summary medical reports to hand-held portable devices to assist
 providers with data entry/retrieval or medical decision making, sometime called mHealth.

 Acquisition of medical data for data mining algorithms is quite a difficult task, specialy in Saudi Arabia.
 Although most of healthcare and medical facilities in KSA collect large amounts of digital data, they are
 hesitant to make this data available for research. As such the scope of this proposal is somewhat not very
 specific due to this fact. In this project, we have some arrangements for collecting data that we hope will
 eventually work. Depending on the type of data we can secure the scope of the project will focus on such
 area.

 As a starter we will explore only three of the above areas (management, medical research, and reducing
 healthcare costs) until we stumble into a rich area with data, background knowledge, and specific
 investigation queries in the other areas. It is not clear at the moment which area will open up for us.
 Healthcare agencies in KSA are so reluctant to provide their own data, and background knowledge. For the



                                                        3
Healthcare management and medical research, we were able to get two Saudi data sets: Monthly patient
volume at “family Community Primary Healthcare Clinic of King Faisal University”, we used this data set to
forecast future volumes based on past data. We used time series analysis algorithm for prediction. The second
data set is urinary kidney stones from the Division of Urology, Department of Surgery, King Khalid
University Hospital. We used this data set to classify the samples by cluster analysis of ionic composition.
These two experiments gave good results and stand as positive indication that further data sets can be
acquired and utilized by this project. The third direction of investigation is “reducing healthcare costs” by
analyzing individuals’ data and discovering deviations that leads to higher costs. Deviation analysis is a data
mining technique that can discover also frauds and misuses. The proposed system is called “KEFIR: Key
Findings Reporter”.



II-2: PROJECT GOALS AND OBJECTIVES


The specific goals of this project are to demonstrate the power of data mining in using healthcare informatics
to enhance:

1 -Medical Applications: Screening, Diagnosis, Therapy, Prognosis, Monitoring, Biomedical/Biological
analysis, Epidemiological studies, Hospital management, Classifying uninary stones by Cluster Analysis of
ionic composition data, Efficient screening tools reduce demand on costly health care resources, Forecasting
ambulance run volume, Predicting length-of-stay for incoming patients, Diagnosis and classification: e.g.
ECG Interpretation: Using NN to predict which o/p: SV tachycardia, Ventricular tachycardia, LV
hypertrophy, RV hypertrophy, Myocardial infarction, Diagnosis and classification: assist in decision making
with a large number of inputs. E.g. can perform automated analysis of pathological signals (ECG, EEG,
EMG), medical images (Mammograms, Ultrasound, X-ray, CT, and MRI). E.g. Heart Attacks, Chest pains,
Rheumatic disorders, Myocardial ischemia using the ST-T ECG complex), Coronary artery disease using
SPECT images

2 –Patient medication: Medicine revolves on pattern recognition, classification, and prediction: Diagnosis:
recognize and classify patterns in multivariate patient attributes; Therapy: Select from available treatment
methods, based on effectiveness, suitability to patient; Prognosis: Predict future outcomes based on previous
experience and present conditions, Forecasting Patient Volume using uni-variant Time-Series Analysis,
Improving Classification of multiple dermatology disorders by problem decomposition

3-Modeling Obesity in Saudi Arabian youth, Modeling the educational score in Saudi school health surveys,
Better insight into medical survey data, Building disease models for the instruction and assessment of
undergraduate medical and nursing students, Epidemiological studies: Study of health, disease, morbidity,
injuries and mortality in human communities. E.g. Predict outbreaks in simulated populations. E.g. Assess
asthma strategies in inner-city children, Discover patterns relating outcomes to exposures, Study
independence or correlation between diseases, Detecting pathological conditions: e.g. tracking glucose levels,
Accurate prognosis (prediction) and risk assessment for improved disease management and outcome: e.g.
predict ambulation following spinal cord injury. E.g. Survival analysis for AIDS patients. Predict pre-term
birth risk, determine cardiac surgical risk

A separate direction of investigation is “reducing healthcare costs” by analyzing individuals’ data and
discovering deviations that leads to higher costs. Deviation analysis is a data mining technique that can
discover also frauds and misuses. The proposed system is called “KEFIR: Key Findings Reporter”.

Saudi Health Care Data:
The one of the biggest problems in this research is Where to get Saudi data from. The above list represents
possible tracks for the project. It all depends on the data that we will find. We might find partial incomplete
data that we should work on its preparation. How to prepare and pre-process data? Is it possible to make use
of non-Saudi data for proof of concept?




                                                       4
                                           III - INTRODUCTION

III-1: REVIEW AND ANALYSIS OF RELATED WORK


 Medical expert systems such as MYCIN and Internist were among the first computerized systems in
 healthcare and medical applications. The use of data mining in healthcare informatics is a new direction of
 research.
 In Saudi Arabia, the Saudi Association for Health Information (SAHI) was established in 2006 ]to work under
 direct supervision of King Saud University for Health Sciences to practice public activities, develop
 theoretical and applicable knowledge, and provide scientific and applicable studies. SAHI is concerned with
 use information in health care by clinicians. SAHI transform health care by analyzing, designing,
 implementing, and evaluating information and communication systems that enhance individual and
 population health outcomes, improve patient care, and strengthen the clinician-patient relationship. SAHI use
 their knowledge of patient care combined with their understanding of informatics concepts, methods, and
 health informatics tools to:

         assess information and knowledge needs of health care professionals and patients,
         characterize, evaluate, and refine clinical processes,
         develop, implement, and refine clinical decision support systems, and
         lead or participate in the procurement, customization, development, implementation, management,
          evaluation, and continuous improvement of clinical information systems.

 Physicians who are board-certified in clinical informatics collaborate with other health care and information
 technology professionals to develop health informatics tools which promote patient care that is safe, efficient,
 effective, timely, patient-centered, and equitable. The purpose of this project is to add to these health
 informatics tools.




III-2: SIGNIFICANCE OF WORK


 The significance of this project is that it deals with the field of healthcare. Health care is a very important and
 wide field. The outputs of this project can be new results in medical research: With analytical data mining the
 screening, diagnosis and detection of diseases may get more efficient by reducing both time and costs for the
 corresponding procedures. It can also be used to improve individuals’ medication by using patients’
 medication history to promote specific drugs directly to certain patients. It can be effective in providing low-
 cost screening using disease models that require easily-obtained attributes from historical cases. It can
 perform automated analysis of pathological signals (ECG, EEG, EMG), and medical images (MRI, CT, X-
 ray, and ultrasound). Also Data mining can produce more accurate results in the field of empirical medical
 research. For example, classification of patterns of kidney stones in urine clustering,

 The outputs of this project can be used in improving clinical care. For example, the results of applying Data
 mining in healthcare management. For example, time series analysis data mining algorithms can be used to
 predict (based on historical data) patient volume per month, patient volume per medical specialization, length
 of stay for incoming patients per medical department, ambulance run volume per month, and clinical decision
 support systems and information workflows.

 The outputs of this project can also be used in dentistry, pharmacy, and nursing. For example, Data mining
 can also be used to provide summary medical reports to hand-held portable devices to assist providers with
 data entry/retrieval or medical decision making, sometime called mHealth. The outputs of this project can be
 used for insurance fraud detection, infection control, and medical waste management. This direction of
 investigation is “reducing healthcare costs” by analyzing individuals’ data and discovering deviations that


                                                          5
leads to higher costs. Deviation analysis is a data mining technique that can discover also frauds and misuses.
The proposed system is called “KEFIR: Key Findings Reporter”.


Acquisition of medical data for data mining algorithms is quite a difficult task, specialy in Saudi Arabia.
Although most of healthcare and medical facilities in KSA collect large amounts of data, they are hesitant to
make this data available for research. In this project, we have some arrangements for collecting data that we
hope will eventually work.




                            IV - APPROACH AND METHODOLOGY

IV-1: METHODOLOGY


There are many Data mining Methods to be applied to healthcare information such as: Time Series
Prediction, Classification, Clustering, and Association. Such algorithms can be applied to the various domain
in healthcare informatics:

1 -Medical Applications: Screening, Diagnosis, Therapy, Prognosis, Monitoring, Biomedical/Biological
analysis, Epidemiological studies, Hospital management. For example, forecasting Patient Volume using uni-
variant Time-Series Analysis, Classifying uninary stones by Cluster Analysis of ionic composition data,
Optimize allocation of hospital resources, Forecasting ambulance run volume, Predicting length-of-stay for
incoming patients, Therapy: Based on modeled historical performance , select best intervention course: e.g.
best treatment plans in radiotherapy. E.g. Using patient model, predict optimum medication dosage; e.g. for
diabetics, Accurate prognosis (prediction) and risk assessment for improved disease management and
outcome: e.g. predict ambulation following spinal cord injury. E.g. Survival analysis for AIDS patients.
Predict pre-term birth risk, determine cardiac surgical risk, Diagnosis and classification: e.g. ECG
Interpretation: Using NN to predict which o/p: SV tachycardia, Ventricular tachycardia, LV hypertrophy, RV
hypertrophy, Myocardial infarction, Diagnosis and classification: assist in decision making with a large
number of inputs. E.g. can perform automated analysis of pathological signals (ECG, EEG, EMG), medical
images (Mammograms, Ultrasound, X-ray, CT, and MRI). E.g. Heart Attacks, Chest pains, Rheumatic
disorders, Myocardial ischemia using the ST-T ECG complex), Coronary artery disease using SPECT
images, and Risk assessment for improved disease management e.g. spinal cord injuries, and hart attacks

2- Modeling the educational score in Saudi school health surveys, Modeling Obesity in Saudi Arabian youth,
Epidemiological studies: Study of health, disease, morbidity, injuries and mortality in human communities.
E.g. Predict outbreaks in simulated populations. E.g. Assess asthma strategies in inner-city children

3-Better insight into medical survey data, effective Data Fusion from multiple sensors, Efficient screening
tools reduce demand on costly health care resources, Discover patterns relating outcomes to exposures, Study
independence or correlation between diseases, Detecting pathological conditions: e.g. tracking glucose levels,
Data fusion from various sensing modalities in ICUs to assist overburdened medical staff

For example, the figure shows the medical chart of a patient. Methods from the above three categories can be
applied to this chart to discover deviation measures. This direction of investigation is “reducing healthcare
costs” by analyzing individuals’ data and discovering deviations that leads to higher costs. Deviation analysis
is a data mining technique that can discover also frauds and misuses. The proposed system is called “KEFIR:
Key Findings Reporter” (see figure below).




                                                       6
7
 The proposed system will apply deviation analysis techniques on individuals’ healthcare data to find
 “interesting deviations”. The system will then augment these findings with plausible causes, and suggest
 recommendations of appropriate actions. Each healthcare provider can apply the proposed system to its own
 set of insured individuals. Each on possible medical areas covered: Inpatient, Outpatient, Surgical, Maternity,
 etc. For each area, patient, the proposed system will apply “measures and formulas” to discover large
 deviations from the norms. Or deviations from previous period and/or next period. Through models, and
 formulas such deviations can be converted to costs.

 Deliverables in phase I: Beta Version I + its Benchmark + its Tuning
 Deliverables in Phase II: Beta Version II + its Benchmark + its Tuning
 Deliverables in Phase III: Beta Version III + its Benchmark + its Tuning
 Deliverables in Phase IV: Final Version + User Manual
 The following is the project plan schedule. It represents those different tasks within the research and
 estimated duration for each.




IV-2: AVAILABLE RESOURCES

Currently there are some open source data mining algorithms that can be used as tools in some of the above
investigations.

IV-3: EXPECTED RESULTS/OUTPUTS


 Health care is a very important and wide field. The outputs of this project can be new results in medical
 research: With analytical data mining the screening, diagnosis and detection of diseases may get more
 efficient by reducing both time and costs for the corresponding procedures. It can also be used to improve
 individuals’ medication by using patients’ medication history to promote specific drugs directly to certain
 patients. It can be effective in providing low-cost screening using disease models that require easily-obtained
 attributes from historical cases. It can perform automated analysis of pathological signals (ECG, EEG,
 EMG), and medical images (MRI, CT, X-ray, and ultrasound). Also Data mining can produce more accurate
 results in the field of empirical medical research. For example, classification of patterns of kidney stones in
 urine clustering,



                                                        8
The outputs of this project can be used in improving clinical care. For example, the results of applying Data
mining in healthcare management. For example, time series analysis data mining algorithms can be used to
predict (based on historical data) patient volume per month, patient volume per medical specialization, length
of stay for incoming patients per medical department, ambulance run volume per month, and clinical decision
support systems and information workflows.

The outputs of this project can also be used in dentistry, pharmacy, and nursing. For example, Data mining
can also be used to provide summary medical reports to hand-held portable devices to assist providers with
data entry/retrieval or medical decision making, sometime called mHealth.

As a starter we will explore only two of the above areas (management, medical research) until we stumble
into a rich area with data, background knowledge, and specific investigation queries in the other areas. It is
not clear at the moment which area will open up for us. Healthcare agencies in KSA are so reluctant to
provide their own data, and background knowledge. For the Healthcare management and medical research,
we were able to get two Saudi data sets: Monthly patient volume at “family Community Primary Healthcare
Clinic of King Faisal University”, we used this data set to forecast future volumes based on past data. We
used time series analysis algorithm for prediction. The second data set is urinary kidney stones from the
Division of Urology, Department of Surgery, King Khalid University Hospital. We used this data set to
classify the samples by cluster analysis of ionic composition. These two experiments gave good results and
stand as positive indication that further data sets can be acquired and utilized by this project.

The following diagrams are results from the two data sets:




                                                       9
 Methods above can be applied to individual patient’s charts to discover deviation measures. This direction of
 investigation is “reducing healthcare costs” by analyzing individuals’ data and discovering deviations that
 leads to higher costs. Deviation analysis is a data mining technique that can discover also frauds and misuses.
 The proposed system is called “KEFIR: Key Findings Reporter”. The proposed system will apply deviation
 analysis techniques on individuals’ healthcare data to find “interesting deviations”. The system will then
 augment these findings with plausible causes, and suggest recommendations of appropriate actions. Each
 healthcare provider can apply the proposed system to its own set of insured individuals. Each on possible
 medical areas covered: Inpatient, Outpatient, Surgical, Maternity, etc. For each area, patient, the proposed
 system will apply “measures and formulas” to discover large deviations from the norms. Or deviations from
 previous period and/or next period. Through models, and formulas such deviations can be converted to costs.




                                             V - REFERENCES
1- Saudi Ministry of Health http://www.moh.gov.sa/english/index.php
2- SAMIRAD http://www.saudinf.com/main/c6m.htm




                                                       10
                               VI - ROLE(S) OF THE INVESTIGATOR(S)
                   (Attach a brief CV for each investigator following the format in Appendix A)

 #           Name of Investigator                            Area of contribution to the project
 1       Prof. Ahmed Sameh                      System Design & Implementation


 2       Prof. Mohamed El-Affendi               Data Collection & Preparation


 3       Dr. Mohamed Tunsi                       Data Mining Tools


 4       Dr. Gregory Shapiro                    System Design & Implementation


 5       Dr. Ayman Kassem                           Testing


 6




                                       VII - PROJECT SCHEDULE

PHASES OF PROJECT IMPLEMENTATION (SEE GANETT CHART ABOVE)

                                                                                                  Duration
 Steps                                             Task
                                                                                                   (Months)
             System requirements specifications: Sameh, Tunsi
             System Architecture : El-Affendi
             System Design: Sameh, Greg
             Databases Designs: Greg
             Prototyping of critical sub-systems: Tunsi, Sameh
             System Detailed Design: Sameh, Tunsi
             Beta Version Implementation: Sameh, Ayman
             Testing: El-Affendi
             Building Deployment Environment: Sameh                                               See Gantt
             Bench Marking and Collecting Results (First Round): Tunsi                              Chart
     1       System Tuning (Based on First Round Results): Sameh                                  within this
             Bench Marking and Collecting Results (Second Round Results): Al-Effendi
                                                                                                   proposal
             System Tuning (Based on Second Round Results): Sameh
             Bench Marking and Collecting Results (Third Round Results): Tunsi
             Version 1 Release:Tunsi
             Results Documentation and Analysis with the Performance requirements:Sameh
             Detailed Code Documentation: Sameh
             User and Installation Guide (Full How To): Ayman




                                                        11
                              Total duration for the proposed project            12 Month




             VIII - BUDGET OF THE PROPOSED RESEARCH (Budget in SAR)
                                                        Amount Priority 1 = Max;  Amount
                       Item                            Requested        2 = Mod; Approved
                                                         (SAR)          3 = Low.   (SAR)
A. Personnel* (Research Assistant)                          24,000           1
                                                                                   For
 1- Student Ahmed Al-Jabreen
                                                                                  Official
2- Student Kamal Qarawi                                                            Use
3- Student Omar Al-Moughnee
4- Student Amro Al-Munajjed


B. Equipment* (List)                                         5,000           1

   Development Server



C. Testing and Analysis* (Location/Laboratory)               5,000           2

 Labtop Computer



D. Consumables* (List)                                       1000            2

Desk Tools



E. Travel *(Local/Internat)                                 10,000           1

1- Travel for Gregory (Lowell Massachusetts /
Riyadh)
2- Travel for Ayman (Zahram / Riyadh)




                                                  12
F. Software* (List)                                        10,000                  1

      -SAS Data Mining Tools
     -Oracle 9i Data Mining
    -Clementines from SPSS
    -Ants Model Builder




G. Other Items* (Itemize)                                        ---




                 Total Amount Requested (SAR)           55,000

    IX- JUSTIFICATION OF BUDGET (Justify each item listed in the budget in the previous section)
             Item                                          Justification
A                                  Salary of SR 500 for each student for 12 months the duration of
                               the project.
Students Research
Assistants




B                                  For developing the proposed experiments.
Development Server




C                                For on-site data collection and on-site testing
Laptop Computer




D                                  For general use by team members
Desk tools




E                                  For the two outside PSU team members.
Travel




                                                 13
F                               Data Mining Tools Software
Software




G




                  X - RELEASE TIME FOR RESEARCH TEAM MEMBERS

RELEASE TIME FROM TEACHING LOAD

                                                       Time Commitment       Teaching
    #                   Team Member
                                                         (hrs/weeks/terms)   Load Max
    PI                                                                       e.g. 1 course
          Ahmed Sameh                                         4 h/w
                                                                                 FA11
    CI1
           Mohamed El-Affendi                                 2h/w
    CI2
           Mohamed Tunsi                                      2h/w
    CI3
           Gregory Shapiro                                    1h/w
    CI4
           Ayman Kassem                                       1h/w
    CI5



                                XI - EXTERNAL FUNDING

                                                                              Used for
    #                  Source of Funds                   Amount (SAR)
                                                                              …… costs
    1      None

     2

    3




                                             14
    Appendix A: CV Format for Principal Investigator and Co-Investigators
                 (Two pages maximum, material should be related to submitted project)


Title and Name: Professor Ahmed Sameh


Specialty: Artificial Intelligence, Modeling and Information Systems


Department and College: Computer Science


Summary of Experience/Achievements Related to Research Proposal:

1- Ahmed Sameh, Ayman Kassem, “Lumbar Spine: Parameter Estimation for Realistic Modelling”, WSEAS
Transactions on Applied and Theoretical Mechanics, ISSN:1991-8747, Issue 5, Volume 2, May 2008

2- Ahmed Sameh, Ayman Kassem, “A General Framework for Lumbar Spine Modelling and Simulation”,
International Journal of Human Factors in Modelling and Simulation, IJHFMS, The North American Spine
Society, Volume 1, Issue 2, January 2008

3- Dalia El-Mansy, Ahmed Sameh, “A Collaborative Inter-Data Grid Strong Semantic Model with Hybrid
Namespaces”, Journal of Software (JSW), Academic Publisher, Volume 3, Issue 1, January 2008

4- Ahmed Sameh, “Simulating Lumbar Spine Motion”, Research in Computing Science (RCS) Journal,
National Polytechnic Institute of Mexico, ISSN 1665-9899, Volume 18, Issue 4, June 2007

5- Ahmed Sameh, and Ayman Kassem, “3D Modeling and Simulation of Lumbar Spine Dynamics”, in the
International Journal of Human Factors Modelling and Simulation , Volume IJHFMS-942, 2007

6-Adhami Louai, Abdel-Malek Karim, McGowan Dennis, Mohamed A. Sameh, "A Partial Surface/Volume
Match for High Accuracy Object Localization", International Journal of Machine Graphics and Vision, vol
10, no. 2, 2001

7-Mohamed A. Sameh, “Interactive Learning in Artificial Neural Networks Through Visualization”, The
International Journal of Computers and Applications (IJCA), Vol. 20, #2, 1998

8- Mohamed A. Sameh and Attia E. Emad, "Parallel 1D and 2D Vector Quantizers Using Kohonen Self-
Organizing Neural Network", in the International Journal of the Neural Computing and Applications, V.
(4), no. 2, Springer Verlag, London, 1996

9- Ahmed Sameh, Amgad Madkour, “Intelligent open Spaces: Learning User History Using Neural Network
for Future Prediction of Requested Resources”, Proceedings IEEE CSE'08, 11th IEEE International
Conference on Computational Science and Engineering, 16-18 July 2008, São Paulo, SP, Brazil. IEEE
Computer Society 2008, ISBN 978-0-7695-3193-9

10- Ahmed Sameh, Ayman Kaseem, “Modelling and Simulation of Human Lumbar Spine”, Proceedings of
the 2008 International Conference on Modelling, Simulation, and Visualization, MSV 2008, Las Vegas,
Nevada, July 14-17, 2008, CSREA Press 2008, ISBN 1-60132-081-7

11- Ahmed Sameh, Dalia El-Mansy, “A Collaborative Inter-Data Grids Model with Hybrid Namespace”, 14th
IEEE International Conference on Availability, Reliability, and Security, (DAWAM – ARES 2007), Vienna,
Austria, April 10-13, 2007



                                                  15
12- Ahmed Sameh, “Simulating Lumbar Spine Motion: Parameter Estimation for Realistic Modelling”, The
6th Mexican International Conference on Artificial Intelligence (MICAI07), Aguascalientes, Mexico,
November 4-10, 2007

13- Sherif Akoush, Ahmed Sameh, “Bayesian Learning of Neural Networks for Mobile User Position
Prediction”, The International Workshop on Performance Modelling and Evaluation in Computers and
telecommunication Networks (PMECT07)- part of the IEEE 16th International Conference on Computer
Communications and Networks, ICCCN 2007, Honolulu, Hawaii, August 13-16, 2007

14- Ahmed Sameh, “The Schlumberger High Performance Cluster at AUC”, Proceedings of the 13th
International Conference on Artificial Intelligence Applications, Cairo, February 4-6, 2005

15-Mohamed A. Sameh, Rehab El-Kharboutly, "Modeling a Service Discovery Bridge Using Rapide
Architecture Description Language", Proceedings of the 18th European Simulation Multiconference (ESM
2004), Magdeburg, Germany, June 13-16, 2004

16-Mohamed A. Sameh, Rehab El-Kharboutly, and Hazem Al-Ashmawy, "Modeling Wireless Discovery
and Deployment of Hybrid Multimedia N/W-Web Services Using Rapide ADL", Proceedings of the 7th
IEEE International Conference on High Speed N/Ws amd Multimedia Communications (HSNMC04),
Toulouse, France, June 30- July 2nd, 2004

17-Mohamed A. Sameh, Rhab El-Kharboutly, "Modeling Jini-UpnP Using Rapide ADL", Proceedings of the
10th EUROMEDIA Conference (EUROMEDIA 2004), Hasselt, Belgium, April 19-21, 2004

18-Mohamed A. Sameh, "E-Access Custom Webber: A Multi-Protocol Stream Controller", Proceedings of
the IADIS International Conference on Applied Computing, Lisbon, Portugal, March 23-26, 2004

19- Ayman Kassem, A. Sameh, and Tony Keller, “Modeling and Simulation of Lumbar Spine Dynamics”,
Proceedings of the 15th IASTED International Conference on Modeling and Simulation and Optimization
(MSO 2004), Marina Del Rey, California, March 2004

20-Mohamed A. Sameh, and Shenouda S., "Tera-Scale High Performance Distributed and Parallel Super-
Computing at AUC", Proceedings of the 12th International Conference on Artificial Intelligence, Cairo, Feb.
18-20, 2004

21-Shenouda S., Mohamed L., and Mohamed A. Sameh, "AUC Cluster Participation in Global Grid
Communities", Proceedings of the 12th International Conference on Artificial Intelligence, Cairo, Feb. 18-
20, 2004

22-El-Ashmawi Hazem, and Mohamed A. Sameh, “XML-Socket Language-Independent Distributed Object
Computing Model”, Proceedings of the 15th International Conference on Parallel and Distributed
Computing Systems, Louisville, Kentucky, September, 2002

23-Mohamed Karasha, Greenshields Ian, and Mohamed A. Sameh, “HUSKY: A Multi-Agent Architecture
for Adaptive Scheduling of Grid Aware Applications”, Proceedings of the High Performance Computing
Symposium with the 2002 Advanced Simulation Technologies Conference (ASTC 2002), San Diego,
California, April 14-18, 2002

24-Atef Rania, Mohamed A. Sameh,and Abdel-Malek Karim, "Three Dimensional Deformable Modeling of
the Spinal Lumbar Region", Proceedings of the 11th International Conference on Intelligent Systems on
Emerging Technologies (ICIS-2002), Boston, July 18-20, 2002

25-Kassem Ayman, Mohamed A. Sameh, and Abdel-Malek Karim, "A Spring-Dashpot-String Element for
Modeling Spinal Column Dynamics", Proceedings of the International Workshop on Growth and Motion in
3D Medical Images, Copenhagen, Denmark, May 28- June 1, 2002

26-Kassem Ayman, and Mohamed A. Sameh, “A Fast Technique for modeling and Control of Dynamic
System”, Proceedings of the 11th International Conference on Intelligent Systems on Emerging Technologies



                                                    16
(ICIS-2002), Boston, July 18-20, 2002

27-Mohamed A. Sameh, and Kaptan Noha, "Anytime Algorithms for Maximal Constraint Satisfaction",
Proceedings of the ISCA 14th International Conference on Computer Applications in Industry and
Engineering (CAINE' 2001), Nov. 27- 29, at Las Vegas, Nevada, 2001

28-Mohamed A. Sameh, and Mansour Marwa "Enhancing Partitionable Group Membership Service in
Asynchronous Distributed Systems", Proceedings the ISCA 14th International Conference on Computer
Applications in Industry and Engineering (CAINE' 2001), Nov. 27- 29, at Las Vegas, Nevada, 2001

29-Abdalla Mahmoud, Mohamed A. Sameh, Harras Khalid, Darwich Tarek, "Optimizing TCP in a Cluster of
Low-End Linux Machines", Proceedings of the 3rd WSEAS Symposium on Mathematical Methods and
Computational Techniques in Electrical Engineering, Athens, Greece, Dec. 29-31, 2001

30-Rania Abdel Hamid, and Mohamed A. Sameh, “Visual Constraint Programming Environment for
Configuration Problems”, Proceedings of the 15th International Conference on Computers and their
Applications, New Orleans, Louisiana, March 2000

31-Essam A. Lotfy, and Mohamed A. Sameh, “Applying Neural Networks in Case-Based Reasoning
Adaptation for Cost Assessment of Steel Buildings”, Proceedings of the 10th International Conference on
Computing and Information, ICCI-2000, Kuwait, Nov. 18-21, 2000

32-Ghada A. Nasr, and Mohamed A. Sameh, “ Evolution of Recurrent Cascade Correlation Networks with a
Distributed Collaborative Species”, Proceedings of the IEEE Symposium on Computations of Evolutionary
Computation and Neural Networks, San Antonio, TX, May 2000

33-El-Beltagy S., Rafea A., and Mohamed A. Sameh, “An Agent Based Approach to Expert System
Explanation”, Proceedings of the 12th International FLAIRS Conference, Orlando, Florida, 1999

34- Mohamed A. Sameh, Botros A. Kamal, "2D and 3D Fractal Rendering and Animation", Proceedings of
the Seventh Eurographics Workshop on Computer Animation and Simulation, Aug. 31st- Sept. 2nd, in
Poitiers, France, 1996

35-Mohamed A. Sameh, "A Robust Vision System for three Dimensional Facial Shape Acquisition,
Recognition, and Understanding", Proceedings of the 1st Golden West International Conference on
Intelligent Systems, Reno, Nevada, 1991

36-Mohamed A. Sameh, "A Neural Trees Architecture for Fast Control of Motion", Proceedings of the
FLAIRS Artificial Intelligence Conference, Cocoa Beach, Florida, 1991

37-Mohamed A. Sameh, Armstrong W.W., "Towards a Computational Theory for Motion Understanding:
The Expert Animator Model", Proceedings of the 4th International Conference on Artificial Intelligence for
Space Applications, Nasa, Huntsville, Alabama, 1988

CV of Gregory Shapiro:

Gregory Piatetsky-Shapiro, Ph.D. is the President of KDnuggets, which provides
research and consulting services in the areas of data mining, knowledge discovery,
bioinformatics, and business analytics. Previously, he led data mining and consulting
groups at GTE Laboratories, Knowledge Stream Partners, and Xchange. He has extensive
experience developing CRM, customer attrition, cross-sell, segmentation and other models
for some of the leading banks, insurance companies, and telcos. He also worked on clinical
trial, microarray, and proteomic data analysis for several leading biotech and
pharmaceutical companies.

Gregory served as an expert witness and provided expert opinions in several cases.



                                                    17
Gregory is also the Editor and Publisher of KDnuggets News, the leading newsletter on
data mining and knowledge discovery (published since 1993), and the KDnuggets.com
website, (published since 1997) data mining community's top resource for data mining and
analytics software, jobs, solutions, courses, companies, and publications, and more. From
1994 to 1997, while at GTE Laboratories, he published Knowledge Discovery Nuggets
website, an earlier version of KDnuggets.

Gregory is the founder of Knowledge Discovery in Database (KDD) conferences. He
organized and chaired the first three Knowledge Discovery in Databases workshops in
1989, 1991, and 1993, and then chaired the KDD Steering Committee until 1998, when he
co-founded ACM SIGKDD, the leading professional organization for Knowledge
Discovery and Data Mining. He served as Director (1998 - 2005) and was elected
SIGKDD Chair (2005-2009 term).

Gregory has over 60 publications, including 2 best-selling books and several edited
collections on topics related to data mining and knowledge discovery, including SIGKDD
Explorations Special Issue on Microarray Data Mining (Vol 5, Issue 2, Dec 2003).

Gregory received ACM SIGKDD Service Award (2000) and IEEE ICDM Outstanding
Service Award (2007) for contributions to data mining field and community.

Publication Record:

       Data Mining and Knowledge Discovery - 1996 to 2005: Overcoming the Hype and moving
        from "University" to "Business" and "Analytics", Gregory Piatetsky-Shapiro, Data Mining
        and Knowledge Discovery journal, 2007.
       What Are The Grand Challenges for Data Mining? KDD-2006 Panel Report, Gregory
        Piatetsky-Shapiro, Robert Grossman, Chabane Djeraba, Ronen Feldman, Lise Getoor,
        Mohammed Zaki, KDD-06 Panel Report, SIGKDD Explorations, 8(2), Dec 2006.
       10 Challenging Problems in Data Mining Research, Qiang Yang, Xindong Wu, Pedro
        Domingos, Charles Elkan, Johannes Gehrke, Jiawei Han, David Heckerman, Daniel
        Keim, Jiming Liu, David Madigan, Gregory Piatetsky-Shapiro, Vijay V. Raghavan,
        Rajeev Rastogi, Salvatore J. Stolfo, Alexander Tuzhilin, and Benjamin W. Wah, Spec.
        Issue of International Journal of Information Technology & Decision Making, Vol. 5, No. 4
        (2006).
       On Feature Selection through Clustering, R. Butterworth, G. Piatetsky-Shapiro, Dan A.
        Simovici, Proceedings of IEEE ICDM-2005 Conference, Nov 2005.
       A Comprehensive Microarray Data Generator to Map the Space of Classification and
        Clustering Methods, Piatetsky-Shapiro, Gregory , and Grinstein, Georges G., Tech.
        Report No. 2004-016, U. Massachusetts Lowell, 2004.
       Microarray Data Mining: Facing the Challenges (PDF), Gregory Piatetsky-Shapiro and
        Pablo Tamayo, SIGKDD Explorations, Dec 2003.
       Capturing Best Practice for Microarray Gene Expression Data Analysis, G. Piatetsky-
        Shapiro, T. Khabaza, S. Ramaswamy, in Proceedings of KDD-2003 (ACM Conference
        on Knowledge Discovery and Data Mining), Washington, D.C., 2003. (Honorary mention
        for best application paper).
       Measuring Real-Time Predictive Models (poster presentation), S. Steingold, R. Wherry, G.
        Piatetsky-Shapiro, in Proceedings of IEEE ICDM-2001 Conference, San Jose, CA, Nov
        2001.
       Measuring Lift Quality in Database Marketing, (pdf, 100K), G. Piatetsky-Shapiro and S.
        Steingold, SIGKDD Explorations, Dec 2000.
       Knowledge Discovery in Databases: 10 years after, Gregory Piatetsky-Shapiro, SIGKDD
        Explorations, Vol 1, No 2, Feb 2000.


                                               18
   Expert Opinion: The data-mining industry coming of age (PDF), Gregory Piatetsky-Shapiro,
    IEEE Intelligent Systems, Vol. 14, No. 6, November/December 1999.
   Estimating Campaign Benefits and Modeling Lift, (MS Word) Gregory Piatetsky-Shapiro
    and Brij Masand, Proceedings of KDD-99 Conference, ACM Press, 1999.
   Knowledge Discovery and Acquisition from Imperfect Information, G. Piatetsky-
    Shapiro, chapter in A. Motro and P. Smets, eds., Uncertainty in Information
    Management, Kluwer, 1997.
   From Data Mining to Knowledge Discovery in Databases,
    Usama Fayyad, Gregory Piatetsky-Shapiro, and Padhraic Smyth. AI Magazine 17(3):
    Fall 1996, 37-54
   Mining Business Databases, Ron Brachman, Tom Khabaza, Willi Kloesgen, Gregory
    Piatetsky-Shapiro, and Evangelos Simoudis, Communications of ACM, 39:11,
    November 1996.
   Data Mining and Knowledge Discovery in Databases: An overview, Usama M. Fayyad,
    Gregory Piatetsky-Shapiro, Padhraic Smyth, Communications of ACM, 39:11,
    November 1996.
   Improving Classification Accuracy by Automatic Generation of Derived Fields Using
    Genetic Programming, B. Masand and G. Piatetsky-Shapiro, in Advances in Genetic
    Programming II, MIT Press, 1996.
   An Overview of Issues in Developing Industrial Data Mining and Knowledge Discovery
    Applications, Gregory Piatetsky-Shapiro, Ron Brachman, Tom Khabaza, Willi
    Kloesgen, and Evangelos Simoudis, in KDD-96 Conference Proceedings, ed. E.
    Simoudis, J. Han, and U. Fayyad, AAAI Press, 1996.
   A Comparison of Approaches For Maximizing Business Payoff of Prediction Models,
    Brij Masand and Gregory Piatetsky-Shapiro, in KDD-96 Conference Proceedings, ed. E.
    Simoudis, J. Han, and U. Fayyad, AAAI Press, 1996.
   Knowledge Discovery and Data Mining: Towards a Unifying Framework, Usama
    Fayyad, Gregory Piatetsky-Shapiro, and Padhraic Smyth in KDD-96 Conference
    Proceedings, ed. E. Simoudis, J. Han, and U. Fayyad, AAAI Press, 1996.
   From Data Mining to Knowledge Discovery: an Overview, U. Fayyad, G. Piatetsky-
    Shapiro, P. Smyth, in Advances in Knowledge Discovery and Data Mining, AAAI/MIT
    Press, 1996.
   Selecting and Reporting What is Interesting: The KEFIR Application to Healthcare
    Data, C. Matheus, G. Piatetsky-Shapiro, and D. McNeill, in Advances in Knowledge
    Discovery and Data Mining, AAAI/MIT Press, 1996.
   Knowledge Discovery in Personal Data vs. Privacy, G. Piatetsky-Shapiro, IEEE expert,
    April 1995
   KDD-93: Progress and Challenges in Knowledge Discovery in Databases (PDF, latex),
    G. Piatetsky-Shapiro, C. Matheus, P. Smyth, R. Uthurusamy, AI magazine, 15(3): Fall
    1994, 77-82.
   The Interestingness of Deviations, G. Piatetsky-Shapiro, C. Matheus, in Proceedings
    of KDD-94 workshop, AAAI Press, 1994.
   Systems for Knowledge Discovery in Databases, C. Matheus, P. Chan, G. Piatetsky-
    Shapiro, IEEE Transactions on Data and Knowledge Engineering, 5(6), Dec. 1993.
   Measuring Data Dependencies, G. Piatetsky-Shapiro and C. Matheus, in Proceedings
    of AAAI-93 Workshop on KDD, AAAI Press Report WS-02.
   "Knowledge Discovery in Databases - An Overview", W. Frawley, G. Piatetsky-
    Shapiro, C. Matheus, (PDF), in Knowledge Discovery in Databases 1991, pp. 1--30.
    Reprinted in AI Magazine, Fall 1992.
   Knowledge Discovery Workbench for Exploring Business Databases, G. Piatetsky-
    Shapiro and C. Matheus, in Int. J. of Intelligent Systems, 7(7), Sep 1992.
   Report on AAAI91 workshop on Knowledge Discovery in Databases, G. Piatetsky-
    Shapiro, IEEE Expert, Fall 1991
   "Discovery, Analysis, and Presentation of Strong Rules", G. Piatetsky-Shapiro (in
    Knowledge Discovery in Databases 1991), pp. 229-248.
   Knowledge Discovery in Real Databases: A workshop report (PDF, html), AI Magazine,


                                           19
        vol. 11, no. 5, January 1991.

Books and Proceedings

            ACM TKDD Special Issue on Knowledge Discovery for Web Intelligence, Guest
        Editors: Ning Zhong, Gregory Piatetsky-Shapiro, Yiyu Yao, Philip S. Yu, Dec 2010.
       SIGKDD Explorations Special Issue on Microarray Data Mining, Vol. 5, Issue 2, Dec 2003.
       R. Agrawal, P. Stolorz, and G. Piatetsky-Shapiro, eds., Proceedings of KDD-98: 4th
        International Conf. on Knowledge Discovery and Data Mining, AAAI Press, 1998.
       U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, R. Uthurusamy, eds., Advances in
        Knowledge Discovery in Databases, AAAI/MIT Press 1996.
       Mini-symposium on KDD vs. Privacy, IEEE Expert, April 1995. full text of a draft.
       Special issue of J. of Intelligent Information Systems on Knowledge Discovery in
        Databases, ed. G. Piatetsky-Shapiro, 4(1), Jan 1995.
       KDD-93: Proceedings of AAAI-93 Workshop on KDD, ed. G. Piatetsky-Shapiro, AAAI
        Press Report WS-02, 1993.
       Gregory Piatetsky-Shapiro and William Frawley, eds., Knowledge Discovery in
        Databases, AAAI/MIT Press, 1991




                        Appendix B: Evaluations and Approvals

COLLEGE REVIEW COMMITTEE Evaluation and Recommendation

                                                            Excel-    Very
                     Item/ Evaluation                                          Good      Weak
                                                             lent     Good
Research methodology
Research objectives
Research originality
Research contribution
Research applicability and relevance
Overall evaluation

Recommendations of College Committee                            Approved         Disapproved

Amount of Budget Approved by College Committee:             (SAR)

Chair College Committee - Title and Full Name:



                                              20
Signature:                                           Date:       /       /

Recommendations of the College Council                Approved           Disapproved

Dean of the College Council - Title and Full Name


Signature:                                           Date:       /       /


PSU INSTITUTIONAL RESEARCH COMMITTEE (IRC) Recommendation

Recommendation of the PSU IRC                         Approved           Disapproved

Chair IRC Committee - Title and Full Name:

Signature:

                                                    Date:            /       /




                                             21
PSU EXTERNAL REVIEW PANEL FOR RESEARCH PROPOSALS Recommendation

Recommendation of the Eternal Review Committee.
    Approved: Amount of grant approved:                 ( SAR)
    Disapproved:
    Postponed:
    Directed to:
Chair of External Review Panel - Title and Full Name:



Signature:                                              Date:         /       /

Recommendation of University Council                       Approved           Disapproved

Signature:                                              Date:             /       /




                                            22

				
DOCUMENT INFO
Shared By:
Categories:
Tags:
Stats:
views:56
posted:5/18/2012
language:English
pages:23
fanzhongqing fanzhongqing http://
About