Simulation & Tools Group

This group is exploring the use of various simulation techniques for research and education, and the development of independent software tools that solve problems in non-computer-science domains.


Project: Snitch! Spotting & Neutralizing Internet Theft by CHeaters

Purpose: Create an application that scans student technical research papers to detect instances of plagiarism from the Internet.

Researchers: Joe Bruno, Tom Way

Research Alumni: Sebastian Niezgoda

Description:

Snitch is a Java application that scans the text in a student paper, identifying passages that might be plagiarized, searching the Internet for matching web sites that contain the passages, and finally presenting an HTMLized version of the original student paper with embedded links to any plagiarized material.

Resources:

Current Tasks:

  • Develop Java class that performs Flesch-Kincaid Grade Level analysis of textual input. Test application should enable the user to select a text file, open it, analyze it and display appropriate statistics such as grade level for each sentence and paragraph, and counts of characters, words, sentences, and paragraphs.
  • Create example program that converts Microsoft Word document into text document
  • Create example program that converts PDF document into text document
  • Locate as many search APIs as possible and design examples of how to use them in a Java program.
  • Determine if there is a way to make use of the newer Google web application API (because they are phasing out the SOAP API, so it will not be feasible to make Google searches from a standard Java application). In other words, could SNITCH be made into a web-based application rather than a stand-alone application?
  • Explore use of Java 2 BreakIterator class for managing input tokenization
  • Investigate using MOSS to create a user interface for programming project plagiarism detection.

Project Plan:

  • Download & install Jigloo to help with UI design and devel
  • Refine user interface: better design, better functionality, prettier, make buttons same size
  • Create better HTML report generation, add viewer
  • Refactor code as needed, make more object-oriented, improve windowing approach (document->paragraphs->sentences->words)
  • Find a way to handle Word doc and PDF input files
  • Devise better candidate selection algorithm, write up specification for it, see if Fleischer Scale is applicable, other analysis techniques
  • Research approaches to plagiarism detection, both automated and manual
  • Alpha release candidate goal: Summer 2008

Project: Network Modeling & Simulation

Purpose: Develop approaches for modeling distributed networking communications and simulate the resulting network  This project is mentioned here only as it relates to any support the ACT Lab can provide to the "real" project which is part of Villanova's Center for Excellence in Enterprise Technology (CEET).  CEET runs, among its research efforts, a large project that uses modeling and simulation techniques for verification of complex communications architectures.

Researchers: Nagesh Javali, Surya Kuchibhotla, Pavani Kolapalli, Tao Tao

Applications: Large scale distributed communications

Approach:

Use Colored Petri Nets and other techniques.

Resources:

Conferences:

  • Worldcomp - megaconference of 28 conferences in Las Vegas, June 26-29, 2006 (papers: Feb. 20, 2006).
  • ECMDA - The Second European Conference on Model Driven Architecture Foundations and Applications (papers: March 3, 2006).
  • ASE 2006 - 21st IEEE/ACM International Conference on Automated Software Engineering, Tokyo September 18-22, 2006 (papers: March 3, 2006).
  • MoDELS/UML 2006 - ACM/IEEE 9th International Conference on Model Driven Engineering Languages and Systems, October 1-6, 2006, Genova, Italy (papers: April 3, 2006).
  • COMPSAC 2006 - 30th Annual International Computer Software and Applications Conference, September 18-21, 2006, Chicago (papers: February 20, 2006).
  • ATVA 2006 - Fourth international symposium on Automated Technology for Verification and Analysis, 23-26 October 2006, Beijing, China (papers: May 20, 2006).
  • FM 2006 - Formal Methods 2006 Conference, August 21-27, 2006, McMaster University, Hamilton, Ontario, Canada (papers: Feb. 24, 2006)
  • FMICS 2006 - 11th International Workshop on Formal Methods for Industrial, August 26-28, 2006, Bonn, Germany (papers: June 2, 2006).
  • WS-FM 2006 - 3rd International Workshop on  Web Services and Formal Methods, September 8-9, 2006, Vienna, Austria (papers: Apr. 21, 2006).
  • SEFM 2006 - 4th IEEE International Conference on Software Engineering and Formal Methods, September 11-15, 2006, Pune, India (papers: April 4, 2006)

Project: Finite State Machine Simulation

Purpose: Develop an educational software tool for teaching students concepts of finite state machines

Researchers: Tim White

Description:

The concept of a finite machine is applicable to many computer science disciplines, as it expresses computing at a very fundamental level.  A useful tool to gain a deeper understanding of finite state machines and automata is to visualize them.  When a student can build and then run an FSM, the ideas become concrete and learning is enhanced.  This research is being done in collaboration with Tim White.

This project involves the implementation of an intuitive graphical interface for designing and simulating an arbitrary finite state machine.  The goal is the development of a Java program that students can download and use to help in learning in courses such as Discrete Structures, and to evaluate its value as an educational tool and write up the results of that evaluation as a research paper.

Resources:

  • jFAST - the Java Finite Automaton Simulation Tool, by Tim White. Complete package available for download.
  • Visual and Interactive Tools - research that focuses on designing tools for visualizing and interacting with theoretical computer science concepts at Duke University (Dr. Susan H. Roger), including JFLAP (a package of graphical tools which can be used as an aid in learning the basic concepts of Formal Languages and Automata Theory).
  • Papers related to JFLAP - and associated tools, good source of target publications and helpful background
  • Finite Automata - web site of links.

Project: Digital Image Measurement Tool

Purpose: Develop a software tool for performing precise measurements within a digital image displayed on a computer

Researchers: Michael Cianni

Applications: Criminology and crime scene investigation, etc.

Approach:

DIMT - the Digital Image Measurement Tool, an innovative application designed by Michael Cianni. The tool enables easy measurement of digital images (distances, areas, etc.) using a collection of virtual measurement tools. This software tool has applicability to surveying, criminal investigation, satellite and surveillance photo interpretation, to name just a few. Current plans are to enhance to existing tool, and conduct experimental research to measure the tools value in one or more of the applicable user domains. Is a digital measuring tool a valuable aide? And if so, which fields is it likely to benefit?

Resources:

  • EXIF.org - Exchangeable Image File Format (EXIF), how extra info is stored with a JPEG from a digital camera
  • Lens FAQ - rec.photo.moderated Photographic Lenses FAQ
  • Digital Photography images - gallery of sample images from various cameras
  • ExifReader - download (one of many such tools out there, including right-click Properties info in Windows XP)

Cameras:

  • Nikon D100 (Dr. Klassner) - does not record "Subject Distance" (probably same for all Nikon)
  • Canon Powershot A20 (acquire?) - DOES record "Subject Distance" (some other Canon models do not, A70, A95)

Project: Algorithms & Data Structures for Business Analysis

Purpose: Develop the theory and framework for a proprietary business analysis approach

Researchers: Tom Way, Mike Peterson (Univ. of Delaware)

Description:

We have developed a k-layers, massively interconnected data structure and analysis framework for use in Dr. Peterson's organizational culture research.  This technology has been implemented in a software tool that provides a flexible and powerful means to manipulate large data sets, enabling a sophisticated, concept-cluster-based, stimulus-response analysis.  The analysis algorithm and data structure significantly improve upon early analysis methods, making it possible to conduct the complex task in a matter of hours rather than days or weeks.

Current plans are to fully develop the software prototype tool, and to refine the data structures and algorithms used in the analysis to improve the tool's efficiency.

Tasks:

  • Identify salient technical innovations from the project
  • Prepare write-up of the technical aspects

updated: 01/09/08

actlab.csc.villanova.edu