Simulation & Tools
Group
This group is exploring the
use of various simulation techniques for research and education, and the
development of independent software tools that solve problems in
non-computer-science domains.
Project: Snitch!
Spotting & Neutralizing Internet Theft
by CHeaters
Purpose: Create an
application that scans student technical research papers to detect instances
of plagiarism from the Internet.
Researchers: Joe Bruno,
Purushotham Ch, Tom Way, misc. nameless others
Research Alumni:
Sebastian Niezgoda
Description:
Snitch is a Java application that scans the
text in a student paper, identifying passages that might be plagiarized,
searching the Internet for matching web sites that contain the passages, and
finally presenting an HTMLized version of the original student paper with
embedded links to any plagiarized material.
Tools
Resources
References
-
Automatic Conceptual Analysis for Plagiarism Detection - Heinz
Dreher, Issues in Informing Science and Information Technology Volume 4,
2007.
-
Plagiarism Detection Software - Marlin Thomas, E-Leader Bangkok,
2008.
-
Detecting and Tracing Plagiarized Documents by Reconstruction
Plagiarism-Evolution Tree - Chang-Keon Ryu, Hyong-Jun Kim, Seung-Hyun
Ji, Gyun Woo, and Hwan-Gue Cho, 2008.
-
Technical Review of Plagiarism Detection Software Report, Joanna
Bull, Carol Collins, Elisabeth Coughlin, Dale Sharp, Technical Report,
Joint Information Systems Committee, Computer Assisted Assessment
Centre, University of Luton, Bedfordshire, UK, 2001.
-
Issues Raised by Use of Turnitin Plagiarism Detection Software -
online article at Cyberdash.com.
-
Comparison of some plagiarism detection software - on German blog,
2007. Version of
comparison study for 2008 is underway.
Current Tasks
- Develop Java class that performs
Flesch-Kincaid Grade Level analysis of textual input. Test application
should enable the user to select a text file, open it, analyze it and
display appropriate statistics such as grade level for each sentence and
paragraph, and counts of characters, words, sentences, and paragraphs.
- Create example program that converts
Microsoft Word document into text document
- Create example program that converts
PDF document into text document
- Locate as many search APIs as possible
and design examples of how to use them in a Java program.
- Determine if there is a way to make
use of the newer Google web application API (because they are phasing
out the SOAP API, so it will not be feasible to make Google searches
from a standard Java application). In other words, could SNITCH be made
into a web-based application rather than a stand-alone application?
- Explore use of Java 2 BreakIterator class
for managing input tokenization
- Investigate using MOSS to create a
user interface for programming project plagiarism detection.
Project Plan
- Download & install Jigloo to help
with UI design and devel
- Refine user interface: better design,
better functionality, prettier, make buttons same size
- Create better HTML report generation,
add viewer
- Refactor code as needed, make more
object-oriented, improve windowing approach
(document->paragraphs->sentences->words)
- Find a way to handle Word doc and PDF input
files
- Devise better candidate selection
algorithm, write up specification for it, see if Fleischer Scale is
applicable, other analysis techniques
- Research approaches to plagiarism
detection, both automated and manual
- Alpha release candidate goal: Summer
2008
Project: Network Modeling &
Simulation
Purpose: Develop
approaches for modeling distributed networking communications and simulate
the resulting network This project is mentioned here only as it
relates to any support the ACT Lab can provide to the "real" project which
is part of Villanova's Center for Excellence in Enterprise Technology (CEET).
CEET runs, among its research efforts, a large project that uses modeling
and simulation techniques for verification of complex communications
architectures.
Researchers: Nagesh
Javali, Surya Kuchibhotla, Pavani Kolapalli, Tao Tao
Applications:
Large scale distributed communications
Approach:
Use Colored Petri Nets and other techniques.
Resources:
Conferences:
-
Worldcomp
- megaconference of 28 conferences in Las Vegas, June 26-29, 2006
(papers: Feb. 20, 2006).
-
ECMDA - The
Second European Conference on Model Driven Architecture Foundations and
Applications (papers: March 3, 2006).
-
ASE 2006 - 21st IEEE/ACM
International Conference on Automated Software Engineering, Tokyo
September 18-22, 2006 (papers: March 3, 2006).
-
MoDELS/UML 2006 -
ACM/IEEE 9th International Conference on Model Driven Engineering
Languages and Systems, October 1-6, 2006, Genova, Italy (papers: April
3, 2006).
-
COMPSAC 2006
- 30th Annual International Computer Software and Applications
Conference, September 18-21, 2006, Chicago (papers: February 20, 2006).
-
ATVA 2006 - Fourth
international symposium on Automated Technology for Verification and
Analysis, 23-26 October 2006, Beijing, China (papers: May 20, 2006).
- FM
2006 - Formal Methods 2006 Conference, August 21-27, 2006, McMaster
University, Hamilton, Ontario, Canada (papers: Feb. 24, 2006)
-
FMICS 2006 - 11th International
Workshop on Formal Methods for Industrial, August 26-28, 2006, Bonn,
Germany (papers: June 2, 2006).
-
WS-FM 2006 - 3rd
International Workshop on Web Services and Formal Methods,
September 8-9, 2006, Vienna, Austria (papers: Apr. 21, 2006).
- SEFM 2006 - 4th IEEE International
Conference on Software Engineering and Formal Methods, September 11-15,
2006, Pune, India (papers: April 4, 2006)
Project: Finite State
Machine Simulation
Purpose: Develop an
educational software tool for teaching students concepts of finite state
machines
Researchers: Tim White
Description:
The concept of a finite machine is applicable to many computer science
disciplines, as it expresses computing at a very fundamental level. A
useful tool to gain a deeper understanding of finite state machines and automata
is to visualize them. When a student can build and then run an FSM, the
ideas become concrete and learning is enhanced. This research is being
done in collaboration with Tim White.
This project involves the implementation of
an intuitive graphical interface for designing and simulating an arbitrary
finite state machine. The goal is the development of a Java program
that students can download and use to help in learning in courses such as
Discrete Structures, and to evaluate its value as an educational tool and
write up the results of that evaluation as a research paper.
Resources:
-
jFAST -
the Java Finite Automaton Simulation Tool, by Tim White. Complete package
available for download.
- Visual and
Interactive Tools - research that focuses on designing tools for
visualizing and interacting with theoretical computer science concepts at Duke
University (Dr. Susan H. Roger),
including
JFLAP (a package of graphical tools which can be used as an aid in
learning the basic concepts of Formal Languages and Automata Theory).
- Papers related to
JFLAP - and associated tools, good source of target publications and
helpful background
- Finite
Automata - web site of links.
Project: Digital Image
Measurement Tool
Purpose: Develop a
software tool for performing precise measurements within a digital image
displayed on a computer
Researchers: Michael
Cianni
Applications:
Criminology and crime scene investigation, etc.
Approach:
DIMT - the Digital Image Measurement Tool, an innovative application
designed by Michael Cianni. The tool enables easy measurement of digital
images (distances, areas, etc.) using a collection of virtual measurement
tools. This software tool has applicability to surveying, criminal
investigation, satellite and surveillance photo interpretation, to name just a
few. Current plans are to enhance to existing tool, and conduct experimental
research to measure the tools value in one or more of the applicable user
domains. Is a digital measuring tool a valuable aide? And if so, which fields
is it likely to benefit?
Resources:
- EXIF.org - Exchangeable Image File
Format (EXIF), how extra info is stored with a JPEG from a digital camera
- Lens FAQ -
rec.photo.moderated Photographic Lenses FAQ
- Digital Photography images
- gallery of sample images from various cameras
-
ExifReader - download (one of many such tools out there, including
right-click Properties info in Windows XP)
Cameras:
- Nikon D100 (Dr. Klassner) - does not record "Subject Distance" (probably
same for all Nikon)
- Canon Powershot A20 (acquire?) - DOES record "Subject Distance" (some other Canon
models do not, A70, A95)
Project: Algorithms & Data
Structures for Business Analysis
Purpose: Develop the
theory and framework for a proprietary business analysis approach
Researchers: Tom Way,
Mike Peterson (Univ. of Delaware)
Description:
We have
developed a k-layers, massively interconnected data structure and analysis
framework for use in Dr. Peterson's organizational culture research. This
technology has been implemented in a software tool that provides a flexible and
powerful means to manipulate large data sets, enabling a sophisticated,
concept-cluster-based, stimulus-response analysis. The analysis algorithm
and data structure significantly improve upon early analysis methods, making it
possible to conduct the complex task in a matter of hours rather than days or
weeks.
Current plans are to fully develop the software prototype tool, and to refine
the data structures and algorithms used in the analysis to improve the tool's
efficiency.
Tasks:
- Identify salient technical innovations from the project
- Prepare write-up of the technical aspects
updated:
03/07/09
actlab.csc.villanova.edu |