Java Doc for Tokenizer.java


java.lang.Object

simple.page.translate.Tokenizer

Tokenizer

final class Tokenizer implements Lexer

The Tokenizer extracts valid tokens from the stream of characters given to it for scanning. Tokens are identified using delimiters that mark the start and end of a valid token. Take the well-known JSP syntax as an example: a parsable segment typically opens with the token <% and closes with %>, as in the JSP text below.

 <%= new java.util.Date() %>

This tokenizer can also extract HTML expressions and other such formats when given the start and end of the expression. For example, the following HTML could be used to specify the opening and closing of a valid token.

 <script language='groovy'>
 java.util.Date();
 </script>

The above token will be identified using a case-insensitive match, and whitespace characters can be ignored, so the HTML does not have to be formatted correctly for this tokenizer to extract it as a valid token.

Author: Niall Gallagher

Constructor Summary
public	Tokenizer(Parser parser) Constructor for the `Tokenizer` object.

Method Summary
public void	match(String start, String finish) This method tells the lexer how to extract the tokens from the source document.
public void	match(String start, String finish, String special) This method tells the lexer how to extract the tokens from the source document.
public void	scan(char[] text) This will scan the provided characters for tokens that should be emitted to the `Parser`.
public void	scan(char[] text, int pos, int len) This will scan the provided characters for tokens that should be emitted to the `Parser`.

Constructor Detail

Tokenizer
public Tokenizer(Parser parser)
	Constructor for the `Tokenizer` object. This is used to scan a stream of characters and pass any tokens extracted from the stream to the `Parser`.
Parameters:
  parser - the parser used to parse extracted tokens

Method Detail

match
public void match(String start, String finish)
	This method tells the lexer how to extract the tokens from the source document. It is given the opening and closing tokens used to identify a segment. In languages such as JSP and PHP, code segments are typically opened with a delimiter like `<%` for JSP and `<?php` for PHP. This method allows the lexer to be configured to process such delimiters.
Parameters:
  start - this is the opening token for a segment
  finish - this is the closing token for a segment
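The delimiter-based extraction described above can be illustrated with a minimal, self-contained sketch. It is not the `Tokenizer` implementation: the class `DelimiterSketch` and its `split` method are hypothetical names introduced here, and the sketch assumes an exact, case-sensitive delimiter match with segments returned together with their delimiters.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not the simple.page.translate code.
final class DelimiterSketch {

    // Splits text into plain-text tokens and delimited segments.
    // Segments are returned with their delimiters included.
    static List<String> split(String text, String start, String finish) {
        List<String> tokens = new ArrayList<>();
        int pos = 0;
        while (pos < text.length()) {
            int open = text.indexOf(start, pos);
            if (open < 0) {
                break; // no further segments
            }
            int close = text.indexOf(finish, open + start.length());
            if (close < 0) {
                break; // unterminated segment: the rest is plain text
            }
            if (open > pos) {
                tokens.add(text.substring(pos, open)); // text before the segment
            }
            int end = close + finish.length();
            tokens.add(text.substring(open, end)); // the segment itself
            pos = end;
        }
        if (pos < text.length()) {
            tokens.add(text.substring(pos)); // trailing plain text
        }
        return tokens;
    }

    public static void main(String[] args) {
        String page = "<b>Now:</b> <%= new java.util.Date() %> bye";
        for (String token : split(page, "<%", "%>")) {
            System.out.println("[" + token + "]");
        }
    }
}
```

In this sketch an unterminated segment is simply treated as plain text; how the real lexer handles that case is not stated in this documentation.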

match

public void match(String start, String finish, String special)

This method tells the lexer how to extract the tokens from the source document. It is given the opening and closing tokens used to identify a segment. In languages such as JSP and PHP, code segments are typically opened with a delimiter like <% for JSP and <?php for PHP. This method allows the lexer to be configured to process such delimiters.

With this match method a collection of special characters can be specified. These characters tell the lexer which characters may be surrounded by whitespace. For example, take the HTML expressions below.

 <   script language ='groovy' >
 <script language='groovy'>

The above two HTML expressions should be considered equal using the special characters <, >, and =.
Parameters:
  start - this is the opening token for a segment
  finish - this is the closing token for a segment
  special - this is the set of special characters
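One way to picture the effect of the special characters is as a normalization step applied before comparison. The sketch below is an assumption about the behaviour, not the library's algorithm: `SpecialCharSketch`, `normalize` and `matches` are hypothetical names, the text is lower-cased to model the case-insensitive match, and whitespace is dropped only where it touches one of the special characters.

```java
// Hypothetical illustration only; not the simple.page.translate code.
final class SpecialCharSketch {

    // Lower-cases the text and drops whitespace that touches a special
    // character. Other whitespace runs collapse to a single space.
    static String normalize(String text, String special) {
        StringBuilder out = new StringBuilder();
        int i = 0;
        while (i < text.length()) {
            char c = text.charAt(i);
            if (!Character.isWhitespace(c)) {
                out.append(Character.toLowerCase(c));
                i++;
                continue;
            }
            int j = i;
            while (j < text.length() && Character.isWhitespace(text.charAt(j))) {
                j++; // find the end of this whitespace run
            }
            boolean afterSpecial = out.length() > 0
                    && special.indexOf(out.charAt(out.length() - 1)) >= 0;
            boolean beforeSpecial = j < text.length()
                    && special.indexOf(text.charAt(j)) >= 0;
            boolean atEdge = out.length() == 0 || j == text.length();
            if (!afterSpecial && !beforeSpecial && !atEdge) {
                out.append(' '); // keep one space between ordinary characters
            }
            i = j;
        }
        return out.toString();
    }

    // Two expressions match when their normalized forms are identical.
    static boolean matches(String candidate, String pattern, String special) {
        return normalize(candidate, special).equals(normalize(pattern, special));
    }

    public static void main(String[] args) {
        String loose = "<   script language ='groovy' >";
        String tight = "<script language='groovy'>";
        System.out.println(matches(loose, tight, "<>="));
    }
}
```

Under these assumptions the two HTML expressions above compare as equal when the special characters are <, >, and =, while the space between `script` and `language` remains significant.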

scan
public void scan(char[] text)
	This will scan the provided characters for tokens that should be emitted to the `Parser`. The tokens emitted to the parser object are either plain text tokens or valid segments that require further processing by the parser.
Parameters:
  text - this is the buffer that contains the characters

scan
public void scan(char[] text, int pos, int len)
	This will scan the provided characters for tokens that should be emitted to the `Parser`. The tokens emitted to the parser object are either plain text tokens or valid segments that require further processing by the parser.
Parameters:
  text - this is the buffer that contains the characters
  pos - this is the offset within the buffer to read
  len - this is the number of characters to use
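The `pos` and `len` parameters are useful when a buffer is only partly filled, for example when a document is read in chunks. The following sketch shows that calling pattern under stated assumptions: `RangeScanSketch`, `seen` and `feed` are hypothetical names, the whole-array overload is assumed to delegate to the ranged one, and the "scanner" merely records the characters it was handed instead of emitting tokens.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;

// Hypothetical illustration only; not the simple.page.translate code.
final class RangeScanSketch {

    private final StringBuilder seen = new StringBuilder();

    // Assumed convention: scanning a whole array is the ranged scan over
    // every element of it.
    void scan(char[] text) {
        scan(text, 0, text.length);
    }

    // Only text[pos] through text[pos + len - 1] is examined.
    void scan(char[] text, int pos, int len) {
        if (pos < 0 || len < 0 || pos > text.length - len) {
            throw new IndexOutOfBoundsException("pos=" + pos + ", len=" + len);
        }
        seen.append(text, pos, len);
    }

    String seen() {
        return seen.toString();
    }

    // Feeds a Reader to the scanner in fixed-size chunks.
    static String feed(Reader source, int chunkSize) {
        if (chunkSize <= 0) {
            throw new IllegalArgumentException("chunkSize must be positive");
        }
        RangeScanSketch scanner = new RangeScanSketch();
        char[] buffer = new char[chunkSize];
        try {
            int count;
            while ((count = source.read(buffer)) != -1) {
                scanner.scan(buffer, 0, count); // buffer may be partly filled
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return scanner.seen();
    }

    public static void main(String[] args) {
        System.out.println(feed(new StringReader("<%= x %> tail"), 4));
    }
}
```

Note that a delimiter such as <% may straddle two chunks; whether the real lexer keeps state between `scan` calls is not stated in this documentation.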

Methods inherited from java.lang.Object

protected native Object clone() throws CloneNotSupportedException
public boolean equals(Object obj)
protected void finalize() throws Throwable
public final native Class getClass()
public native int hashCode()
public final native void notify()
public final native void notifyAll()
public String toString()
public final native void wait(long timeout) throws InterruptedException
public final void wait(long timeout, int nanos) throws InterruptedException
public final void wait() throws InterruptedException
