nsISemanticUnitScanner

IID:9f620be4-e535-11d6-b254-00039310a47a
Inherits From:nsISupports

Provides a language independent way to break UNICODE text into meaningful semantic units (e.g. words).

This interface is implemented by the following components:


Methods

PRBool next ( PRUnichar* text , PRInt32 length , PRInt32 pos , PRBool isLastBuffer , out PRInt32 begin , out PRInt32 end ) void start ( char* characterSet )

PRBool next ( PRUnichar* text , PRInt32 length , PRInt32 pos , PRBool isLastBuffer , out PRInt32 begin , out PRInt32 end )

Next() Get the begin / end offset of the next unit in the current text

Arguments:
text: the text to be scanned
length: the number of characters in the text to be processed
pos: the current position
isLastBuffer: , the buffer is the last one
begin: the end offset of the next unit
end
Returns:
has more unit in the current text

void start ( char* characterSet )

Start()

Starts up the semantic unit scanner with an optional character set, which acts as a hint to optimize the heuristics used to determine the language(s) of the processed text.

Arguments:
characterSet: the character set the text was originally encoded in (can be NULL)

Reference documentation is generated from Mozilla's source.

Add a note User Contributed Notes
No comments available

Copyright © 1999 - 2005 XULPlanet.com