stringcompare.preprocessing package

class stringcompare.preprocessing.DeepparseAddressTagger(deepparse_handle)[source]

Bases: stringcompare.preprocessing.tagger.Tagger

LABELS = ['StreetNumber', 'StreetName', 'Unit', 'Municipality', 'Province', 'PostalCode', 'Orientation', 'GeneralDelivery']
batch_tag(objs: List) List[Dict][source]
tag(obj) Dict[source]
class stringcompare.preprocessing.DelimTokenizer(delim=' ')[source]

Bases: stringcompare.preprocessing.tokenizer.Tokenizer

tokenize(sentence)[source]
class stringcompare.preprocessing.NGramTokenizer(n)[source]

Bases: stringcompare.preprocessing.tokenizer.Tokenizer

tokenize(sentence)[source]
class stringcompare.preprocessing.Tagger[source]

Bases: abc.ABC

abstract property LABELS

classmethod(function) -> method

Convert a function to be a class method.

A class method receives the class as implicit first argument, just like an instance method receives the instance. To declare a class method, use this idiom:

class C:

@classmethod def f(cls, arg1, arg2, …):

It can be called either on the class (e.g. C.f()) or on an instance (e.g. C().f()). The instance is ignored except for its class. If a class method is called for a derived class, the derived class object is passed as the implied first argument.

Class methods are different than C++ or Java static methods. If you want those, see the staticmethod builtin.

batch_tag(objs: List) List[Dict][source]
abstract tag(obj) Dict[source]
class stringcompare.preprocessing.Tokenizer[source]

Bases: abc.ABC

String tokenization interface.

batch_tokenize(sentences)[source]
abstract tokenize(sentence)[source]
class stringcompare.preprocessing.WhitespaceTokenizer[source]

Bases: stringcompare.preprocessing.tokenizer.DelimTokenizer

Submodules

stringcompare.preprocessing.tagger module

class stringcompare.preprocessing.tagger.DeepparseAddressTagger(deepparse_handle)[source]

Bases: stringcompare.preprocessing.tagger.Tagger

LABELS = ['StreetNumber', 'StreetName', 'Unit', 'Municipality', 'Province', 'PostalCode', 'Orientation', 'GeneralDelivery']
batch_tag(objs: List) List[Dict][source]
tag(obj) Dict[source]
class stringcompare.preprocessing.tagger.Tagger[source]

Bases: abc.ABC

abstract property LABELS

classmethod(function) -> method

Convert a function to be a class method.

A class method receives the class as implicit first argument, just like an instance method receives the instance. To declare a class method, use this idiom:

class C:

@classmethod def f(cls, arg1, arg2, …):

It can be called either on the class (e.g. C.f()) or on an instance (e.g. C().f()). The instance is ignored except for its class. If a class method is called for a derived class, the derived class object is passed as the implied first argument.

Class methods are different than C++ or Java static methods. If you want those, see the staticmethod builtin.

batch_tag(objs: List) List[Dict][source]
abstract tag(obj) Dict[source]

stringcompare.preprocessing.tokenizer module

class stringcompare.preprocessing.tokenizer.DelimTokenizer(delim=' ')[source]

Bases: stringcompare.preprocessing.tokenizer.Tokenizer

tokenize(sentence)[source]
class stringcompare.preprocessing.tokenizer.NGramTokenizer(n)[source]

Bases: stringcompare.preprocessing.tokenizer.Tokenizer

tokenize(sentence)[source]
class stringcompare.preprocessing.tokenizer.Tokenizer[source]

Bases: abc.ABC

String tokenization interface.

batch_tokenize(sentences)[source]
abstract tokenize(sentence)[source]
class stringcompare.preprocessing.tokenizer.WhitespaceTokenizer[source]

Bases: stringcompare.preprocessing.tokenizer.DelimTokenizer