Class NameFinderEventStream

  • All Implemented Interfaces:
    java.lang.AutoCloseable, ObjectStream<Event>

    public class NameFinderEventStream
    extends AbstractEventStream<NameSample>
    Class for creating an event stream out of data files for training an name finder.
    • Constructor Detail

      • NameFinderEventStream

        public NameFinderEventStream​(ObjectStream<NameSample> dataStream,
                                     java.lang.String type,
                                     NameContextGenerator contextGenerator,
                                     SequenceCodec<java.lang.String> codec)
        Creates a new name finder event stream using the specified data stream and context generator.
        Parameters:
        dataStream - The data stream of events.
        type - null or overrides the type parameter in the provided samples
        contextGenerator - The context generator used to generate features for the event stream.
    • Method Detail

      • generateOutcomes

        @Deprecated
        public static java.lang.String[] generateOutcomes​(Span[] names,
                                                          java.lang.String type,
                                                          int length)
        Deprecated.
        use the BioCodec implementation of the SequenceValidator instead!
        Generates the name tag outcomes (start, continue, other) for each token in a sentence with the specified length using the specified name spans.
        Parameters:
        names - Token spans for each of the names.
        type - null or overrides the type parameter in the provided samples
        length - The length of the sentence.
        Returns:
        An array of start, continue, other outcomes based on the specified names and sentence length.
      • generateEvents

        public static java.util.List<Event> generateEvents​(java.lang.String[] sentence,
                                                           java.lang.String[] outcomes,
                                                           NameContextGenerator cg)
      • additionalContext

        public static java.lang.String[][] additionalContext​(java.lang.String[] tokens,
                                                             java.util.Map<java.lang.String,​java.lang.String> prevMap)
        Generated previous decision features for each token based on contents of the specified map.
        Parameters:
        tokens - The token for which the context is generated.
        prevMap - A mapping of tokens to their previous decisions.
        Returns:
        An additional context array with features for each token.