Class DocFormatParameters

    • Field Detail

      • DEFAULT_LANGUAGE

        public static final ParameterDef<Language> DEFAULT_LANGUAGE
        Defines the default language for new documents
      • DISTRIBUTED

        public static final ParameterDef<Boolean> DISTRIBUTED
        Defines if a document collection should be distributed (true) or local (false)
      • saveMode

        public final ParamMap.Parameter<SaveMode> saveMode
        Whether to overwrite, ignore, or throw an error when writing a corpus to an existing file/directory (default ERROR).
      • distributed

        public final ParamMap.Parameter<Boolean> distributed
        Creates a distributed document collection when the value is set to true (default false).
      • defaultLanguage

        public final ParamMap.Parameter<Language> defaultLanguage
        The default language for new documents. (default calls Hermes.defaultLanguage())
      • normalizers

        public final ParamMap.Parameter<List<TextNormalizer>> normalizers
        The class names of the text normalizes to use when constructing documents. (default calls TextNormalization.configuredInstance().getPreprocessors())
    • Constructor Detail

      • DocFormatParameters

        public DocFormatParameters()
    • Method Detail

      • getDocumentFactory

        public DocumentFactory getDocumentFactory()
        Returns:
        Creates a document factory based on the this set of parameters