org.apache.spark.streamdm.clusterers

StreamKM

class StreamKM extends Clusterer

Implements the StreamKM++ algorithm for data streams. StreamKM++ computes a small (weighted) sample of the stream by using coresets, and then uses it as an input to a k-means++ algorithm. It uses a data structure called BucketManager to handle the coresets.

It uses the following options:

Linear Supertypes
Clusterer, Learner, Serializable, Configurable, Serializable, AnyRef, Any
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. StreamKM
  2. Clusterer
  3. Learner
  4. Serializable
  5. Configurable
  6. Serializable
  7. AnyRef
  8. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Instance Constructors

  1. new StreamKM()

Type Members

  1. type T = BucketManager

    Definition Classes
    StreamKMLearner

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  7. def assign(input: DStream[Example]): DStream[(Example, Double)]

    Assigns examples to clusters, given the current Clusters data structure.

    Assigns examples to clusters, given the current Clusters data structure.

    input

    the DStream of Examples to be assigned a cluster

    returns

    a DStream of tuples containing the original Example and the assigned cluster.

    Definition Classes
    StreamKMClusterer
  8. var bucketmanager: BucketManager

  9. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  10. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  11. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  12. var exampleLearnerSpecification: ExampleSpecification

  13. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  14. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  15. def getClusters: Array[Example]

    Get the currently computed clusters

    Get the currently computed clusters

    returns

    an Array of Examples representing the clusters

    Definition Classes
    StreamKMClusterer
  16. def getModel: BucketManager

    Gets the current Model used for the Learner.

    Gets the current Model used for the Learner.

    returns

    the Model object used for training

    Definition Classes
    StreamKMLearner
  17. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  18. def init(exampleSpecification: ExampleSpecification): Unit

    Init the StreamKM++ algorithm.

    Init the StreamKM++ algorithm.

    exampleSpecification

    the ExampleSpecification of the input stream.

    Definition Classes
    StreamKMLearner
  19. var initialBuffer: Array[Example]

  20. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  21. val kOption: IntOption

  22. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  23. final def notify(): Unit

    Definition Classes
    AnyRef
  24. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  25. var numInstances: Long

  26. val repOption: IntOption

  27. val sizeCoresetOption: IntOption

  28. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  29. def toString(): String

    Definition Classes
    AnyRef → Any
  30. def train(input: DStream[Example]): Unit

    Maintain the BucketManager for coreset extraction, given an input DStream of Example.

    Maintain the BucketManager for coreset extraction, given an input DStream of Example.

    input

    a stream of instances

    Definition Classes
    StreamKMLearner
  31. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  32. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  33. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  34. val widthOption: IntOption

Inherited from Clusterer

Inherited from Learner

Inherited from Serializable

Inherited from Configurable

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped