API Reference¶

This section provides detailed API reference documentation for the Kura package, automatically generated from the source code using mkdocstrings.

How to Use This Reference¶

The API reference is organized by module, with each module containing related classes and functions. For each class, you'll find:

Constructor parameters and their descriptions
Instance methods with parameter details and return types
Properties and attributes

To use these classes in your code, import them from their specific modules:

# Import functions from their specific modules
from kura.summarisation import summarise_conversations, SummaryModel
from kura.cluster import generate_base_clusters_from_conversation_summaries, ClusterDescriptionModel
from kura.meta_cluster import reduce_clusters_from_base_clusters, MetaClusterModel
from kura.dimensionality import reduce_dimensionality_from_clusters, HDBUMAP
from kura.visualization import visualise_pipeline_results
from kura.types import Conversation
from kura.checkpoints import JSONLCheckpointManager
from kura.cache import DiskCacheStrategy

Core Classes¶

Procedural API¶

The procedural API provides a functional approach to conversation analysis with composable pipeline functions.

Pipeline Functions¶

`kura.summarisation.summarise_conversations(conversations: list[Conversation], *, model: BaseSummaryModel, response_schema: Type[T] = GeneratedSummary, prompt: str = DEFAULT_SUMMARY_PROMPT, temperature: float = 0.2, checkpoint_manager: Optional[BaseCheckpointManager] = None, **kwargs) -> list[ConversationSummary]` `async` ¶

Generate summaries for a list of conversations using the CLIO framework.

This is a pure function that takes conversations and a summary model, and returns conversation summaries with automatic extensibility. Optionally uses checkpointing for efficient re-runs.

The function works with any model that implements BaseSummaryModel, supporting heterogeneous backends (OpenAI, vLLM, Hugging Face, etc.) through polymorphism.

Extensibility Features: - Custom Fields: Extend GeneratedSummary to add custom analysis fields - Prompt Modification: Use prompt to modify CLIO analysis - Automatic Mapping: Extended fields are automatically placed in metadata

Parameters:

Name	Type	Description	Default
`conversations`	`list[Conversation]`	List of conversations to summarize	required
`model`	`BaseSummaryModel`	Model to use for summarization (OpenAI, vLLM, local, etc.)	required
`response_schema`	`Type[T]`	Pydantic model class for LLM output. Extend GeneratedSummary to add custom fields that will appear in metadata	`GeneratedSummary`
`prompt`	`str`	Custom prompt to modify the CLIO analysis	`DEFAULT_SUMMARY_PROMPT`
`temperature`	`float`	LLM temperature for generation	`0.2`
`checkpoint_manager`	`Optional[BaseCheckpointManager]`	Optional checkpoint manager for caching	`None`

Returns:

Type	Description
`list[ConversationSummary]`	List of ConversationSummary objects with core CLIO fields and any
`list[ConversationSummary]`	additional fields from extended schemas in metadata

Example - Basic Usage

model = SummaryModel() summaries = await summarise_conversations( ... conversations=my_conversations, ... model=model ... )

Example - Custom Analysis

class DetailedSummary(GeneratedSummary): ... sentiment: str ... technical_depth: int

summaries = await summarise_conversations( ... conversations=my_conversations, ... model=model, ... response_schema=DetailedSummary, ... prompt="Analyze sentiment and rate technical depth 1-10" ... )

Custom fields available in metadata¶

print(summaries[0].metadata["sentiment"])

Source code in kura/summarisation.py

async def summarise_conversations(
    conversations: list[Conversation],
    *,
    model: BaseSummaryModel,
    response_schema: Type[T] = GeneratedSummary,
    prompt: str = DEFAULT_SUMMARY_PROMPT,
    temperature: float = 0.2,
    checkpoint_manager: Optional[BaseCheckpointManager] = None,
    **kwargs,
) -> list[ConversationSummary]:
    """Generate summaries for a list of conversations using the CLIO framework.

    This is a pure function that takes conversations and a summary model,
    and returns conversation summaries with automatic extensibility.
    Optionally uses checkpointing for efficient re-runs.

    The function works with any model that implements BaseSummaryModel,
    supporting heterogeneous backends (OpenAI, vLLM, Hugging Face, etc.)
    through polymorphism.

    Extensibility Features:
    - **Custom Fields**: Extend GeneratedSummary to add custom analysis fields
    - **Prompt Modification**: Use prompt to modify CLIO analysis
    - **Automatic Mapping**: Extended fields are automatically placed in metadata

    Args:
        conversations: List of conversations to summarize
        model: Model to use for summarization (OpenAI, vLLM, local, etc.)
        response_schema: Pydantic model class for LLM output. Extend GeneratedSummary
                        to add custom fields that will appear in metadata
        prompt: Custom prompt to modify the CLIO analysis
        temperature: LLM temperature for generation
        checkpoint_manager: Optional checkpoint manager for caching

    Returns:
        List of ConversationSummary objects with core CLIO fields and any
        additional fields from extended schemas in metadata

    Example - Basic Usage:
        >>> model = SummaryModel()
        >>> summaries = await summarise_conversations(
        ...     conversations=my_conversations,
        ...     model=model
        ... )

    Example - Custom Analysis:
        >>> class DetailedSummary(GeneratedSummary):
        ...     sentiment: str
        ...     technical_depth: int
        >>>
        >>> summaries = await summarise_conversations(
        ...     conversations=my_conversations,
        ...     model=model,
        ...     response_schema=DetailedSummary,
        ...     prompt="Analyze sentiment and rate technical depth 1-10"
        ... )
        >>> # Custom fields available in metadata
        >>> print(summaries[0].metadata["sentiment"])
    """
    logger.info(
        f"Starting summarization of {len(conversations)} conversations using {type(model).__name__}"
    )

    # Try to load from checkpoint
    if checkpoint_manager:
        cached = checkpoint_manager.load_checkpoint(
            model.checkpoint_filename, ConversationSummary
        )
        if cached:
            logger.info(f"Loaded {len(cached)} summaries from checkpoint")
            return cached

    # Generate raw summaries
    logger.info("Generating new summaries...")
    raw_summaries = await model.summarise(
        conversations,
        response_schema=response_schema,
        temperature=temperature,
        prompt=prompt,
        **kwargs,
    )
    logger.info(f"Generated {len(raw_summaries)} raw summaries")

    # Summaries are already ConversationSummary objects from _summarise_single_conversation
    summaries = raw_summaries
    logger.info(f"Generated {len(summaries)} conversation summaries")

    # Save to checkpoint
    if checkpoint_manager:
        logger.info(f"Saving summaries to checkpoint: {model.checkpoint_filename}")
        checkpoint_manager.save_checkpoint(model.checkpoint_filename, summaries)

    return summaries

`kura.cluster.generate_base_clusters_from_conversation_summaries(summaries: List[ConversationSummary], embedding_model: Optional[BaseEmbeddingModel] = None, clustering_method: Optional[BaseClusteringMethod] = None, clustering_model: Optional[BaseClusterDescriptionModel] = None, checkpoint_manager: Optional[BaseCheckpointManager] = None, max_contrastive_examples: int = 10, prompt: str = DEFAULT_CLUSTER_PROMPT, **kwargs) -> List[Cluster]` `async` ¶

Cluster conversation summaries using embeddings.

Parameters:

Name	Type	Description	Default
`summaries`	`List[ConversationSummary]`	List of conversation summaries to cluster	required
`embedding_model`	`Optional[BaseEmbeddingModel]`	Model for generating embeddings (defaults to OpenAI)	`None`
`clustering_method`	`Optional[BaseClusteringMethod]`	Clustering algorithm (defaults to K-means)	`None`
`clustering_model`	`Optional[BaseClusterDescriptionModel]`	Model for generating cluster descriptions	`None`
`checkpoint_manager`	`Optional[BaseCheckpointManager]`	Optional checkpoint manager for caching	`None`
`max_contrastive_examples`	`int`	Number of contrastive examples to use	`10`
`prompt`	`str`	Custom prompt for cluster generation	`DEFAULT_CLUSTER_PROMPT`
`**kwargs`		Additional parameters for clustering model	`{}`

Returns:

Type	Description
`List[Cluster]`	List of clusters with generated names and descriptions

Source code in kura/cluster.py

async def generate_base_clusters_from_conversation_summaries(
    summaries: List[ConversationSummary],
    embedding_model: Optional[BaseEmbeddingModel] = None,
    clustering_method: Optional[BaseClusteringMethod] = None,
    clustering_model: Optional[BaseClusterDescriptionModel] = None,
    checkpoint_manager: Optional[BaseCheckpointManager] = None,
    max_contrastive_examples: int = 10,
    prompt: str = DEFAULT_CLUSTER_PROMPT,
    **kwargs,
) -> List[Cluster]:
    """
    Cluster conversation summaries using embeddings.

    Args:
        summaries: List of conversation summaries to cluster
        embedding_model: Model for generating embeddings (defaults to OpenAI)
        clustering_method: Clustering algorithm (defaults to K-means)
        clustering_model: Model for generating cluster descriptions
        checkpoint_manager: Optional checkpoint manager for caching
        max_contrastive_examples: Number of contrastive examples to use
        prompt: Custom prompt for cluster generation
        **kwargs: Additional parameters for clustering model

    Returns:
        List of clusters with generated names and descriptions
    """
    if not summaries:
        raise ValueError("Empty summaries list provided")

    # Initialize default models if not provided
    if embedding_model is None:
        embedding_model = OpenAIEmbeddingModel()
    if clustering_method is None:
        clustering_method = KmeansClusteringModel()
    if clustering_model is None:
        clustering_model = ClusterDescriptionModel()

    if checkpoint_manager:
        cached = checkpoint_manager.load_checkpoint(
            clustering_model.checkpoint_filename, Cluster
        )
        if cached:
            logger.info(f"Loaded {len(cached)} clusters from checkpoint")
            return cached

    logger.info(f"Clustering {len(summaries)} conversation summaries")

    # Embed the summaries
    embedded_items = await embed_summaries(summaries, embedding_model)

    # Generate Initial Mapping of Cluster IDs to Summaries
    clusters_id_to_summaries = clustering_method.cluster(embedded_items)

    # Generate Clusters
    clusters = await clustering_model.generate_clusters(
        cluster_id_to_summaries=clusters_id_to_summaries,
        max_contrastive_examples=max_contrastive_examples,
        prompt=prompt,
    )

    if checkpoint_manager:
        checkpoint_manager.save_checkpoint(
            clustering_model.checkpoint_filename, clusters
        )

    return clusters

`kura.meta_cluster.reduce_clusters_from_base_clusters(clusters: list[Cluster], *, model: BaseMetaClusterModel, checkpoint_manager: Optional[BaseCheckpointManager] = None) -> list[Cluster]` `async` ¶

Reduce clusters into a hierarchical structure.

Iteratively combines similar clusters until the number of root clusters is less than or equal to the model's max_clusters setting.

Parameters:

Name	Type	Description	Default
`clusters`	`list[Cluster]`	List of initial clusters to reduce	required
`model`	`BaseMetaClusterModel`	Meta-clustering model to use for reduction	required
`checkpoint_manager`	`Optional[BaseCheckpointManager]`	Optional checkpoint manager for caching	`None`

Returns:

Type	Description
`list[Cluster]`	List of clusters with hierarchical structure

Example

meta_model = MetaClusterModel(max_clusters=5) reduced = await reduce_clusters( ... clusters=base_clusters, ... model=meta_model, ... checkpoint_manager=checkpoint_mgr ... )

Source code in kura/meta_cluster.py

async def reduce_clusters_from_base_clusters(
    clusters: list[Cluster],
    *,
    model: BaseMetaClusterModel,
    checkpoint_manager: Optional[BaseCheckpointManager] = None,
) -> list[Cluster]:
    """Reduce clusters into a hierarchical structure.

    Iteratively combines similar clusters until the number of root clusters
    is less than or equal to the model's max_clusters setting.

    Args:
        clusters: List of initial clusters to reduce
        model: Meta-clustering model to use for reduction
        checkpoint_manager: Optional checkpoint manager for caching

    Returns:
        List of clusters with hierarchical structure

    Example:
        >>> meta_model = MetaClusterModel(max_clusters=5)
        >>> reduced = await reduce_clusters(
        ...     clusters=base_clusters,
        ...     model=meta_model,
        ...     checkpoint_manager=checkpoint_mgr
        ... )
    """
    logger.info(
        f"Starting cluster reduction from {len(clusters)} initial clusters using {type(model).__name__}"
    )

    # Try to load from checkpoint
    if checkpoint_manager:
        cached = checkpoint_manager.load_checkpoint(model.checkpoint_filename, Cluster)
        if cached:
            root_count = len([c for c in cached if c.parent_id is None])
            logger.info(
                f"Loaded {len(cached)} clusters from checkpoint ({root_count} root clusters)"
            )
            return cached

    # Start with all clusters as potential roots
    all_clusters = clusters.copy()
    root_clusters = clusters.copy()

    # Get max_clusters from model if available, otherwise use default
    max_clusters = getattr(model, "max_clusters", 10)
    logger.info(f"Starting with {len(root_clusters)} clusters, target: {max_clusters}")

    # Iteratively reduce until we have desired number of root clusters
    while len(root_clusters) > max_clusters:
        # Get updated clusters from meta-clustering
        new_current_level = await model.reduce_clusters(root_clusters)

        # Find new root clusters (those without parents)
        root_clusters = [c for c in new_current_level if c.parent_id is None]

        # Remove old clusters that now have parents
        old_cluster_ids = {c.id for c in new_current_level if c.parent_id}
        all_clusters = [c for c in all_clusters if c.id not in old_cluster_ids]

        # Add new clusters to the complete list
        all_clusters.extend(new_current_level)

        logger.info(f"Reduced to {len(root_clusters)} root clusters")

    logger.info(
        f"Cluster reduction complete: {len(all_clusters)} total clusters, {len(root_clusters)} root clusters"
    )

    # Save to checkpoint
    if checkpoint_manager:
        checkpoint_manager.save_checkpoint(model.checkpoint_filename, all_clusters)

    return all_clusters

`kura.dimensionality.reduce_dimensionality_from_clusters(clusters: list[Cluster], *, model: BaseDimensionalityReduction, checkpoint_manager: Optional[BaseCheckpointManager] = None) -> list[ProjectedCluster]` `async` ¶

Reduce dimensions of clusters for visualization.

Projects clusters to 2D space using the provided dimensionality reduction model. Supports different algorithms (UMAP, t-SNE, PCA, etc.) through the model interface.

Parameters:

Name	Type	Description	Default
`clusters`	`list[Cluster]`	List of clusters to project	required
`model`	`BaseDimensionalityReduction`	Dimensionality reduction model to use (UMAP, t-SNE, etc.)	required
`checkpoint_manager`	`Optional[BaseCheckpointManager]`	Optional checkpoint manager for caching	`None`

Returns:

Type	Description
`list[ProjectedCluster]`	List of projected clusters with 2D coordinates

Example

dim_model = HDBUMAP(n_components=2) projected = await reduce_dimensionality( ... clusters=hierarchical_clusters, ... model=dim_model, ... checkpoint_manager=checkpoint_mgr ... )

Source code in kura/dimensionality.py

async def reduce_dimensionality_from_clusters(
    clusters: list[Cluster],
    *,
    model: BaseDimensionalityReduction,
    checkpoint_manager: Optional[BaseCheckpointManager] = None,
) -> list[ProjectedCluster]:
    """Reduce dimensions of clusters for visualization.

    Projects clusters to 2D space using the provided dimensionality reduction model.
    Supports different algorithms (UMAP, t-SNE, PCA, etc.) through the model interface.

    Args:
        clusters: List of clusters to project
        model: Dimensionality reduction model to use (UMAP, t-SNE, etc.)
        checkpoint_manager: Optional checkpoint manager for caching

    Returns:
        List of projected clusters with 2D coordinates

    Example:
        >>> dim_model = HDBUMAP(n_components=2)
        >>> projected = await reduce_dimensionality(
        ...     clusters=hierarchical_clusters,
        ...     model=dim_model,
        ...     checkpoint_manager=checkpoint_mgr
        ... )
    """
    logger.info(
        f"Starting dimensionality reduction for {len(clusters)} clusters using {type(model).__name__}"
    )

    # Try to load from checkpoint
    if checkpoint_manager:
        cached = checkpoint_manager.load_checkpoint(
            model.checkpoint_filename, ProjectedCluster
        )
        if cached:
            logger.info(f"Loaded {len(cached)} projected clusters from checkpoint")
            return cached

    # Reduce dimensionality
    logger.info("Projecting clusters to 2D space...")
    projected_clusters = await model.reduce_dimensionality(clusters)
    logger.info(f"Projected {len(projected_clusters)} clusters to 2D")

    # Save to checkpoint
    if checkpoint_manager:
        checkpoint_manager.save_checkpoint(
            model.checkpoint_filename, projected_clusters
        )

    return projected_clusters

Checkpoint Management¶

`kura.checkpoint.CheckpointManager` ¶

Bases: BaseCheckpointManager

Handles checkpoint loading and saving for pipeline steps.

Source code in kura/checkpoint.py

class CheckpointManager(BaseCheckpointManager):
    """Handles checkpoint loading and saving for pipeline steps."""

    def __init__(self, checkpoint_dir: str, *, enabled: bool = True):
        """Initialize checkpoint manager.

        Args:
            checkpoint_dir: Directory for saving checkpoints
            enabled: Whether checkpointing is enabled
        """
        super().__init__(checkpoint_dir, enabled=enabled)

    def setup_checkpoint_dir(self) -> None:
        """Create checkpoint directory if it doesn't exist."""
        if not self.checkpoint_dir.exists():
            self.checkpoint_dir.mkdir(parents=True, exist_ok=True)
            logger.info(f"Created checkpoint directory: {self.checkpoint_dir}")

    def get_checkpoint_path(self, filename: str) -> Path:
        """Get full path for a checkpoint file.

        Args:
            filename: Name of the checkpoint file

        Returns:
            Path object for the checkpoint file
        """
        return self.checkpoint_dir / filename

    def load_checkpoint(self, filename: str, model_class: type[T], **kwargs) -> Optional[List[T]]:
        """Load data from a checkpoint file if it exists.

        Args:
            filename: Name of the checkpoint file
            model_class: Pydantic model class for deserializing the data
            **kwargs: Additional arguments (for compatibility with base class)

        Returns:
            List of model instances if checkpoint exists, None otherwise
        """
        if not self.enabled:
            return None

        checkpoint_path = self.get_checkpoint_path(filename)
        if checkpoint_path.exists():
            logger.info(
                f"Loading checkpoint from {checkpoint_path} for {model_class.__name__}"
            )
            with open(checkpoint_path, "r") as f:
                return [model_class.model_validate_json(line) for line in f]
        return None

    def save_checkpoint(self, filename: str, data: List[T], **kwargs) -> None:
        """Save data to a checkpoint file.

        Args:
            filename: Name of the checkpoint file
            data: List of model instances to save
            **kwargs: Additional arguments (for compatibility with base class)
        """
        if not self.enabled:
            return

        checkpoint_path = self.get_checkpoint_path(filename)
        with open(checkpoint_path, "w") as f:
            for item in data:
                f.write(item.model_dump_json() + "\n")
        logger.info(f"Saved checkpoint to {checkpoint_path} with {len(data)} items")

    def list_checkpoints(self) -> List[str]:
        """List all available checkpoint files."""
        if not self.enabled or not self.checkpoint_dir.exists():
            return []
        return [
            f.name
            for f in self.checkpoint_dir.iterdir()
            if f.is_file()
        ]

    def delete_checkpoint(self, filename: str) -> bool:
        """Delete a checkpoint file."""
        if not self.enabled:
            return False
        checkpoint_path = self.get_checkpoint_path(filename)
        if checkpoint_path.exists():
            checkpoint_path.unlink()
            logger.info(f"Deleted checkpoint: {checkpoint_path}")
            return True
        return False

`init(checkpoint_dir: str, *, enabled: bool = True)` ¶

Initialize checkpoint manager.

Parameters:

Name	Type	Description	Default
`checkpoint_dir`	`str`	Directory for saving checkpoints	required
`enabled`	`bool`	Whether checkpointing is enabled	`True`

Source code in kura/checkpoint.py

def __init__(self, checkpoint_dir: str, *, enabled: bool = True):
    """Initialize checkpoint manager.

    Args:
        checkpoint_dir: Directory for saving checkpoints
        enabled: Whether checkpointing is enabled
    """
    super().__init__(checkpoint_dir, enabled=enabled)

`delete_checkpoint(filename: str) -> bool` ¶

Delete a checkpoint file.

Source code in kura/checkpoint.py

def delete_checkpoint(self, filename: str) -> bool:
    """Delete a checkpoint file."""
    if not self.enabled:
        return False
    checkpoint_path = self.get_checkpoint_path(filename)
    if checkpoint_path.exists():
        checkpoint_path.unlink()
        logger.info(f"Deleted checkpoint: {checkpoint_path}")
        return True
    return False

`get_checkpoint_path(filename: str) -> Path` ¶

Get full path for a checkpoint file.

Parameters:

Name	Type	Description	Default
`filename`	`str`	Name of the checkpoint file	required

Returns:

Type	Description
`Path`	Path object for the checkpoint file

Source code in kura/checkpoint.py

def get_checkpoint_path(self, filename: str) -> Path:
    """Get full path for a checkpoint file.

    Args:
        filename: Name of the checkpoint file

    Returns:
        Path object for the checkpoint file
    """
    return self.checkpoint_dir / filename

`list_checkpoints() -> List[str]` ¶

List all available checkpoint files.

Source code in kura/checkpoint.py

def list_checkpoints(self) -> List[str]:
    """List all available checkpoint files."""
    if not self.enabled or not self.checkpoint_dir.exists():
        return []
    return [
        f.name
        for f in self.checkpoint_dir.iterdir()
        if f.is_file()
    ]

`load_checkpoint(filename: str, model_class: type[T], **kwargs) -> Optional[List[T]]` ¶

Load data from a checkpoint file if it exists.

Parameters:

Name	Type	Description	Default
`filename`	`str`	Name of the checkpoint file	required
`model_class`	`type[T]`	Pydantic model class for deserializing the data	required
`**kwargs`		Additional arguments (for compatibility with base class)	`{}`

Returns:

Type	Description
`Optional[List[T]]`	List of model instances if checkpoint exists, None otherwise

Source code in kura/checkpoint.py

def load_checkpoint(self, filename: str, model_class: type[T], **kwargs) -> Optional[List[T]]:
    """Load data from a checkpoint file if it exists.

    Args:
        filename: Name of the checkpoint file
        model_class: Pydantic model class for deserializing the data
        **kwargs: Additional arguments (for compatibility with base class)

    Returns:
        List of model instances if checkpoint exists, None otherwise
    """
    if not self.enabled:
        return None

    checkpoint_path = self.get_checkpoint_path(filename)
    if checkpoint_path.exists():
        logger.info(
            f"Loading checkpoint from {checkpoint_path} for {model_class.__name__}"
        )
        with open(checkpoint_path, "r") as f:
            return [model_class.model_validate_json(line) for line in f]
    return None

`save_checkpoint(filename: str, data: List[T], **kwargs) -> None` ¶

Save data to a checkpoint file.

Parameters:

Name	Type	Description	Default
`filename`	`str`	Name of the checkpoint file	required
`data`	`List[T]`	List of model instances to save	required
`**kwargs`		Additional arguments (for compatibility with base class)	`{}`

Source code in kura/checkpoint.py

def save_checkpoint(self, filename: str, data: List[T], **kwargs) -> None:
    """Save data to a checkpoint file.

    Args:
        filename: Name of the checkpoint file
        data: List of model instances to save
        **kwargs: Additional arguments (for compatibility with base class)
    """
    if not self.enabled:
        return

    checkpoint_path = self.get_checkpoint_path(filename)
    with open(checkpoint_path, "w") as f:
        for item in data:
            f.write(item.model_dump_json() + "\n")
    logger.info(f"Saved checkpoint to {checkpoint_path} with {len(data)} items")

`setup_checkpoint_dir() -> None` ¶

Create checkpoint directory if it doesn't exist.

Source code in kura/checkpoint.py

def setup_checkpoint_dir(self) -> None:
    """Create checkpoint directory if it doesn't exist."""
    if not self.checkpoint_dir.exists():
        self.checkpoint_dir.mkdir(parents=True, exist_ok=True)
        logger.info(f"Created checkpoint directory: {self.checkpoint_dir}")

Implementation Classes¶

Embedding Models¶

`kura.embedding` ¶

`logger = logging.getLogger(name)` `module-attribute` ¶

`CohereEmbeddingModel` ¶

Bases: BaseEmbeddingModel

Source code in kura/embedding.py

class CohereEmbeddingModel(BaseEmbeddingModel):
    def __init__(
        self,
        model_name: str = "embed-v4.0",
        model_batch_size: int = 96,
        n_concurrent_jobs: int = 5,
        input_type: str = "clustering",
        api_key: str | None = None,
    ):
        from importlib.util import find_spec

        if find_spec("cohere") is None:
            raise ImportError(
                "Cohere package is required for CohereEmbeddingModel. "
                "Install it with: uv pip install cohere"
            )

        from cohere import AsyncClient

        self.client = AsyncClient(api_key=api_key)
        self.model_name = model_name
        self.input_type = input_type
        self._model_batch_size = model_batch_size
        self._n_concurrent_jobs = n_concurrent_jobs
        self._semaphore = Semaphore(n_concurrent_jobs)
        logger.info(
            f"Initialized CohereEmbeddingModel with model={model_name}, batch_size={model_batch_size}, concurrent_jobs={n_concurrent_jobs}, input_type={input_type}"
        )

    def slug(self):
        return f"cohere:{self.model_name}-batchsize:{self._model_batch_size}-concurrent:{self._n_concurrent_jobs}-inputtype:{self.input_type}"

    @retry(wait=wait_fixed(3), stop=stop_after_attempt(3))
    async def _embed_batch(self, texts: list[str]) -> list[list[float]]:
        """Embed a single batch of texts."""
        async with self._semaphore:
            try:
                logger.debug(
                    f"Embedding batch of {len(texts)} texts using model {self.model_name}"
                )
                response = await self.client.embed(
                    texts=texts,
                    model=self.model_name,
                    input_type=self.input_type,
                )
                logger.debug(
                    f"Successfully embedded batch of {len(texts)} texts, got {len(response.embeddings)} embeddings"
                )
                return response.embeddings
            except Exception as e:
                logger.error(f"Failed to embed batch of {len(texts)} texts: {e}")
                raise

    async def embed(self, texts: list[str]) -> list[list[float]]:
        if not texts:
            logger.debug("Empty text list provided, returning empty embeddings")
            return []

        logger.info(f"Starting embedding of {len(texts)} texts using {self.model_name}")

        # Create batches
        batches = batch_texts(texts, self._model_batch_size)
        logger.debug(
            f"Split {len(texts)} texts into {len(batches)} batches of size {self._model_batch_size}"
        )

        # Process all batches concurrently
        tasks = [self._embed_batch(batch) for batch in batches]
        try:
            results_list_of_lists = await gather(*tasks)
            logger.debug(f"Completed embedding {len(batches)} batches")
        except Exception as e:
            logger.error(f"Failed to embed texts: {e}")
            raise

        # Flatten results
        embeddings = []
        for result_batch in results_list_of_lists:
            embeddings.extend(result_batch)

        logger.info(
            f"Successfully embedded {len(texts)} texts, produced {len(embeddings)} embeddings"
        )
        return embeddings

`client = AsyncClient(api_key=api_key)` `instance-attribute` ¶

`input_type = input_type` `instance-attribute` ¶

`model_name = model_name` `instance-attribute` ¶

`init(model_name: str = 'embed-v4.0', model_batch_size: int = 96, n_concurrent_jobs: int = 5, input_type: str = 'clustering', api_key: str | None = None)` ¶

Source code in kura/embedding.py

def __init__(
    self,
    model_name: str = "embed-v4.0",
    model_batch_size: int = 96,
    n_concurrent_jobs: int = 5,
    input_type: str = "clustering",
    api_key: str | None = None,
):
    from importlib.util import find_spec

    if find_spec("cohere") is None:
        raise ImportError(
            "Cohere package is required for CohereEmbeddingModel. "
            "Install it with: uv pip install cohere"
        )

    from cohere import AsyncClient

    self.client = AsyncClient(api_key=api_key)
    self.model_name = model_name
    self.input_type = input_type
    self._model_batch_size = model_batch_size
    self._n_concurrent_jobs = n_concurrent_jobs
    self._semaphore = Semaphore(n_concurrent_jobs)
    logger.info(
        f"Initialized CohereEmbeddingModel with model={model_name}, batch_size={model_batch_size}, concurrent_jobs={n_concurrent_jobs}, input_type={input_type}"
    )

`embed(texts: list[str]) -> list[list[float]]` `async` ¶

Source code in kura/embedding.py

async def embed(self, texts: list[str]) -> list[list[float]]:
    if not texts:
        logger.debug("Empty text list provided, returning empty embeddings")
        return []

    logger.info(f"Starting embedding of {len(texts)} texts using {self.model_name}")

    # Create batches
    batches = batch_texts(texts, self._model_batch_size)
    logger.debug(
        f"Split {len(texts)} texts into {len(batches)} batches of size {self._model_batch_size}"
    )

    # Process all batches concurrently
    tasks = [self._embed_batch(batch) for batch in batches]
    try:
        results_list_of_lists = await gather(*tasks)
        logger.debug(f"Completed embedding {len(batches)} batches")
    except Exception as e:
        logger.error(f"Failed to embed texts: {e}")
        raise

    # Flatten results
    embeddings = []
    for result_batch in results_list_of_lists:
        embeddings.extend(result_batch)

    logger.info(
        f"Successfully embedded {len(texts)} texts, produced {len(embeddings)} embeddings"
    )
    return embeddings

`slug()` ¶

Source code in kura/embedding.py

def slug(self):
    return f"cohere:{self.model_name}-batchsize:{self._model_batch_size}-concurrent:{self._n_concurrent_jobs}-inputtype:{self.input_type}"

`OpenAIEmbeddingModel` ¶

Bases: BaseEmbeddingModel

Source code in kura/embedding.py

class OpenAIEmbeddingModel(BaseEmbeddingModel):
    def __init__(
        self,
        model_name: str = "text-embedding-3-small",
        model_batch_size: int = 50,
        n_concurrent_jobs: int = 5,
    ):
        self.client = AsyncOpenAI()
        self.model_name = model_name
        self._model_batch_size = model_batch_size
        self._n_concurrent_jobs = n_concurrent_jobs
        self._semaphore = Semaphore(n_concurrent_jobs)
        logger.info(
            f"Initialized OpenAIEmbeddingModel with model={model_name}, batch_size={model_batch_size}, concurrent_jobs={n_concurrent_jobs}"
        )

    def slug(self):
        return f"openai:{self.model_name}-batchsize:{self._model_batch_size}-concurrent:{self._n_concurrent_jobs}"

    @retry(wait=wait_fixed(3), stop=stop_after_attempt(3))
    async def _embed_batch(self, texts: list[str]) -> list[list[float]]:
        """Embed a single batch of texts."""
        async with self._semaphore:
            try:
                logger.debug(
                    f"Embedding batch of {len(texts)} texts using model {self.model_name}"
                )
                resp = await self.client.embeddings.create(
                    input=texts, model=self.model_name
                )
                embeddings = [item.embedding for item in resp.data]
                logger.debug(
                    f"Successfully embedded batch of {len(texts)} texts, got {len(embeddings)} embeddings"
                )
                return embeddings
            except Exception as e:
                logger.error(f"Failed to embed batch of {len(texts)} texts: {e}")
                raise

    async def embed(self, texts: list[str]) -> list[list[float]]:
        if not texts:
            logger.debug("Empty text list provided, returning empty embeddings")
            return []

        logger.info(f"Starting embedding of {len(texts)} texts using {self.model_name}")

        # Create batches
        batches = batch_texts(texts, self._model_batch_size)
        logger.debug(
            f"Split {len(texts)} texts into {len(batches)} batches of size {self._model_batch_size}"
        )

        # Process all batches concurrently
        tasks = [self._embed_batch(batch) for batch in batches]
        try:
            results_list_of_lists = await gather(*tasks)
            logger.debug(f"Completed embedding {len(batches)} batches")
        except Exception as e:
            logger.error(f"Failed to embed texts: {e}")
            raise

        # Flatten results
        embeddings = []
        for result_batch in results_list_of_lists:
            embeddings.extend(result_batch)

        logger.info(
            f"Successfully embedded {len(texts)} texts, produced {len(embeddings)} embeddings"
        )
        return embeddings

`client = AsyncOpenAI()` `instance-attribute` ¶

`model_name = model_name` `instance-attribute` ¶

`init(model_name: str = 'text-embedding-3-small', model_batch_size: int = 50, n_concurrent_jobs: int = 5)` ¶

Source code in kura/embedding.py

def __init__(
    self,
    model_name: str = "text-embedding-3-small",
    model_batch_size: int = 50,
    n_concurrent_jobs: int = 5,
):
    self.client = AsyncOpenAI()
    self.model_name = model_name
    self._model_batch_size = model_batch_size
    self._n_concurrent_jobs = n_concurrent_jobs
    self._semaphore = Semaphore(n_concurrent_jobs)
    logger.info(
        f"Initialized OpenAIEmbeddingModel with model={model_name}, batch_size={model_batch_size}, concurrent_jobs={n_concurrent_jobs}"
    )

`embed(texts: list[str]) -> list[list[float]]` `async` ¶

Source code in kura/embedding.py

async def embed(self, texts: list[str]) -> list[list[float]]:
    if not texts:
        logger.debug("Empty text list provided, returning empty embeddings")
        return []

    logger.info(f"Starting embedding of {len(texts)} texts using {self.model_name}")

    # Create batches
    batches = batch_texts(texts, self._model_batch_size)
    logger.debug(
        f"Split {len(texts)} texts into {len(batches)} batches of size {self._model_batch_size}"
    )

    # Process all batches concurrently
    tasks = [self._embed_batch(batch) for batch in batches]
    try:
        results_list_of_lists = await gather(*tasks)
        logger.debug(f"Completed embedding {len(batches)} batches")
    except Exception as e:
        logger.error(f"Failed to embed texts: {e}")
        raise

    # Flatten results
    embeddings = []
    for result_batch in results_list_of_lists:
        embeddings.extend(result_batch)

    logger.info(
        f"Successfully embedded {len(texts)} texts, produced {len(embeddings)} embeddings"
    )
    return embeddings

`slug()` ¶

Source code in kura/embedding.py

def slug(self):
    return f"openai:{self.model_name}-batchsize:{self._model_batch_size}-concurrent:{self._n_concurrent_jobs}"

`SentenceTransformerEmbeddingModel` ¶

Bases: BaseEmbeddingModel

Source code in kura/embedding.py

class SentenceTransformerEmbeddingModel(BaseEmbeddingModel):
    def __init__(
        self,
        model_name: str = "all-MiniLM-L6-v2",
        model_batch_size: int = 128,
        device: str = "cpu",
    ):
        from sentence_transformers import SentenceTransformer  # type: ignore

        logger.info(
            f"Initializing SentenceTransformerEmbeddingModel with model={model_name}, batch_size={model_batch_size}"
        )
        try:
            self.model = SentenceTransformer(model_name, device=device)
            self.model_name = model_name
            self._model_batch_size = model_batch_size
            logger.info(f"Successfully loaded SentenceTransformer model: {model_name}")
        except Exception as e:
            logger.error(f"Failed to load SentenceTransformer model {model_name}: {e}")
            raise

    def slug(self) -> str:
        return f"sentence-transformers:{self.model_name}-batchsize:{self._model_batch_size}"

    @retry(wait=wait_fixed(3), stop=stop_after_attempt(3))
    async def embed(self, texts: list[str]) -> list[list[float]]:
        if not texts:
            logger.debug("Empty text list provided, returning empty embeddings")
            return []

        logger.info(
            f"Starting embedding of {len(texts)} texts using SentenceTransformer"
        )

        # Create batches
        batches = batch_texts(texts, self._model_batch_size)
        logger.debug(
            f"Split {len(texts)} texts into {len(batches)} batches of size {self._model_batch_size}"
        )

        # Process all batches
        embeddings = []
        try:
            for i, batch in enumerate(batches):
                logger.debug(
                    f"Processing batch {i + 1}/{len(batches)} with {len(batch)} texts"
                )
                batch_embeddings = self.model.encode(batch).tolist()
                embeddings.extend(batch_embeddings)
                logger.debug(f"Completed batch {i + 1}/{len(batches)}")

            logger.info(
                f"Successfully embedded {len(texts)} texts using SentenceTransformer, produced {len(embeddings)} embeddings"
            )
        except Exception as e:
            logger.error(f"Failed to embed texts using SentenceTransformer: {e}")
            raise

        return embeddings

`model = SentenceTransformer(model_name, device=device)` `instance-attribute` ¶

`model_name = model_name` `instance-attribute` ¶

`init(model_name: str = 'all-MiniLM-L6-v2', model_batch_size: int = 128, device: str = 'cpu')` ¶

Source code in kura/embedding.py

def __init__(
    self,
    model_name: str = "all-MiniLM-L6-v2",
    model_batch_size: int = 128,
    device: str = "cpu",
):
    from sentence_transformers import SentenceTransformer  # type: ignore

    logger.info(
        f"Initializing SentenceTransformerEmbeddingModel with model={model_name}, batch_size={model_batch_size}"
    )
    try:
        self.model = SentenceTransformer(model_name, device=device)
        self.model_name = model_name
        self._model_batch_size = model_batch_size
        logger.info(f"Successfully loaded SentenceTransformer model: {model_name}")
    except Exception as e:
        logger.error(f"Failed to load SentenceTransformer model {model_name}: {e}")
        raise

`embed(texts: list[str]) -> list[list[float]]` `async` ¶

Source code in kura/embedding.py

@retry(wait=wait_fixed(3), stop=stop_after_attempt(3))
async def embed(self, texts: list[str]) -> list[list[float]]:
    if not texts:
        logger.debug("Empty text list provided, returning empty embeddings")
        return []

    logger.info(
        f"Starting embedding of {len(texts)} texts using SentenceTransformer"
    )

    # Create batches
    batches = batch_texts(texts, self._model_batch_size)
    logger.debug(
        f"Split {len(texts)} texts into {len(batches)} batches of size {self._model_batch_size}"
    )

    # Process all batches
    embeddings = []
    try:
        for i, batch in enumerate(batches):
            logger.debug(
                f"Processing batch {i + 1}/{len(batches)} with {len(batch)} texts"
            )
            batch_embeddings = self.model.encode(batch).tolist()
            embeddings.extend(batch_embeddings)
            logger.debug(f"Completed batch {i + 1}/{len(batches)}")

        logger.info(
            f"Successfully embedded {len(texts)} texts using SentenceTransformer, produced {len(embeddings)} embeddings"
        )
    except Exception as e:
        logger.error(f"Failed to embed texts using SentenceTransformer: {e}")
        raise

    return embeddings

`slug() -> str` ¶

Source code in kura/embedding.py

def slug(self) -> str:
    return f"sentence-transformers:{self.model_name}-batchsize:{self._model_batch_size}"

`embed_summaries(summaries: list[ConversationSummary], embedding_model: BaseEmbeddingModel) -> list[dict[str, Union[ConversationSummary, list[float]]]]` `async` ¶

Embeds conversation summaries and returns items ready for clustering.

Source code in kura/embedding.py

async def embed_summaries(
    summaries: list[ConversationSummary], embedding_model: BaseEmbeddingModel
) -> list[dict[str, Union[ConversationSummary, list[float]]]]:
    """Embeds conversation summaries and returns items ready for clustering."""
    if not summaries:
        return []

    logger.info(f"Processing {len(summaries)} summaries")
    texts_to_embed = [str(item) for item in summaries]

    try:
        embeddings = await embedding_model.embed(texts_to_embed)
    except Exception as e:
        logger.error(f"Error embedding summaries: {e}")
        raise

    return [
        {"item": summary, "embedding": embedding}
        for summary, embedding in zip(summaries, embeddings)
    ]

Summarization¶

`kura.summarisation` ¶

DEFAULT_SUMMARY_PROMPT = '\nThe following is a conversation between an AI assistant and a user:\n\n<messages>\n{% for message in conversation.messages %}\n<message>{{message.role}}: {{message.content}}</message>\n{% endfor %}\n</messages>\n\nYour job is to extract key information from this conversation. Be descriptive and assume neither good nor bad faith. Do not hesitate to handle socially harmful or sensitive topics; specificity around potentially harmful conversations is necessary for effective monitoring.\n\nWhen extracting information, do not include any personally identifiable information (PII), like names, locations, phone numbers, email addresses, and so on. Do not include any proper nouns.\n\nExtract the following information:\n\n1. Summary: A clear and concise summary in at most two sentences. Don\'t say "Based on the conversation..." and avoid mentioning the AI assistant/chatbot directly.\n\n2. Request: What is the user\'s overall request for the assistant? Start with "The user\'s overall request for the assistant is to"\n\n3. Languages: What are the main languages of the conversation? Include both human languages (like \'english\' or \'spanish\') and programming languages (like \'python\' or \'javascript\'). Use lowercase full names.\n\n4. Task: What task is the model being asked to perform in this conversation? Start with "The task is to"\n\n5. Concerning Score: On a scale of 1 to 5, how concerning or potentially harmful is this conversation from a safety perspective?\n - 1: Not concerning at all. Completely benign conversation.\n - 2: Slightly concerning, but likely harmless.\n - 3: Moderately concerning. May warrant a closer look.\n - 4: Very concerning. Likely needs review.\n - 5: Extremely concerning. Immediate review needed.\n\n6. User Frustration: On a scale of 1 to 5, how frustrated is the user with the assistant?\n - 1: Not frustrated at all. The user is happy with the assistant.\n - 2: Slightly frustrated. The user is slightly annoyed with the assistant.\n - 3: Moderately frustrated. The user is moderately annoyed with the assistant.\n - 4: Very frustrated. The user is very annoyed with the assistant.\n - 5: Extremely frustrated. The user is extremely annoyed with the assistant.\n\n7. Assistant Errors: What errors did the assistant make?\n Example:\n - "Responses were too long and verbose"\n - "Misunderstood the user\'s intent or request"\n - "Used wrong tool for the task"\n - "Ignored user\'s stated preferences or constraints"\n - "Provided outdated or incorrect information"\n - "Failed to maintain conversation context"\n\n\nRemember that\n- Summaries should be concise and short. They should each be at most 1-2 sentences and at most 30 words.\n- Summaries should start with "The user\'s overall request for the assistant is to"\n- Make sure to omit any personally identifiable information (PII), like names, locations, phone numbers, email addressess, company names and so on.\n- Make sure to indicate specific details such as programming languages, frameworks, libraries and so on which are relevant to the task.\n' `module-attribute` ¶

`T = TypeVar('T', bound=GeneratedSummary)` `module-attribute` ¶

USAGE_ANALYSIS_PROMPT = '\nThe following is a conversation between an AI assistant and a user:\n\n<messages>\n{% for message in conversation.messages %}\n<message>{{message.role}}: {{message.content}}</message>\n{% endfor %}\n</messages>\n\nYour job is to analyze this conversation to understand system usage patterns and failure modes. Focus on HOW the system is being used, WHAT users are trying to accomplish, and WHERE the system succeeds or fails to meet user needs.\n\nWhen extracting information, do not include any personally identifiable information (PII), like names, locations, phone numbers, email addresses, and so on. Do not include any proper nouns.\n\nExtract the following information:\n\n1. Summary: Describe the user\'s interaction pattern and what they were trying to accomplish. Focus on the type of usage, complexity of the task, and whether the system met their needs effectively.\n\n2. Request: Characterize the user\'s core need and how they approached getting help. Consider: Was this exploratory, problem-solving, creative, instructional, or transactional?\n\n3. Languages: What are the main languages of the conversation? Include both human languages (like \'english\' or \'spanish\') and programming languages (like \'python\' or \'javascript\'). Use lowercase full names.\n\n4. Task: Categorize the task complexity and domain. Consider: Is this a beginner question, expert consultation, creative collaboration, debugging session, or knowledge lookup?\n\n5. Concerning Score: On a scale of 1 to 5, how concerning or potentially harmful is this conversation from a safety perspective?\n - 1: Not concerning at all. Completely benign conversation.\n - 2: Slightly concerning, but likely harmless.\n - 3: Moderately concerning. May warrant a closer look.\n - 4: Very concerning. Likely needs review.\n - 5: Extremely concerning. Immediate review needed.\n\n6. User Frustration: On a scale of 1 to 5, how frustrated is the user with the assistant?\n - 1: Not frustrated at all. The user is happy with the assistant.\n - 2: Slightly frustrated. The user is slightly annoyed with the assistant.\n - 3: Moderately frustrated. The user is moderately annoyed with the assistant.\n - 4: Very frustrated. The user is very annoyed with the assistant.\n - 5: Extremely frustrated. The user is extremely annoyed with the assistant.\n\n7. Assistant Errors: What specific ways did the assistant fail to serve the user effectively?\n Focus on systemic issues:\n - "Provided generic responses when specific domain expertise was needed"\n - "Failed to understand the user\'s context or skill level"\n - "Gave technically incorrect or outdated information" \n - "Didn\'t follow the user\'s explicit instructions or constraints"\n - "Responses were too complex/simple for the user\'s expertise level"\n - "Misunderstood the core problem the user was trying to solve"\n - "Failed to provide actionable next steps or concrete guidance"\n\nRemember that:\n- Focus on usage patterns that would help improve the system\n- Identify common failure modes and success patterns\n- Characterize user needs and system performance gaps\n- Group similar usage patterns and failure types for system insights\n- Consider the user\'s expertise level and how well the system adapted to it\n' `module-attribute` ¶

`logger = logging.getLogger(name)` `module-attribute` ¶

`SummaryModel` ¶

Bases: BaseSummaryModel

Instructor-based summary model for conversation analysis.

Example - Custom Schema

class CustomSummary(GeneratedSummary): ... sentiment: str ... complexity: int

summaries = await model.summarise( ... conversations, ... response_schema=CustomSummary ... )

sentiment & complexity will be in summaries[0].metadata¶

Example - Custom Prompt

summaries = await model.summarise( ... conversations, ... prompt="Also assess the technical complexity on a scale of 1-10." ... )

Source code in kura/summarisation.py

class SummaryModel(BaseSummaryModel):
    """
    Instructor-based summary model for conversation analysis.

    Example - Custom Schema:
        >>> class CustomSummary(GeneratedSummary):
        ...     sentiment: str
        ...     complexity: int
        >>>
        >>> summaries = await model.summarise(
        ...     conversations,
        ...     response_schema=CustomSummary
        ... )
        # sentiment & complexity will be in summaries[0].metadata

    Example - Custom Prompt:
        >>> summaries = await model.summarise(
        ...     conversations,
        ...     prompt="Also assess the technical complexity on a scale of 1-10."
        ... )
    """

    def __init__(
        self,
        model: Union[str, "KnownModelName"] = "openai/gpt-4o-mini",
        max_concurrent_requests: int = 50,
        checkpoint_filename: str = "summaries",
        console: Optional[Console] = None,
        cache: Optional[CacheStrategy] = None,
    ):
        """
        Initialize SummaryModel with core configuration.

        Per-use configuration (schemas, prompts, temperature) are method parameters.

        Args:
            model: model identifier (e.g., "openai/gpt-4o-mini")
            max_concurrent_requests: Maximum concurrent API requests
            cache: Caching strategy to use (optional)
        """
        self.model = model
        self.max_concurrent_requests = max_concurrent_requests
        self._checkpoint_filename = checkpoint_filename
        self.console = console

        # Initialize cache
        self.cache = cache

        cache_info = type(self.cache).__name__ if self.cache else "None"
        logger.info(
            f"Initialized SummaryModel with model={model}, max_concurrent_requests={max_concurrent_requests}, cache={cache_info}"
        )

    @property
    def checkpoint_filename(self) -> str:
        """Return the filename to use for checkpointing this model's output."""
        return self._checkpoint_filename

    def _get_cache_key(
        self,
        conversation: Conversation,
        response_schema: Type[T],
        prompt: str,
        temperature: float,
        **kwargs,
    ) -> str:
        """Generate a cache key from conversation messages and parameters."""
        # Create role-content pairs for each message
        message_data = [(msg.role, msg.content) for msg in conversation.messages]

        # Include all parameters that affect the output
        cache_components = (
            tuple(message_data),
            response_schema.__name__,
            hashlib.md5(prompt.encode()).hexdigest(),
            temperature,
            self.model,
        )

        return hashlib.md5(str(cache_components).encode()).hexdigest()

    async def summarise(
        self,
        conversations: list[Conversation],
        prompt: str = DEFAULT_SUMMARY_PROMPT,
        *,
        response_schema: Type[T] = GeneratedSummary,
        temperature: float = 0.2,
        **kwargs,
    ) -> list[ConversationSummary]:
        """
        Summarise conversations with configurable parameters.

        This method uses the CLIO conversation analysis framework, with automatic
        extensibility for custom fields and prompt modifications.

        Args:
            conversations: List of conversations to summarize
            response_schema: Pydantic model class for structured LLM output.
                           Extend GeneratedSummary to add custom fields that will
                           automatically be included in ConversationSummary.metadata
            prompt: Custom prompt for CLIO analysis
            temperature: LLM temperature for generation

        Returns:
            List of ConversationSummary objects with core fields populated and
            any additional fields from extended schemas in metadata

        Example:
            >>> class CustomSummary(GeneratedSummary):
            ...     sentiment: str
            ...     technical_complexity: int
            >>>
            >>> summaries = await model.summarise(
            ...     conversations,
            ...     response_schema=CustomSummary,
            ...     prompt="Rate sentiment and technical complexity 1-10"
            ... )
            >>> # Access core fields
            >>> print(summaries[0].summary)
            >>> # Access custom fields in metadata
            >>> print(summaries[0].metadata["sentiment"])
        """
        # Initialize semaphore per-run to match event loop
        self.semaphore = asyncio.Semaphore(self.max_concurrent_requests)

        logger.info(
            f"Starting summarization of {len(conversations)} conversations using model {self.model}"
        )

        import instructor

        client = instructor.from_provider(self.model, async_client=True)

        if not self.console:
            # Simple progress tracking with tqdm
            summaries = await tqdm_asyncio.gather(
                *[
                    self._summarise_single_conversation(
                        conversation,
                        client=client,
                        response_schema=response_schema,
                        prompt=prompt,
                        temperature=temperature,
                        **kwargs,
                    )
                    for conversation in conversations
                ],
                desc=f"Summarising {len(conversations)} conversations",
            )
        else:
            # Rich console progress tracking with live summary display
            summaries = await self._summarise_with_console(
                conversations,
                client=client,
                response_schema=response_schema,
                prompt=prompt,
                temperature=temperature,
                **kwargs,
            )

        logger.info(
            f"Completed summarization of {len(conversations)} conversations, produced {len(summaries)} summaries"
        )
        return summaries

    async def _summarise_single_conversation(
        self,
        conversation: Conversation,
        *,
        client,
        response_schema: Type[T],
        prompt: str,
        temperature: float,
        **kwargs,
    ) -> ConversationSummary:
        """
        Private method to summarise a single conversation.

        Automatically maps all fields from the response_schema to ConversationSummary:
        - Known GeneratedSummary fields are mapped directly to ConversationSummary fields
        - Additional fields from extended schemas are placed in metadata for extensibility
        """
        logger.debug(
            f"Starting summarization of conversation {conversation.chat_id} with {len(conversation.messages)} messages"
        )

        # Check cache first
        if self.cache:
            cache_key = self._get_cache_key(
                conversation, response_schema, prompt, temperature, **kwargs
            )
            cached_result = self.cache.get(cache_key)
            if cached_result is not None:
                logger.debug(
                    f"Found cached summary for conversation {conversation.chat_id}"
                )
                return cached_result

        async with self.semaphore:  # type: ignore
            try:
                resp = await client.chat.completions.create(  # type: ignore
                    temperature=temperature,
                    messages=[
                        {
                            "role": "user",
                            "content": prompt,
                        },
                    ],
                    context={
                        "conversation": conversation,
                    },
                    response_model=response_schema,
                    **kwargs,
                )
                logger.debug(
                    f"Successfully generated summary for conversation {conversation.chat_id}"
                )
            except Exception as e:
                logger.error(
                    f"Failed to generate summary for conversation {conversation.chat_id}: {e}"
                )
                raise

        logger.debug(
            f"Completed summarization of conversation {conversation.chat_id} - concerning_score: {getattr(resp, 'concerning_score', None)}, user_frustration: {getattr(resp, 'user_frustration', None)}"
        )

        # Extract response data
        response_dict = resp.model_dump()

        # Known GeneratedSummary fields that map directly to ConversationSummary
        known_fields = {
            "summary",
            "request",
            "topic",
            "languages",
            "task",
            "concerning_score",
            "user_frustration",
            "assistant_errors",
        }

        # Extract known fields for direct mapping
        known_data = {k: v for k, v in response_dict.items() if k in known_fields}

        # Put unknown fields in metadata (for extended GeneratedSummary subclasses)
        extra_fields = {k: v for k, v in response_dict.items() if k not in known_fields}

        result = ConversationSummary(
            chat_id=conversation.chat_id,
            metadata={
                "conversation_turns": len(conversation.messages),
                **conversation.metadata,
                **extra_fields,  # Additional fields from extended schemas
            },
            **known_data,
        )

        # Cache the result
        if self.cache:
            self.cache.set(cache_key, result)
            logger.debug(f"Cached summary for conversation {conversation.chat_id}")

        return result

    async def _summarise_with_console(
        self,
        conversations: list[Conversation],
        *,
        client,
        response_schema: Type[T],
        prompt: str,
        temperature: float,
        **kwargs,
    ) -> list[ConversationSummary]:
        """
        Summarise conversations with full-screen Rich console display showing progress and latest 3 results.

        Returns ConversationSummary objects with automatic field mapping from response_schema.
        """
        from rich.live import Live
        from rich.layout import Layout
        from rich.panel import Panel
        from rich.text import Text
        from rich.progress import (
            Progress,
            SpinnerColumn,
            TextColumn,
            TaskProgressColumn,
            TimeRemainingColumn,
        )

        summaries = []
        completed_summaries = []
        max_preview_items = 3

        # Create full-screen layout
        layout = Layout()
        layout.split_column(Layout(name="progress", size=3), Layout(name="preview"))

        # Create progress bar
        progress = Progress(
            SpinnerColumn(),
            TextColumn("[progress.description]{task.description}"),
            TaskProgressColumn(),
            TimeRemainingColumn(),
            console=self.console,
        )
        task_id = progress.add_task("", total=len(conversations))
        layout["progress"].update(progress)

        def update_preview_display():
            if completed_summaries:
                preview_text = Text()

                for summary in completed_summaries[
                    -max_preview_items:
                ]:  # Show latest 3
                    preview_text.append(
                        f"summary: {summary.summary or 'No summary'}\n", style="white"
                    )
                    concern = summary.concerning_score or 0
                    frustration = summary.user_frustration or 0
                    preview_text.append(
                        f"Concern: {concern}/5, Frustration: {frustration}/5\n\n",
                        style="yellow",
                    )

                layout["preview"].update(
                    Panel(
                        preview_text,
                        title=f"[green]Generated Summaries ({len(completed_summaries)}/{len(conversations)})",
                        border_style="green",
                    )
                )
            else:
                layout["preview"].update(
                    Panel(
                        Text("Waiting for summaries...", style="dim"),
                        title="[yellow]Generated Summaries (0/0)",
                        border_style="yellow",
                    )
                )

        # Initialize preview display
        update_preview_display()

        with Live(layout, console=self.console, refresh_per_second=4):
            # Process conversations concurrently
            tasks = []
            for conversation in conversations:
                coro = self._summarise_single_conversation(
                    conversation,
                    client=client,
                    response_schema=response_schema,
                    prompt=prompt,
                    temperature=temperature,
                    **kwargs,
                )
                tasks.append(coro)

            # Use asyncio.as_completed to show results as they finish
            for i, coro in enumerate(asyncio.as_completed(tasks)):
                try:
                    summary = await coro
                    summaries.append(summary)
                    completed_summaries.append(summary)

                    # Update progress
                    progress.update(task_id, completed=i + 1)

                    # Update preview display
                    update_preview_display()

                except Exception as e:
                    logger.error(f"Failed to summarise conversation: {e}")
                    # Still update progress on error
                    progress.update(task_id, completed=i + 1)
                    update_preview_display()

        return summaries

`cache = cache` `instance-attribute` ¶

`checkpoint_filename: str` `property` ¶

Return the filename to use for checkpointing this model's output.

`console = console` `instance-attribute` ¶

`max_concurrent_requests = max_concurrent_requests` `instance-attribute` ¶

`model = model` `instance-attribute` ¶

`init(model: Union[str, KnownModelName] = 'openai/gpt-4o-mini', max_concurrent_requests: int = 50, checkpoint_filename: str = 'summaries', console: Optional[Console] = None, cache: Optional[CacheStrategy] = None)` ¶

Initialize SummaryModel with core configuration.

Per-use configuration (schemas, prompts, temperature) are method parameters.

Parameters:

Name	Type	Description	Default
`model`	`Union[str, KnownModelName]`	model identifier (e.g., "openai/gpt-4o-mini")	`'openai/gpt-4o-mini'`
`max_concurrent_requests`	`int`	Maximum concurrent API requests	`50`
`cache`	`Optional[CacheStrategy]`	Caching strategy to use (optional)	`None`

Source code in kura/summarisation.py

def __init__(
    self,
    model: Union[str, "KnownModelName"] = "openai/gpt-4o-mini",
    max_concurrent_requests: int = 50,
    checkpoint_filename: str = "summaries",
    console: Optional[Console] = None,
    cache: Optional[CacheStrategy] = None,
):
    """
    Initialize SummaryModel with core configuration.

    Per-use configuration (schemas, prompts, temperature) are method parameters.

    Args:
        model: model identifier (e.g., "openai/gpt-4o-mini")
        max_concurrent_requests: Maximum concurrent API requests
        cache: Caching strategy to use (optional)
    """
    self.model = model
    self.max_concurrent_requests = max_concurrent_requests
    self._checkpoint_filename = checkpoint_filename
    self.console = console

    # Initialize cache
    self.cache = cache

    cache_info = type(self.cache).__name__ if self.cache else "None"
    logger.info(
        f"Initialized SummaryModel with model={model}, max_concurrent_requests={max_concurrent_requests}, cache={cache_info}"
    )

`summarise(conversations: list[Conversation], prompt: str = DEFAULT_SUMMARY_PROMPT, *, response_schema: Type[T] = GeneratedSummary, temperature: float = 0.2, **kwargs) -> list[ConversationSummary]` `async` ¶

Summarise conversations with configurable parameters.

This method uses the CLIO conversation analysis framework, with automatic extensibility for custom fields and prompt modifications.

Parameters:

Name	Type	Description	Default
`conversations`	`list[Conversation]`	List of conversations to summarize	required
`response_schema`	`Type[T]`	Pydantic model class for structured LLM output. Extend GeneratedSummary to add custom fields that will automatically be included in ConversationSummary.metadata	`GeneratedSummary`
`prompt`	`str`	Custom prompt for CLIO analysis	`DEFAULT_SUMMARY_PROMPT`
`temperature`	`float`	LLM temperature for generation	`0.2`

Returns:

Type	Description
`list[ConversationSummary]`	List of ConversationSummary objects with core fields populated and
`list[ConversationSummary]`	any additional fields from extended schemas in metadata

Example

class CustomSummary(GeneratedSummary): ... sentiment: str ... technical_complexity: int

summaries = await model.summarise( ... conversations, ... response_schema=CustomSummary, ... prompt="Rate sentiment and technical complexity 1-10" ... )

Access core fields¶

print(summaries[0].summary)

Access custom fields in metadata¶

print(summaries[0].metadata["sentiment"])

Source code in kura/summarisation.py

async def summarise(
    self,
    conversations: list[Conversation],
    prompt: str = DEFAULT_SUMMARY_PROMPT,
    *,
    response_schema: Type[T] = GeneratedSummary,
    temperature: float = 0.2,
    **kwargs,
) -> list[ConversationSummary]:
    """
    Summarise conversations with configurable parameters.

    This method uses the CLIO conversation analysis framework, with automatic
    extensibility for custom fields and prompt modifications.

    Args:
        conversations: List of conversations to summarize
        response_schema: Pydantic model class for structured LLM output.
                       Extend GeneratedSummary to add custom fields that will
                       automatically be included in ConversationSummary.metadata
        prompt: Custom prompt for CLIO analysis
        temperature: LLM temperature for generation

    Returns:
        List of ConversationSummary objects with core fields populated and
        any additional fields from extended schemas in metadata

    Example:
        >>> class CustomSummary(GeneratedSummary):
        ...     sentiment: str
        ...     technical_complexity: int
        >>>
        >>> summaries = await model.summarise(
        ...     conversations,
        ...     response_schema=CustomSummary,
        ...     prompt="Rate sentiment and technical complexity 1-10"
        ... )
        >>> # Access core fields
        >>> print(summaries[0].summary)
        >>> # Access custom fields in metadata
        >>> print(summaries[0].metadata["sentiment"])
    """
    # Initialize semaphore per-run to match event loop
    self.semaphore = asyncio.Semaphore(self.max_concurrent_requests)

    logger.info(
        f"Starting summarization of {len(conversations)} conversations using model {self.model}"
    )

    import instructor

    client = instructor.from_provider(self.model, async_client=True)

    if not self.console:
        # Simple progress tracking with tqdm
        summaries = await tqdm_asyncio.gather(
            *[
                self._summarise_single_conversation(
                    conversation,
                    client=client,
                    response_schema=response_schema,
                    prompt=prompt,
                    temperature=temperature,
                    **kwargs,
                )
                for conversation in conversations
            ],
            desc=f"Summarising {len(conversations)} conversations",
        )
    else:
        # Rich console progress tracking with live summary display
        summaries = await self._summarise_with_console(
            conversations,
            client=client,
            response_schema=response_schema,
            prompt=prompt,
            temperature=temperature,
            **kwargs,
        )

    logger.info(
        f"Completed summarization of {len(conversations)} conversations, produced {len(summaries)} summaries"
    )
    return summaries

`summarise_conversations(conversations: list[Conversation], *, model: BaseSummaryModel, response_schema: Type[T] = GeneratedSummary, prompt: str = DEFAULT_SUMMARY_PROMPT, temperature: float = 0.2, checkpoint_manager: Optional[BaseCheckpointManager] = None, **kwargs) -> list[ConversationSummary]` `async` ¶

Generate summaries for a list of conversations using the CLIO framework.

This is a pure function that takes conversations and a summary model, and returns conversation summaries with automatic extensibility. Optionally uses checkpointing for efficient re-runs.

The function works with any model that implements BaseSummaryModel, supporting heterogeneous backends (OpenAI, vLLM, Hugging Face, etc.) through polymorphism.

Extensibility Features: - Custom Fields: Extend GeneratedSummary to add custom analysis fields - Prompt Modification: Use prompt to modify CLIO analysis - Automatic Mapping: Extended fields are automatically placed in metadata

Parameters:

Name	Type	Description	Default
`conversations`	`list[Conversation]`	List of conversations to summarize	required
`model`	`BaseSummaryModel`	Model to use for summarization (OpenAI, vLLM, local, etc.)	required
`response_schema`	`Type[T]`	Pydantic model class for LLM output. Extend GeneratedSummary to add custom fields that will appear in metadata	`GeneratedSummary`
`prompt`	`str`	Custom prompt to modify the CLIO analysis	`DEFAULT_SUMMARY_PROMPT`
`temperature`	`float`	LLM temperature for generation	`0.2`
`checkpoint_manager`	`Optional[BaseCheckpointManager]`	Optional checkpoint manager for caching	`None`

Returns:

Type	Description
`list[ConversationSummary]`	List of ConversationSummary objects with core CLIO fields and any
`list[ConversationSummary]`	additional fields from extended schemas in metadata

Example - Basic Usage

model = SummaryModel() summaries = await summarise_conversations( ... conversations=my_conversations, ... model=model ... )

Example - Custom Analysis

class DetailedSummary(GeneratedSummary): ... sentiment: str ... technical_depth: int

summaries = await summarise_conversations( ... conversations=my_conversations, ... model=model, ... response_schema=DetailedSummary, ... prompt="Analyze sentiment and rate technical depth 1-10" ... )

Custom fields available in metadata¶

print(summaries[0].metadata["sentiment"])

Source code in kura/summarisation.py

async def summarise_conversations(
    conversations: list[Conversation],
    *,
    model: BaseSummaryModel,
    response_schema: Type[T] = GeneratedSummary,
    prompt: str = DEFAULT_SUMMARY_PROMPT,
    temperature: float = 0.2,
    checkpoint_manager: Optional[BaseCheckpointManager] = None,
    **kwargs,
) -> list[ConversationSummary]:
    """Generate summaries for a list of conversations using the CLIO framework.

    This is a pure function that takes conversations and a summary model,
    and returns conversation summaries with automatic extensibility.
    Optionally uses checkpointing for efficient re-runs.

    The function works with any model that implements BaseSummaryModel,
    supporting heterogeneous backends (OpenAI, vLLM, Hugging Face, etc.)
    through polymorphism.

    Extensibility Features:
    - **Custom Fields**: Extend GeneratedSummary to add custom analysis fields
    - **Prompt Modification**: Use prompt to modify CLIO analysis
    - **Automatic Mapping**: Extended fields are automatically placed in metadata

    Args:
        conversations: List of conversations to summarize
        model: Model to use for summarization (OpenAI, vLLM, local, etc.)
        response_schema: Pydantic model class for LLM output. Extend GeneratedSummary
                        to add custom fields that will appear in metadata
        prompt: Custom prompt to modify the CLIO analysis
        temperature: LLM temperature for generation
        checkpoint_manager: Optional checkpoint manager for caching

    Returns:
        List of ConversationSummary objects with core CLIO fields and any
        additional fields from extended schemas in metadata

    Example - Basic Usage:
        >>> model = SummaryModel()
        >>> summaries = await summarise_conversations(
        ...     conversations=my_conversations,
        ...     model=model
        ... )

    Example - Custom Analysis:
        >>> class DetailedSummary(GeneratedSummary):
        ...     sentiment: str
        ...     technical_depth: int
        >>>
        >>> summaries = await summarise_conversations(
        ...     conversations=my_conversations,
        ...     model=model,
        ...     response_schema=DetailedSummary,
        ...     prompt="Analyze sentiment and rate technical depth 1-10"
        ... )
        >>> # Custom fields available in metadata
        >>> print(summaries[0].metadata["sentiment"])
    """
    logger.info(
        f"Starting summarization of {len(conversations)} conversations using {type(model).__name__}"
    )

    # Try to load from checkpoint
    if checkpoint_manager:
        cached = checkpoint_manager.load_checkpoint(
            model.checkpoint_filename, ConversationSummary
        )
        if cached:
            logger.info(f"Loaded {len(cached)} summaries from checkpoint")
            return cached

    # Generate raw summaries
    logger.info("Generating new summaries...")
    raw_summaries = await model.summarise(
        conversations,
        response_schema=response_schema,
        temperature=temperature,
        prompt=prompt,
        **kwargs,
    )
    logger.info(f"Generated {len(raw_summaries)} raw summaries")

    # Summaries are already ConversationSummary objects from _summarise_single_conversation
    summaries = raw_summaries
    logger.info(f"Generated {len(summaries)} conversation summaries")

    # Save to checkpoint
    if checkpoint_manager:
        logger.info(f"Saving summaries to checkpoint: {model.checkpoint_filename}")
        checkpoint_manager.save_checkpoint(model.checkpoint_filename, summaries)

    return summaries

Clustering¶

`kura.cluster` ¶

DEFAULT_CLUSTER_PROMPT = '\nYou are tasked with summarizing a group of related statements into a short, precise, and accurate description and name. Your goal is to create a concise summary that captures the essence of these statements and distinguishes them from other similar groups of statements.\n\nSummarize all the statements into a clear, precise, two-sentence description in the past tense. Your summary should be specific to this group and distinguish it from the contrastive answers of the other groups.\n\nAfter creating the summary, generate a short name for the group of statements. This name should be at most ten words long (perhaps less) and be specific but also reflective of most of the statements (rather than reflecting only one or two).\n\nThe name should distinguish this group from the contrastive examples. For instance, "Write fantasy sexual roleplay with octopi and monsters", "Generate blog spam for gambling websites", or "Assist with high school math homework" would be better and more actionable than general terms like "Write erotic content" or "Help with homework". Be as descriptive as possible and assume neither good nor bad faith. Do not hesitate to identify and describe socially harmful or sensitive topics specifically; specificity is necessary for monitoring.\n\nPresent your output in the following format:\n<summary> [Insert your two-sentence summary here] </summary>\n<name> [Insert your generated short name here] </name>\n\nThe names you propose must follow these requirements:\n- The cluster name should be a sentence in the imperative that captures the user\'s request. For example, \'Brainstorm ideas for a birthday party\' or \'Help me find a new job.\'\n- Create names that are specific enough to be meaningful, but not so specific that they can\'t meaningfully represent many different statements.\n- Avoid overly general or vague terms, and do not hesitate to describe socially harmful or sensitive topics (in fact, clusters that clearly describe harmful behavior are slightly preferred); specificity is necessary for observability and enforcement.\n- Ensure that the cluster name is distinct from the contrastive examples.\n- Use clear, concise, and descriptive language for the cluster name.\n\nBelow are the related statements:\n<positive_examples>\n{% for item in positive_examples %}{{ item }}\n{% endfor %}\n</positive_examples>\n\nFor context, here are statements from nearby groups that are NOT part of the group you\'re summarizing:\n<contrastive_examples>\n{% for item in contrastive_examples %}{{ item }}\n{% endfor %}\n</contrastive_examples>\n\nDo not elaborate beyond what you say in the tags. Remember to analyze both the statements and the contrastive statements carefully to ensure your summary and name accurately represent the specific group while distinguishing it from others. The cluster name should be a sentence in the imperative that captures the user\'s request. For example, \'Brainstorm ideas for a birthday party\' or \'Help me find a new job.\'\n' `module-attribute` ¶

`logger = logging.getLogger(name)` `module-attribute` ¶

`ClusterDescriptionModel` ¶

Bases: BaseClusterDescriptionModel

Model for generating cluster descriptions using LLMs.

Similar to SummaryModel, this handles the LLM interaction for generating cluster names and descriptions with configurable parameters.

Source code in kura/cluster.py

class ClusterDescriptionModel(BaseClusterDescriptionModel):
    """
    Model for generating cluster descriptions using LLMs.

    Similar to SummaryModel, this handles the LLM interaction for generating
    cluster names and descriptions with configurable parameters.
    """

    def __init__(
        self,
        model: Union[str, "KnownModelName"] = "openai/gpt-4o-mini",
        max_concurrent_requests: int = 50,
        temperature: float = 0.2,
        checkpoint_filename: str = "clusters",
        console: Optional[Console] = None,
    ):
        """
        Initialize ClusterModel with core configuration.

        Args:
            model: model identifier (e.g., "openai/gpt-4o-mini")
            max_concurrent_requests: Maximum concurrent API requests
            temperature: LLM temperature for generation
            checkpoint_filename: Filename for checkpointing
            console: Rich console for progress tracking
        """
        self.model = model
        self.max_concurrent_requests = max_concurrent_requests
        self.temperature = temperature
        self._checkpoint_filename = checkpoint_filename
        self.console = console

        logger.info(
            f"Initialized ClusterModel with model={model}, max_concurrent_requests={max_concurrent_requests}, temperature={temperature}"
        )

    @property
    def checkpoint_filename(self) -> str:
        """Return the filename to use for checkpointing this model's output."""
        return self._checkpoint_filename

    async def generate_clusters(
        self,
        cluster_id_to_summaries: Dict[int, List[ConversationSummary]],
        prompt: str = DEFAULT_CLUSTER_PROMPT,
        max_contrastive_examples: int = 10,
    ) -> List[Cluster]:
        """Generate clusters from a mapping of cluster IDs to summaries."""
        import instructor

        self.sem = Semaphore(self.max_concurrent_requests)
        self.client = instructor.from_provider(self.model, async_client=True)

        if not self.console:
            # Simple processing without rich display
            return await asyncio.gather(
                *[
                    self.generate_cluster_description(
                        summaries,
                        get_contrastive_examples(
                            cluster_id,
                            cluster_id_to_summaries=cluster_id_to_summaries,
                            max_contrastive_examples=max_contrastive_examples,
                        ),
                        self.sem,
                        self.client,
                        prompt,
                    )
                    for cluster_id, summaries in cluster_id_to_summaries.items()
                ]
            )

        return await self._generate_clusters_with_console(
            cluster_id_to_summaries,
            max_contrastive_examples,
            prompt,
        )

    async def generate_cluster_description(
        self,
        summaries: List[ConversationSummary],
        contrastive_examples: List[ConversationSummary],
        semaphore: Semaphore,
        client: "AsyncInstructor",
        prompt: str = DEFAULT_CLUSTER_PROMPT,
    ) -> Cluster:
        """
        Generate a cluster description from summaries with contrastive examples.

        Args:
            summaries: Summaries in this cluster
            contrastive_examples: Examples from other clusters for contrast

        Returns:
            Cluster with generated name and description
        """
        logger.debug(
            f"Generating cluster from {len(summaries)} summaries with {len(contrastive_examples)} contrastive examples"
        )
        async with semaphore:
            try:
                resp = await client.chat.completions.create(
                    messages=[
                        {
                            "role": "system",
                            "content": prompt,
                        },
                    ],
                    response_model=GeneratedCluster,
                    temperature=self.temperature,
                    context={
                        "positive_examples": summaries,
                        "contrastive_examples": contrastive_examples,
                    },
                )

                cluster = Cluster(
                    name=resp.name,
                    description=resp.summary,
                    slug=resp.slug,
                    chat_ids=[item.chat_id for item in summaries],
                    parent_id=None,
                )

                logger.debug(
                    f"Successfully generated cluster '{resp.name}' with {len(summaries)} conversations"
                )
                return cluster

            except Exception as e:
                logger.error(
                    f"Failed to generate cluster from {len(summaries)} summaries: {e}"
                )
                raise

    async def _generate_clusters_with_console(
        self,
        cluster_id_to_summaries: Dict[int, List[ConversationSummary]],
        max_contrastive_examples: int,
        prompt: str,
    ) -> List[Cluster]:
        """
        Generate clusters with full-screen Rich console display showing progress and latest results.
        """
        from rich.live import Live
        from rich.layout import Layout
        from rich.panel import Panel
        from rich.text import Text
        from rich.progress import (
            Progress,
            SpinnerColumn,
            TextColumn,
            TaskProgressColumn,
            TimeRemainingColumn,
        )

        clusters = []
        completed_clusters = []
        max_preview_items = 3
        total_clusters = len(cluster_id_to_summaries)

        # Create full-screen layout
        layout = Layout()
        layout.split_column(Layout(name="progress", size=3), Layout(name="preview"))

        # Create progress bar
        progress = Progress(
            SpinnerColumn(),
            TextColumn("[progress.description]{task.description}"),
            TaskProgressColumn(),
            TimeRemainingColumn(),
            console=self.console,
        )
        task_id = progress.add_task("", total=total_clusters)
        layout["progress"].update(progress)

        def update_preview_display():
            if completed_clusters:
                preview_text = Text()

                for cluster in completed_clusters[-max_preview_items:]:  # Show latest 3
                    preview_text.append(f"Name: {cluster.name}\n", style="bold green")
                    preview_text.append(
                        f"Description: {cluster.description[:100]}{'...' if len(cluster.description) > 100 else ''}\n",
                        style="white",
                    )
                    preview_text.append(
                        f"Conversations: {len(cluster.chat_ids)}\n\n",
                        style="yellow",
                    )

                layout["preview"].update(
                    Panel(
                        preview_text,
                        title=f"[green]Generated Clusters ({len(completed_clusters)}/{total_clusters})",
                        border_style="green",
                    )
                )
            else:
                layout["preview"].update(
                    Panel(
                        Text("Waiting for clusters...", style="dim"),
                        title="[yellow]Generated Clusters (0/0)",
                        border_style="yellow",
                    )
                )

        # Initialize preview display
        update_preview_display()

        with Live(layout, console=self.console, refresh_per_second=4):
            # Prepare tasks for each cluster
            tasks = []
            for cluster_id, summaries in cluster_id_to_summaries.items():
                coro = self.generate_cluster_description(
                    summaries,
                    get_contrastive_examples(
                        cluster_id,
                        cluster_id_to_summaries=cluster_id_to_summaries,
                        max_contrastive_examples=max_contrastive_examples,
                    ),
                    self.sem,
                    self.client,
                    prompt,
                )
                tasks.append(coro)

            # Use asyncio.as_completed to show results as they finish
            for i, coro in enumerate(asyncio.as_completed(tasks)):
                try:
                    cluster = await coro
                    clusters.append(cluster)
                    completed_clusters.append(cluster)

                    # Update progress
                    progress.update(task_id, completed=i + 1)

                    # Update preview display
                    update_preview_display()

                except Exception as e:
                    logger.error(f"Failed to generate cluster: {e}")
                    # Still update progress on error
                    progress.update(task_id, completed=i + 1)
                    update_preview_display()

        return clusters

`checkpoint_filename: str` `property` ¶

Return the filename to use for checkpointing this model's output.

`console = console` `instance-attribute` ¶

`max_concurrent_requests = max_concurrent_requests` `instance-attribute` ¶

`model = model` `instance-attribute` ¶

`temperature = temperature` `instance-attribute` ¶

`init(model: Union[str, KnownModelName] = 'openai/gpt-4o-mini', max_concurrent_requests: int = 50, temperature: float = 0.2, checkpoint_filename: str = 'clusters', console: Optional[Console] = None)` ¶

Initialize ClusterModel with core configuration.

Parameters:

Name	Type	Description	Default
`model`	`Union[str, KnownModelName]`	model identifier (e.g., "openai/gpt-4o-mini")	`'openai/gpt-4o-mini'`
`max_concurrent_requests`	`int`	Maximum concurrent API requests	`50`
`temperature`	`float`	LLM temperature for generation	`0.2`
`checkpoint_filename`	`str`	Filename for checkpointing	`'clusters'`
`console`	`Optional[Console]`	Rich console for progress tracking	`None`

Source code in kura/cluster.py

def __init__(
    self,
    model: Union[str, "KnownModelName"] = "openai/gpt-4o-mini",
    max_concurrent_requests: int = 50,
    temperature: float = 0.2,
    checkpoint_filename: str = "clusters",
    console: Optional[Console] = None,
):
    """
    Initialize ClusterModel with core configuration.

    Args:
        model: model identifier (e.g., "openai/gpt-4o-mini")
        max_concurrent_requests: Maximum concurrent API requests
        temperature: LLM temperature for generation
        checkpoint_filename: Filename for checkpointing
        console: Rich console for progress tracking
    """
    self.model = model
    self.max_concurrent_requests = max_concurrent_requests
    self.temperature = temperature
    self._checkpoint_filename = checkpoint_filename
    self.console = console

    logger.info(
        f"Initialized ClusterModel with model={model}, max_concurrent_requests={max_concurrent_requests}, temperature={temperature}"
    )

`generate_cluster_description(summaries: List[ConversationSummary], contrastive_examples: List[ConversationSummary], semaphore: Semaphore, client: AsyncInstructor, prompt: str = DEFAULT_CLUSTER_PROMPT) -> Cluster` `async` ¶

Generate a cluster description from summaries with contrastive examples.

Parameters:

Name	Type	Description	Default
`summaries`	`List[ConversationSummary]`	Summaries in this cluster	required
`contrastive_examples`	`List[ConversationSummary]`	Examples from other clusters for contrast	required

Returns:

Type	Description
`Cluster`	Cluster with generated name and description

Source code in kura/cluster.py

async def generate_cluster_description(
    self,
    summaries: List[ConversationSummary],
    contrastive_examples: List[ConversationSummary],
    semaphore: Semaphore,
    client: "AsyncInstructor",
    prompt: str = DEFAULT_CLUSTER_PROMPT,
) -> Cluster:
    """
    Generate a cluster description from summaries with contrastive examples.

    Args:
        summaries: Summaries in this cluster
        contrastive_examples: Examples from other clusters for contrast

    Returns:
        Cluster with generated name and description
    """
    logger.debug(
        f"Generating cluster from {len(summaries)} summaries with {len(contrastive_examples)} contrastive examples"
    )
    async with semaphore:
        try:
            resp = await client.chat.completions.create(
                messages=[
                    {
                        "role": "system",
                        "content": prompt,
                    },
                ],
                response_model=GeneratedCluster,
                temperature=self.temperature,
                context={
                    "positive_examples": summaries,
                    "contrastive_examples": contrastive_examples,
                },
            )

            cluster = Cluster(
                name=resp.name,
                description=resp.summary,
                slug=resp.slug,
                chat_ids=[item.chat_id for item in summaries],
                parent_id=None,
            )

            logger.debug(
                f"Successfully generated cluster '{resp.name}' with {len(summaries)} conversations"
            )
            return cluster

        except Exception as e:
            logger.error(
                f"Failed to generate cluster from {len(summaries)} summaries: {e}"
            )
            raise

`generate_clusters(cluster_id_to_summaries: Dict[int, List[ConversationSummary]], prompt: str = DEFAULT_CLUSTER_PROMPT, max_contrastive_examples: int = 10) -> List[Cluster]` `async` ¶

Generate clusters from a mapping of cluster IDs to summaries.

Source code in kura/cluster.py

async def generate_clusters(
    self,
    cluster_id_to_summaries: Dict[int, List[ConversationSummary]],
    prompt: str = DEFAULT_CLUSTER_PROMPT,
    max_contrastive_examples: int = 10,
) -> List[Cluster]:
    """Generate clusters from a mapping of cluster IDs to summaries."""
    import instructor

    self.sem = Semaphore(self.max_concurrent_requests)
    self.client = instructor.from_provider(self.model, async_client=True)

    if not self.console:
        # Simple processing without rich display
        return await asyncio.gather(
            *[
                self.generate_cluster_description(
                    summaries,
                    get_contrastive_examples(
                        cluster_id,
                        cluster_id_to_summaries=cluster_id_to_summaries,
                        max_contrastive_examples=max_contrastive_examples,
                    ),
                    self.sem,
                    self.client,
                    prompt,
                )
                for cluster_id, summaries in cluster_id_to_summaries.items()
            ]
        )

    return await self._generate_clusters_with_console(
        cluster_id_to_summaries,
        max_contrastive_examples,
        prompt,
    )

`KmeansClusteringModel` ¶

Bases: BaseClusteringMethod

Source code in kura/cluster.py

class KmeansClusteringModel(BaseClusteringMethod):
    def __init__(self, clusters_per_group: int = 10):
        self.clusters_per_group = clusters_per_group
        logger.info(
            f"Initialized KmeansClusteringModel with clusters_per_group={clusters_per_group}"
        )

    def cluster(
        self, items: list[dict[str, Union[ConversationSummary, list[float]]]]
    ) -> dict[int, list[ConversationSummary]]:
        """
        We perform a clustering here using an embedding defined on each individual item.

        We assume that the item is passed in as a dictionary with

        - its relevant embedding stored in the "embedding" key.
        - the item itself stored in the "item" key.

        {
            "embedding": list[float],
            "item": any,
        }
        """
        from sklearn.cluster import KMeans

        if not items:
            logger.warning("Empty items list provided to cluster method")
            raise ValueError("Empty items list provided to cluster method")

        logger.info(f"Starting K-means clustering of {len(items)} items")

        try:
            embeddings = [item["embedding"] for item in items]  # pyright: ignore
            data = [item["item"] for item in items]
            n_clusters = math.ceil(len(data) / self.clusters_per_group)

            logger.debug(
                f"Calculated {n_clusters} clusters for {len(data)} items (target: {self.clusters_per_group} items per cluster)"
            )

            X = np.array(embeddings)
            logger.debug(f"Created embedding matrix of shape {X.shape}")

            kmeans = KMeans(n_clusters=n_clusters)
            cluster_labels = kmeans.fit_predict(X)

            logger.debug(
                f"K-means clustering completed, assigned {len(set(cluster_labels))} unique cluster labels"
            )

            result = {
                i: [data[j] for j in range(len(data)) if cluster_labels[j] == i]
                for i in range(n_clusters)
            }

            # Log cluster size distribution
            cluster_sizes = [len(cluster_items) for cluster_items in result.values()]
            logger.info(
                f"K-means clustering completed: {len(result)} clusters created with sizes {cluster_sizes}"
            )
            logger.debug(
                f"Cluster size stats - min: {min(cluster_sizes)}, max: {max(cluster_sizes)}, avg: {sum(cluster_sizes) / len(cluster_sizes):.1f}"
            )

            return cast(dict[int, list[ConversationSummary]], result)

        except Exception as e:
            logger.error(
                f"Failed to perform K-means clustering on {len(items)} items: {e}"
            )
            raise

`clusters_per_group = clusters_per_group` `instance-attribute` ¶

`init(clusters_per_group: int = 10)` ¶

Source code in kura/cluster.py

def __init__(self, clusters_per_group: int = 10):
    self.clusters_per_group = clusters_per_group
    logger.info(
        f"Initialized KmeansClusteringModel with clusters_per_group={clusters_per_group}"
    )

`cluster(items: list[dict[str, Union[ConversationSummary, list[float]]]]) -> dict[int, list[ConversationSummary]]` ¶

We perform a clustering here using an embedding defined on each individual item.

We assume that the item is passed in as a dictionary with

its relevant embedding stored in the "embedding" key.
the item itself stored in the "item" key.

{ "embedding": list[float], "item": any, }

Source code in kura/cluster.py

def cluster(
    self, items: list[dict[str, Union[ConversationSummary, list[float]]]]
) -> dict[int, list[ConversationSummary]]:
    """
    We perform a clustering here using an embedding defined on each individual item.

    We assume that the item is passed in as a dictionary with

    - its relevant embedding stored in the "embedding" key.
    - the item itself stored in the "item" key.

    {
        "embedding": list[float],
        "item": any,
    }
    """
    from sklearn.cluster import KMeans

    if not items:
        logger.warning("Empty items list provided to cluster method")
        raise ValueError("Empty items list provided to cluster method")

    logger.info(f"Starting K-means clustering of {len(items)} items")

    try:
        embeddings = [item["embedding"] for item in items]  # pyright: ignore
        data = [item["item"] for item in items]
        n_clusters = math.ceil(len(data) / self.clusters_per_group)

        logger.debug(
            f"Calculated {n_clusters} clusters for {len(data)} items (target: {self.clusters_per_group} items per cluster)"
        )

        X = np.array(embeddings)
        logger.debug(f"Created embedding matrix of shape {X.shape}")

        kmeans = KMeans(n_clusters=n_clusters)
        cluster_labels = kmeans.fit_predict(X)

        logger.debug(
            f"K-means clustering completed, assigned {len(set(cluster_labels))} unique cluster labels"
        )

        result = {
            i: [data[j] for j in range(len(data)) if cluster_labels[j] == i]
            for i in range(n_clusters)
        }

        # Log cluster size distribution
        cluster_sizes = [len(cluster_items) for cluster_items in result.values()]
        logger.info(
            f"K-means clustering completed: {len(result)} clusters created with sizes {cluster_sizes}"
        )
        logger.debug(
            f"Cluster size stats - min: {min(cluster_sizes)}, max: {max(cluster_sizes)}, avg: {sum(cluster_sizes) / len(cluster_sizes):.1f}"
        )

        return cast(dict[int, list[ConversationSummary]], result)

    except Exception as e:
        logger.error(
            f"Failed to perform K-means clustering on {len(items)} items: {e}"
        )
        raise

`generate_base_clusters_from_conversation_summaries(summaries: List[ConversationSummary], embedding_model: Optional[BaseEmbeddingModel] = None, clustering_method: Optional[BaseClusteringMethod] = None, clustering_model: Optional[BaseClusterDescriptionModel] = None, checkpoint_manager: Optional[BaseCheckpointManager] = None, max_contrastive_examples: int = 10, prompt: str = DEFAULT_CLUSTER_PROMPT, **kwargs) -> List[Cluster]` `async` ¶

Cluster conversation summaries using embeddings.

Parameters:

Name	Type	Description	Default
`summaries`	`List[ConversationSummary]`	List of conversation summaries to cluster	required
`embedding_model`	`Optional[BaseEmbeddingModel]`	Model for generating embeddings (defaults to OpenAI)	`None`
`clustering_method`	`Optional[BaseClusteringMethod]`	Clustering algorithm (defaults to K-means)	`None`
`clustering_model`	`Optional[BaseClusterDescriptionModel]`	Model for generating cluster descriptions	`None`
`checkpoint_manager`	`Optional[BaseCheckpointManager]`	Optional checkpoint manager for caching	`None`
`max_contrastive_examples`	`int`	Number of contrastive examples to use	`10`
`prompt`	`str`	Custom prompt for cluster generation	`DEFAULT_CLUSTER_PROMPT`
`**kwargs`		Additional parameters for clustering model	`{}`

Returns:

Type	Description
`List[Cluster]`	List of clusters with generated names and descriptions

Source code in kura/cluster.py

async def generate_base_clusters_from_conversation_summaries(
    summaries: List[ConversationSummary],
    embedding_model: Optional[BaseEmbeddingModel] = None,
    clustering_method: Optional[BaseClusteringMethod] = None,
    clustering_model: Optional[BaseClusterDescriptionModel] = None,
    checkpoint_manager: Optional[BaseCheckpointManager] = None,
    max_contrastive_examples: int = 10,
    prompt: str = DEFAULT_CLUSTER_PROMPT,
    **kwargs,
) -> List[Cluster]:
    """
    Cluster conversation summaries using embeddings.

    Args:
        summaries: List of conversation summaries to cluster
        embedding_model: Model for generating embeddings (defaults to OpenAI)
        clustering_method: Clustering algorithm (defaults to K-means)
        clustering_model: Model for generating cluster descriptions
        checkpoint_manager: Optional checkpoint manager for caching
        max_contrastive_examples: Number of contrastive examples to use
        prompt: Custom prompt for cluster generation
        **kwargs: Additional parameters for clustering model

    Returns:
        List of clusters with generated names and descriptions
    """
    if not summaries:
        raise ValueError("Empty summaries list provided")

    # Initialize default models if not provided
    if embedding_model is None:
        embedding_model = OpenAIEmbeddingModel()
    if clustering_method is None:
        clustering_method = KmeansClusteringModel()
    if clustering_model is None:
        clustering_model = ClusterDescriptionModel()

    if checkpoint_manager:
        cached = checkpoint_manager.load_checkpoint(
            clustering_model.checkpoint_filename, Cluster
        )
        if cached:
            logger.info(f"Loaded {len(cached)} clusters from checkpoint")
            return cached

    logger.info(f"Clustering {len(summaries)} conversation summaries")

    # Embed the summaries
    embedded_items = await embed_summaries(summaries, embedding_model)

    # Generate Initial Mapping of Cluster IDs to Summaries
    clusters_id_to_summaries = clustering_method.cluster(embedded_items)

    # Generate Clusters
    clusters = await clustering_model.generate_clusters(
        cluster_id_to_summaries=clusters_id_to_summaries,
        max_contrastive_examples=max_contrastive_examples,
        prompt=prompt,
    )

    if checkpoint_manager:
        checkpoint_manager.save_checkpoint(
            clustering_model.checkpoint_filename, clusters
        )

    return clusters

`get_contrastive_examples(cluster_id: int, cluster_id_to_summaries: Dict[int, List[ConversationSummary]], max_contrastive_examples: int = 10) -> List[ConversationSummary]` ¶

Get contrastive examples from other clusters to help distinguish this cluster.

Parameters:

Name	Type	Description	Default
`cluster_id`	`int`	The id of the cluster to get contrastive examples for	required
`cluster_id_to_summaries`	`Dict[int, List[ConversationSummary]]`	A dictionary of cluster ids to their summaries	required
`max_contrastive_examples`	`int`	The number of contrastive examples to return. Defaults to 10.	`10`

Returns:

Type	Description
`List[ConversationSummary]`	List of contrastive examples from other clusters

Source code in kura/cluster.py

def get_contrastive_examples(
    cluster_id: int,
    cluster_id_to_summaries: Dict[int, List[ConversationSummary]],
    max_contrastive_examples: int = 10,
) -> List[ConversationSummary]:
    """Get contrastive examples from other clusters to help distinguish this cluster.

    Args:
        cluster_id: The id of the cluster to get contrastive examples for
        cluster_id_to_summaries: A dictionary of cluster ids to their summaries
        max_contrastive_examples: The number of contrastive examples to return. Defaults to 10.

    Returns:
        List of contrastive examples from other clusters
    """
    other_clusters = [c for c in cluster_id_to_summaries.keys() if c != cluster_id]
    all_examples = []
    for cluster in other_clusters:
        all_examples.extend(cluster_id_to_summaries[cluster])

    logger.debug(
        f"Selecting contrastive examples for cluster {cluster_id}: found {len(all_examples)} examples from {len(other_clusters)} other clusters"
    )

    # If we don't have enough examples, return all of them
    if len(all_examples) <= max_contrastive_examples:
        logger.debug(
            f"Using all {len(all_examples)} available contrastive examples (limit was {max_contrastive_examples})"
        )
        return all_examples

    # Otherwise sample without replacement
    selected = list(
        np.random.choice(all_examples, size=max_contrastive_examples, replace=False)
    )
    logger.debug(
        f"Randomly selected {len(selected)} contrastive examples from {len(all_examples)} available"
    )
    return selected

Meta-Clustering¶

`kura.meta_cluster` ¶

`logger = logging.getLogger(name)` `module-attribute` ¶

`CandidateClusters` ¶

Bases: BaseModel

Source code in kura/meta_cluster.py

class CandidateClusters(BaseModel):
    candidate_cluster_names: list[str]

    @field_validator("candidate_cluster_names")
    def validate_candidate_cluster_names(cls, v: list[str]) -> list[str]:
        if len(v) == 0:
            raise ValueError("Candidate cluster names must be a non-empty list")

        v = [label.strip() for label in v]
        v = [label[:-1] if label.endswith(".") else label for label in v]

        return [re.sub(r"\\{1,}", "", label.replace('"', "")) for label in v]

`candidate_cluster_names: list[str]` `instance-attribute` ¶

`validate_candidate_cluster_names(v: list[str]) -> list[str]` ¶

Source code in kura/meta_cluster.py

@field_validator("candidate_cluster_names")
def validate_candidate_cluster_names(cls, v: list[str]) -> list[str]:
    if len(v) == 0:
        raise ValueError("Candidate cluster names must be a non-empty list")

    v = [label.strip() for label in v]
    v = [label[:-1] if label.endswith(".") else label for label in v]

    return [re.sub(r"\\{1,}", "", label.replace('"', "")) for label in v]

`ClusterLabel` ¶

Bases: BaseModel

Source code in kura/meta_cluster.py

class ClusterLabel(BaseModel):
    higher_level_cluster: str

    @field_validator("higher_level_cluster")
    def validate_higher_level_cluster(cls, v: str, info: ValidationInfo) -> str:
        if not info.context:
            raise ValueError("Context is missing")

        if "candidate_clusters" not in info.context:
            raise ValueError("Candidate clusters are missing from context")

        candidate_clusters = info.context["candidate_clusters"]

        # Exact match check
        if v in candidate_clusters:
            return v

        # Fuzzy match check with 90% similarity threshold
        for candidate in candidate_clusters:
            similarity = fuzz.ratio(v, candidate)
            if similarity >= 90:  # 90% similarity threshold
                return candidate

        # If no match found
        raise ValueError(
            f"""
            Invalid higher-level cluster: |{v}|

            Valid clusters are:
            {", ".join(f"|{c}|" for c in candidate_clusters)}
            """
        )
        return v

`higher_level_cluster: str` `instance-attribute` ¶

`validate_higher_level_cluster(v: str, info: ValidationInfo) -> str` ¶

Source code in kura/meta_cluster.py

@field_validator("higher_level_cluster")
def validate_higher_level_cluster(cls, v: str, info: ValidationInfo) -> str:
    if not info.context:
        raise ValueError("Context is missing")

    if "candidate_clusters" not in info.context:
        raise ValueError("Candidate clusters are missing from context")

    candidate_clusters = info.context["candidate_clusters"]

    # Exact match check
    if v in candidate_clusters:
        return v

    # Fuzzy match check with 90% similarity threshold
    for candidate in candidate_clusters:
        similarity = fuzz.ratio(v, candidate)
        if similarity >= 90:  # 90% similarity threshold
            return candidate

    # If no match found
    raise ValueError(
        f"""
        Invalid higher-level cluster: |{v}|

        Valid clusters are:
        {", ".join(f"|{c}|" for c in candidate_clusters)}
        """
    )
    return v

`MetaClusterModel` ¶

Bases: BaseMetaClusterModel

Source code in kura/meta_cluster.py

class MetaClusterModel(BaseMetaClusterModel):
    @property
    def checkpoint_filename(self) -> str:
        """The filename to use for checkpointing this model's output."""
        return "meta_clusters"

    def __init__(
        self,
        max_concurrent_requests: int = 50,
        model: str = "openai/gpt-4o-mini",
        embedding_model: Optional[BaseEmbeddingModel] = None,
        clustering_model: Union[BaseClusteringMethod, None] = None,
        max_clusters: int = 10,
        console: Optional["Console"] = None,
        **kwargs,  # For future use
    ):
        if clustering_model is None:
            from kura.cluster import KmeansClusteringModel

            clustering_model = KmeansClusteringModel(12)

        self.max_concurrent_requests = max_concurrent_requests
        self.sem = Semaphore(max_concurrent_requests)

        import instructor

        self.client = instructor.from_provider(model, async_client=True)
        self.console = console
        self.max_clusters = max_clusters

        if embedding_model is None:
            embedding_model = OpenAIEmbeddingModel()

        self.embedding_model = embedding_model
        self.clustering_model = clustering_model
        self.model = model
        self.console = console

        logger.info(
            f"Initialized MetaClusterModel with model={model}, max_concurrent_requests={max_concurrent_requests}, embedding_model={type(embedding_model).__name__}, clustering_model={type(clustering_model).__name__}, max_clusters={max_clusters}"
        )

        # Debug: Check if console is set
        if self.console:
            logger.debug(f"Console is set to {type(self.console)}")
        else:
            logger.debug("Console is None - Rich progress bars will not be available")

    async def _gather_with_progress(
        self,
        tasks,
        desc: str = "Processing",
        disable: bool = False,
        show_preview: bool = False,
    ):
        """Helper method to run async gather with Rich progress bar if available, otherwise tqdm."""
        if self.console and not disable:
            try:
                from rich.progress import (
                    Progress,
                    SpinnerColumn,
                    TextColumn,
                    BarColumn,
                    TaskProgressColumn,
                    TimeRemainingColumn,
                )
                from rich.live import Live
                from rich.layout import Layout
                from rich.panel import Panel
                from rich.text import Text
                from rich.errors import LiveError

                # Check if a Live display is already active by trying to get the current live instance
                try:
                    # Try to access the console's current live instance
                    if (
                        hasattr(self.console, "_live")
                        and self.console._live is not None
                    ):
                        show_preview = (
                            False  # Disable preview if Live is already active
                        )
                except AttributeError:
                    pass  # Console doesn't have _live attribute, that's fine

                if show_preview:
                    # Use Live display with progress and preview buffer
                    layout = Layout()
                    layout.split_column(
                        Layout(name="progress", size=3), Layout(name="preview")
                    )

                    preview_buffer = []
                    max_preview_items = 3

                    # Create progress with cleaner display
                    progress = Progress(
                        SpinnerColumn(),
                        TextColumn("[progress.description]{task.description}"),
                        BarColumn(),
                        TaskProgressColumn(),
                        TimeRemainingColumn(),
                        console=self.console,
                    )
                    task_id = progress.add_task(f"[cyan]{desc}...", total=len(tasks))
                    layout["progress"].update(progress)

                    try:
                        with Live(layout, console=self.console, refresh_per_second=4):
                            completed_tasks = []
                            for i, task in enumerate(asyncio.as_completed(tasks)):
                                result = await task
                                completed_tasks.append(result)
                                progress.update(task_id, completed=i + 1)

                                # Handle different result types
                                if isinstance(result, list):
                                    # For operations that return lists of clusters
                                    for item in result:
                                        if (
                                            hasattr(item, "name")
                                            and hasattr(item, "description")
                                            and item.parent_id is None
                                        ):
                                            preview_buffer.append(item)
                                            if len(preview_buffer) > max_preview_items:
                                                preview_buffer.pop(0)
                                elif hasattr(result, "name") and hasattr(
                                    result, "description"
                                ):
                                    # For operations that return single clusters
                                    preview_buffer.append(result)
                                    if len(preview_buffer) > max_preview_items:
                                        preview_buffer.pop(0)

                                # Update preview display if we have clusters
                                if preview_buffer:
                                    preview_text = Text()
                                    for j, cluster in enumerate(preview_buffer):
                                        preview_text.append(
                                            "Meta Cluster: ", style="bold magenta"
                                        )
                                        preview_text.append(
                                            f"{cluster.name[:80]}...\n",
                                            style="bold white",
                                        )
                                        preview_text.append(
                                            "Description: ", style="bold cyan"
                                        )
                                        preview_text.append(
                                            f"{cluster.description[:100]}...\n\n",
                                            style="dim white",
                                        )

                                    layout["preview"].update(
                                        Panel(
                                            preview_text,
                                            title=f"[magenta]Recent Meta Clusters ({len(preview_buffer)}/{max_preview_items})",
                                            border_style="magenta",
                                        )
                                    )

                            return completed_tasks
                    except LiveError:
                        # If Rich Live fails (e.g., another Live is active), fall back to simple progress
                        with progress:
                            completed_tasks = []
                            for i, task in enumerate(asyncio.as_completed(tasks)):
                                result = await task
                                completed_tasks.append(result)
                                progress.update(task_id, completed=i + 1)
                            return completed_tasks
                else:
                    # Regular progress bar without preview (or when Live is already active)
                    progress = Progress(
                        SpinnerColumn(),
                        TextColumn("[progress.description]{task.description}"),
                        BarColumn(),
                        TaskProgressColumn(),
                        TimeRemainingColumn(),
                        console=self.console,
                    )

                    with progress:
                        task_id = progress.add_task(
                            f"[cyan]{desc}...", total=len(tasks)
                        )

                        completed_tasks = []
                        for i, task in enumerate(asyncio.as_completed(tasks)):
                            result = await task
                            completed_tasks.append(result)
                            progress.update(task_id, completed=i + 1)

                        return completed_tasks

            except (ImportError, LiveError):  # type: ignore
                # Rich not available or Live error, run silently
                return await asyncio.gather(*tasks)
        else:
            # No console, run silently
            return await asyncio.gather(*tasks)

    async def generate_candidate_clusters(
        self, clusters: list[Cluster], sem: Semaphore
    ) -> list[str]:
        async with sem:
            resp = await self.client.chat.completions.create(
                messages=[
                    {
                        "role": "user",
                        "content": """
                You are tasked with creating higher-level cluster names based on a given list of clusters and their descriptions. Your goal is to come up with broader categories that could encompass one or more of the provided clusters

                First, review the list of clusters and their descriptions:
                <cluster_list>
                    {% for cluster in clusters %}
                    <cluster>{{ cluster.name }}: {{ cluster.description }}</cluster>
                    {% endfor %}
                </cluster_list>

                Your task is to create at most {{ desired_number }} higher-level cluster names that could potentially include one or more of the provided clusters. These higher-level clusters should represent broader categories or themes that emerge from the given clusters, while remaining as specific as possible. If there are many clusters with a specific theme, ensure that the higher-level cluster name remains the maximum level of specificity. You are helping to organize user behavior data in order to improve safety, monitoring, and observability. You can generate less than {{ desired_number }} names if you feel that fewer are appropriate and accurately capture the clusters.

                Guidelines for creating higher-level clusters names
                1. Analyze the themes, topics or characteristics common to multiple clusters.
                2. Create names that are specific enough to be meaningful but but not so specific that they can't meaningfully represent many different clusters. Avoid overly general or vague terms, and do not hesitate to describe socially harmful or sensitive topics (in fact, clusters that clearly describe harmful behavior are slightly preferred); specificity is necessary for observability and enforcement.
                3. Ensure that the higher-level cluster names are distinct from one another.
                4. Use clear, concise, and descriptive language for the cluster names. Assume neither good nor bad faith for the content in the clusters.

                Think about the relationships between the given clusters and potential overarching themes.

                Focus on creating meaningful, distinct and precise ( but not overly specific ) higher-level cluster names that could encompass multiple sub-clusters.
                """.strip(),
                    },
                ],
                response_model=CandidateClusters,
                context={
                    "clusters": clusters,
                    "desired_number": math.ceil(len(clusters) / 2)
                    if len(clusters)
                    >= 3  # If we have two clusters we just merge them tbh
                    else 1,
                },
                max_retries=3,
            )
            return resp.candidate_cluster_names

    async def label_cluster(self, cluster: Cluster, candidate_clusters: list[str]):
        async with self.sem:
            resp = await self.client.chat.completions.create(
                messages=[
                    {
                        "role": "user",
                        "content": """
You are tasked with categorizing a specific cluster into one of the provided higher-level clusters for observability, monitoring, and content moderation. Your goal is to determine which higher-level cluster best fits the given specific cluster based on its name and description.

First, here are the ONLY valid higher-level clusters you may select from:
<higher_level_clusters>
{% for cluster in candidate_clusters %}
<higher_level_cluster>{{ cluster }}</higher_level_cluster>
{% endfor %}
</higher_level_clusters>

Here is the specific cluster to categorize:
<specific_cluster>
Name: {{ cluster.name }}
Description: {{ cluster.description }}
</specific_cluster>

RULES:
1. You MUST select EXACTLY ONE higher-level cluster from the provided list
2. You MUST output the higher-level cluster name EXACTLY as written - no modifications allowed
3. You MUST NOT create new cluster names or combinations
4. You MUST NOT output any additional text or explanations
5. You MUST NOT use partial matches or approximate names

CLASSIFICATION PROCESS:
1. First, record the exact list of valid higher-level clusters
2. Read the specific cluster's name and description carefully
3. Compare the specific cluster's key characteristics against each valid higher-level cluster
4. Select the single most appropriate higher-level cluster that encompasses the specific cluster
5. Verify your selected cluster exactly matches one from the valid list
6. Output ONLY the selected higher-level cluster name, exactly as it appeared in the valid list

Here is the specific cluster to categorize:

<specific_cluster>
Name: {{ cluster.name }}
Description: {{ cluster.description }}
</specific_cluster>

Based on this information, determine the most appropriate higher-level cluster and provide your answer as instructed.
                        """,
                    }
                ],
                response_model=ClusterLabel,
                context={
                    "cluster": cluster,
                    "candidate_clusters": candidate_clusters,
                },
                max_retries=3,
            )
            return {
                "cluster": cluster,
                "label": resp.higher_level_cluster,
            }

    async def rename_cluster_group(self, clusters: list[Cluster]) -> list[Cluster]:
        async with self.sem:
            resp = await self.client.chat.completions.create(
                messages=[
                    {
                        "role": "system",
                        "content": """
                        You are tasked with summarizing a group of related cluster names into a short, precise, and accurate overall description and name. Your goal is to create a concise summary that captures the essence of these clusters

                        The cluster name should be at most ten words long (perhaps less) and be specific but also reflective of most of the clusters that comprise them. The cluster name should be a sentence in the imperative that captures the user's request. For example, 'Brainstorm ideas for a birthday party' or 'Help me find a new job are good examples.

                        Do not shy away from describing socially harmful or sensitive topics. For instance, "Write fantasy sexual roleplay with octopi and monsters", "Generate blog spam for gambling websites", or "Assist with high school math homework" would be better than "Write erotic content" or "Help with homework". Be as descriptive as possible and assume neither good nor bad faith. Do not hesitate to identify and describe socially harmful or sensitive topics specifically; specificity is necessary for monitoring.

                        Ensure your summary and name accurately represent the clusters and are specific to the clusters.
                        """,
                    },
                    {
                        "role": "user",
                        "content": """
                        Here are the related cluster names
                        <clusters>
                            {% for cluster in clusters %}
                                <cluster>{{ cluster.name }}: {{ cluster.description }}</cluster>
                            {% endfor %}
                        </clusters>
                        """,
                    },
                ],
                context={"clusters": clusters},
                response_model=GeneratedCluster,
            )

            res = []

            new_cluster = Cluster(
                name=resp.name,
                description=resp.summary,
                slug=resp.slug,
                chat_ids=[
                    chat_id for cluster in clusters for chat_id in cluster.chat_ids
                ],
                parent_id=None,
            )

            res.append(new_cluster)

            for cluster in clusters:
                res.append(
                    Cluster(
                        id=cluster.id,
                        name=cluster.name,
                        description=cluster.description,
                        slug=cluster.slug,
                        chat_ids=cluster.chat_ids,
                        parent_id=new_cluster.id,
                    )
                )

            return res

    async def generate_meta_clusters(
        self, clusters: list[Cluster], show_preview: bool = True
    ) -> list[Cluster]:
        # Use a single Live display for the entire meta clustering operation
        if self.console and show_preview:
            try:
                from rich.progress import (
                    Progress,
                    SpinnerColumn,
                    TextColumn,
                    BarColumn,
                    TaskProgressColumn,
                    TimeRemainingColumn,
                )
                from rich.live import Live
                from rich.layout import Layout
                from rich.panel import Panel
                from rich.text import Text
                from rich.errors import LiveError

                # Create layout for the entire meta clustering operation
                layout = Layout()
                layout.split_column(
                    Layout(
                        name="progress", size=6
                    ),  # More space for multiple progress bars
                    Layout(name="preview"),
                )

                # Create progress display
                progress = Progress(
                    SpinnerColumn(),
                    TextColumn("[progress.description]{task.description}"),
                    BarColumn(),
                    TaskProgressColumn(),
                    TimeRemainingColumn(),
                    console=self.console,
                )
                layout["progress"].update(progress)

                preview_buffer = []
                max_preview_items = 3

                try:
                    with Live(layout, console=self.console, refresh_per_second=4):
                        # Step 1: Generate candidate clusters
                        candidate_labels = await self.generate_candidate_clusters(
                            clusters, Semaphore(self.max_concurrent_requests)
                        )

                        # Step 2: Label clusters with progress
                        label_task_id = progress.add_task(
                            "[cyan]Labeling clusters...", total=len(clusters)
                        )
                        cluster_labels = []
                        for i, cluster in enumerate(clusters):
                            result = await self.label_cluster(cluster, candidate_labels)
                            cluster_labels.append(result)
                            progress.update(label_task_id, completed=i + 1)

                        # Group clusters by label
                        label_to_clusters = {}
                        for label in cluster_labels:
                            if label["label"] not in label_to_clusters:
                                label_to_clusters[label["label"]] = []
                            label_to_clusters[label["label"]].append(label["cluster"])

                        # Step 3: Rename cluster groups with progress and preview
                        rename_task_id = progress.add_task(
                            "[cyan]Renaming cluster groups...",
                            total=len(label_to_clusters),
                        )
                        new_clusters = []
                        for i, cluster_group in enumerate(label_to_clusters.values()):
                            result = await self.rename_cluster_group(cluster_group)
                            new_clusters.append(result)
                            progress.update(rename_task_id, completed=i + 1)

                            # Update preview with new meta clusters
                            for cluster in result:
                                if (
                                    hasattr(cluster, "name")
                                    and hasattr(cluster, "description")
                                    and cluster.parent_id is None
                                ):
                                    preview_buffer.append(cluster)
                                    if len(preview_buffer) > max_preview_items:
                                        preview_buffer.pop(0)

                            # Update preview display
                            if preview_buffer:
                                preview_text = Text()
                                for j, cluster in enumerate(preview_buffer):
                                    preview_text.append(
                                        "Meta Cluster: ", style="bold magenta"
                                    )
                                    preview_text.append(
                                        f"{cluster.name[:80]}...\n", style="bold white"
                                    )
                                    preview_text.append(
                                        "Description: ", style="bold cyan"
                                    )
                                    preview_text.append(
                                        f"{cluster.description[:100]}...\n\n",
                                        style="dim white",
                                    )

                                layout["preview"].update(
                                    Panel(
                                        preview_text,
                                        title=f"[magenta]Recent Meta Clusters ({len(preview_buffer)}/{max_preview_items})",
                                        border_style="magenta",
                                    )
                                )

                        # Flatten results
                        res = []
                        for new_cluster in new_clusters:
                            res.extend(new_cluster)

                        return res

                except LiveError:
                    # Fall back to the original method without Live display
                    return await self._generate_meta_clusters_fallback(clusters)

            except ImportError:
                # Rich not available, fall back
                return await self._generate_meta_clusters_fallback(clusters)
        else:
            # No console or preview disabled, use original method
            return await self._generate_meta_clusters_fallback(clusters)

    async def _generate_meta_clusters_fallback(
        self, clusters: list[Cluster]
    ) -> list[Cluster]:
        """Fallback method for generate_meta_clusters when Live display is not available"""
        candidate_labels = await self.generate_candidate_clusters(
            clusters, Semaphore(self.max_concurrent_requests)
        )

        cluster_labels = await self._gather_with_progress(
            [self.label_cluster(cluster, candidate_labels) for cluster in clusters],
            desc="Labeling clusters",
            disable=False,
            show_preview=False,  # Disable preview to avoid nested Live displays
        )

        label_to_clusters = {}
        for label in cluster_labels:
            if label["label"] not in label_to_clusters:
                label_to_clusters[label["label"]] = []

            label_to_clusters[label["label"]].append(label["cluster"])

        new_clusters = await self._gather_with_progress(
            [
                self.rename_cluster_group(cluster)
                for cluster in label_to_clusters.values()
            ],
            desc="Renaming cluster groups",
            show_preview=False,  # Disable preview to avoid nested Live displays
        )

        res = []
        for new_cluster in new_clusters:
            res.extend(new_cluster)

        return res

    async def reduce_clusters(self, clusters: list[Cluster]) -> list[Cluster]:
        """
        This takes in a list of existing clusters and generates a few higher order clusters that are more general. This represents a single iteration of the meta clustering process.

        In the event that we have a single cluster, we will just return a new higher level cluster which has the same name as the original cluster. ( This is an edge case which we should definitely handle better )
        """
        if not clusters:
            return []

        if len(clusters) == 1:
            logger.info("Only one cluster, returning it as a meta cluster")
            new_cluster = Cluster(
                name=clusters[0].name,
                description=clusters[0].description,
                slug=clusters[0].slug,
                chat_ids=clusters[0].chat_ids,
                parent_id=None,
            )
            return [new_cluster, clusters[0]]

        texts_to_embed = [str(cluster) for cluster in clusters]

        logger.info(
            f"Embedding {len(texts_to_embed)} clusters for meta-clustering using {type(self.embedding_model).__name__}..."
        )

        cluster_embeddings = await self.embedding_model.embed(texts_to_embed)

        if not cluster_embeddings or len(cluster_embeddings) != len(clusters):
            logger.error(
                "Error: Number of embeddings does not match number of clusters or embeddings are empty for meta-clustering."
            )
            return []

        clusters_and_embeddings = [
            {
                "item": cluster,
                "embedding": embedding,
            }
            for cluster, embedding in zip(clusters, cluster_embeddings)
        ]

        cluster_id_to_clusters: dict[int, list[Cluster]] = (
            self.clustering_model.cluster(clusters_and_embeddings)
        )  # type: ignore

        new_clusters = await self._gather_with_progress(
            [
                self.generate_meta_clusters(
                    cluster_id_to_clusters[cluster_id], show_preview=True
                )
                for cluster_id in cluster_id_to_clusters
            ],
            desc="Generating Meta Clusters",
            show_preview=True,
        )

        res = []
        for new_cluster in new_clusters:
            res.extend(new_cluster)

        return res

`checkpoint_filename: str` `property` ¶

The filename to use for checkpointing this model's output.

`client = instructor.from_provider(model, async_client=True)` `instance-attribute` ¶

`clustering_model = clustering_model` `instance-attribute` ¶

`console = console` `instance-attribute` ¶

`embedding_model = embedding_model` `instance-attribute` ¶

`max_clusters = max_clusters` `instance-attribute` ¶

`max_concurrent_requests = max_concurrent_requests` `instance-attribute` ¶

`model = model` `instance-attribute` ¶

`sem = Semaphore(max_concurrent_requests)` `instance-attribute` ¶

`init(max_concurrent_requests: int = 50, model: str = 'openai/gpt-4o-mini', embedding_model: Optional[BaseEmbeddingModel] = None, clustering_model: Union[BaseClusteringMethod, None] = None, max_clusters: int = 10, console: Optional['Console'] = None, **kwargs)` ¶

Source code in kura/meta_cluster.py

def __init__(
    self,
    max_concurrent_requests: int = 50,
    model: str = "openai/gpt-4o-mini",
    embedding_model: Optional[BaseEmbeddingModel] = None,
    clustering_model: Union[BaseClusteringMethod, None] = None,
    max_clusters: int = 10,
    console: Optional["Console"] = None,
    **kwargs,  # For future use
):
    if clustering_model is None:
        from kura.cluster import KmeansClusteringModel

        clustering_model = KmeansClusteringModel(12)

    self.max_concurrent_requests = max_concurrent_requests
    self.sem = Semaphore(max_concurrent_requests)

    import instructor

    self.client = instructor.from_provider(model, async_client=True)
    self.console = console
    self.max_clusters = max_clusters

    if embedding_model is None:
        embedding_model = OpenAIEmbeddingModel()

    self.embedding_model = embedding_model
    self.clustering_model = clustering_model
    self.model = model
    self.console = console

    logger.info(
        f"Initialized MetaClusterModel with model={model}, max_concurrent_requests={max_concurrent_requests}, embedding_model={type(embedding_model).__name__}, clustering_model={type(clustering_model).__name__}, max_clusters={max_clusters}"
    )

    # Debug: Check if console is set
    if self.console:
        logger.debug(f"Console is set to {type(self.console)}")
    else:
        logger.debug("Console is None - Rich progress bars will not be available")

`generate_candidate_clusters(clusters: list[Cluster], sem: Semaphore) -> list[str]` `async` ¶

Source code in kura/meta_cluster.py

async def generate_candidate_clusters(
    self, clusters: list[Cluster], sem: Semaphore
) -> list[str]:
    async with sem:
        resp = await self.client.chat.completions.create(
            messages=[
                {
                    "role": "user",
                    "content": """
            You are tasked with creating higher-level cluster names based on a given list of clusters and their descriptions. Your goal is to come up with broader categories that could encompass one or more of the provided clusters

            First, review the list of clusters and their descriptions:
            <cluster_list>
                {% for cluster in clusters %}
                <cluster>{{ cluster.name }}: {{ cluster.description }}</cluster>
                {% endfor %}
            </cluster_list>

            Your task is to create at most {{ desired_number }} higher-level cluster names that could potentially include one or more of the provided clusters. These higher-level clusters should represent broader categories or themes that emerge from the given clusters, while remaining as specific as possible. If there are many clusters with a specific theme, ensure that the higher-level cluster name remains the maximum level of specificity. You are helping to organize user behavior data in order to improve safety, monitoring, and observability. You can generate less than {{ desired_number }} names if you feel that fewer are appropriate and accurately capture the clusters.

            Guidelines for creating higher-level clusters names
            1. Analyze the themes, topics or characteristics common to multiple clusters.
            2. Create names that are specific enough to be meaningful but but not so specific that they can't meaningfully represent many different clusters. Avoid overly general or vague terms, and do not hesitate to describe socially harmful or sensitive topics (in fact, clusters that clearly describe harmful behavior are slightly preferred); specificity is necessary for observability and enforcement.
            3. Ensure that the higher-level cluster names are distinct from one another.
            4. Use clear, concise, and descriptive language for the cluster names. Assume neither good nor bad faith for the content in the clusters.

            Think about the relationships between the given clusters and potential overarching themes.

            Focus on creating meaningful, distinct and precise ( but not overly specific ) higher-level cluster names that could encompass multiple sub-clusters.
            """.strip(),
                },
            ],
            response_model=CandidateClusters,
            context={
                "clusters": clusters,
                "desired_number": math.ceil(len(clusters) / 2)
                if len(clusters)
                >= 3  # If we have two clusters we just merge them tbh
                else 1,
            },
            max_retries=3,
        )
        return resp.candidate_cluster_names

`generate_meta_clusters(clusters: list[Cluster], show_preview: bool = True) -> list[Cluster]` `async` ¶

Source code in kura/meta_cluster.py

async def generate_meta_clusters(
    self, clusters: list[Cluster], show_preview: bool = True
) -> list[Cluster]:
    # Use a single Live display for the entire meta clustering operation
    if self.console and show_preview:
        try:
            from rich.progress import (
                Progress,
                SpinnerColumn,
                TextColumn,
                BarColumn,
                TaskProgressColumn,
                TimeRemainingColumn,
            )
            from rich.live import Live
            from rich.layout import Layout
            from rich.panel import Panel
            from rich.text import Text
            from rich.errors import LiveError

            # Create layout for the entire meta clustering operation
            layout = Layout()
            layout.split_column(
                Layout(
                    name="progress", size=6
                ),  # More space for multiple progress bars
                Layout(name="preview"),
            )

            # Create progress display
            progress = Progress(
                SpinnerColumn(),
                TextColumn("[progress.description]{task.description}"),
                BarColumn(),
                TaskProgressColumn(),
                TimeRemainingColumn(),
                console=self.console,
            )
            layout["progress"].update(progress)

            preview_buffer = []
            max_preview_items = 3

            try:
                with Live(layout, console=self.console, refresh_per_second=4):
                    # Step 1: Generate candidate clusters
                    candidate_labels = await self.generate_candidate_clusters(
                        clusters, Semaphore(self.max_concurrent_requests)
                    )

                    # Step 2: Label clusters with progress
                    label_task_id = progress.add_task(
                        "[cyan]Labeling clusters...", total=len(clusters)
                    )
                    cluster_labels = []
                    for i, cluster in enumerate(clusters):
                        result = await self.label_cluster(cluster, candidate_labels)
                        cluster_labels.append(result)
                        progress.update(label_task_id, completed=i + 1)

                    # Group clusters by label
                    label_to_clusters = {}
                    for label in cluster_labels:
                        if label["label"] not in label_to_clusters:
                            label_to_clusters[label["label"]] = []
                        label_to_clusters[label["label"]].append(label["cluster"])

                    # Step 3: Rename cluster groups with progress and preview
                    rename_task_id = progress.add_task(
                        "[cyan]Renaming cluster groups...",
                        total=len(label_to_clusters),
                    )
                    new_clusters = []
                    for i, cluster_group in enumerate(label_to_clusters.values()):
                        result = await self.rename_cluster_group(cluster_group)
                        new_clusters.append(result)
                        progress.update(rename_task_id, completed=i + 1)

                        # Update preview with new meta clusters
                        for cluster in result:
                            if (
                                hasattr(cluster, "name")
                                and hasattr(cluster, "description")
                                and cluster.parent_id is None
                            ):
                                preview_buffer.append(cluster)
                                if len(preview_buffer) > max_preview_items:
                                    preview_buffer.pop(0)

                        # Update preview display
                        if preview_buffer:
                            preview_text = Text()
                            for j, cluster in enumerate(preview_buffer):
                                preview_text.append(
                                    "Meta Cluster: ", style="bold magenta"
                                )
                                preview_text.append(
                                    f"{cluster.name[:80]}...\n", style="bold white"
                                )
                                preview_text.append(
                                    "Description: ", style="bold cyan"
                                )
                                preview_text.append(
                                    f"{cluster.description[:100]}...\n\n",
                                    style="dim white",
                                )

                            layout["preview"].update(
                                Panel(
                                    preview_text,
                                    title=f"[magenta]Recent Meta Clusters ({len(preview_buffer)}/{max_preview_items})",
                                    border_style="magenta",
                                )
                            )

                    # Flatten results
                    res = []
                    for new_cluster in new_clusters:
                        res.extend(new_cluster)

                    return res

            except LiveError:
                # Fall back to the original method without Live display
                return await self._generate_meta_clusters_fallback(clusters)

        except ImportError:
            # Rich not available, fall back
            return await self._generate_meta_clusters_fallback(clusters)
    else:
        # No console or preview disabled, use original method
        return await self._generate_meta_clusters_fallback(clusters)

`label_cluster(cluster: Cluster, candidate_clusters: list[str])` `async` ¶

Source code in kura/meta_cluster.py

    async def label_cluster(self, cluster: Cluster, candidate_clusters: list[str]):
        async with self.sem:
            resp = await self.client.chat.completions.create(
                messages=[
                    {
                        "role": "user",
                        "content": """
You are tasked with categorizing a specific cluster into one of the provided higher-level clusters for observability, monitoring, and content moderation. Your goal is to determine which higher-level cluster best fits the given specific cluster based on its name and description.

First, here are the ONLY valid higher-level clusters you may select from:
<higher_level_clusters>
{% for cluster in candidate_clusters %}
<higher_level_cluster>{{ cluster }}</higher_level_cluster>
{% endfor %}
</higher_level_clusters>

Here is the specific cluster to categorize:
<specific_cluster>
Name: {{ cluster.name }}
Description: {{ cluster.description }}
</specific_cluster>

RULES:
1. You MUST select EXACTLY ONE higher-level cluster from the provided list
2. You MUST output the higher-level cluster name EXACTLY as written - no modifications allowed
3. You MUST NOT create new cluster names or combinations
4. You MUST NOT output any additional text or explanations
5. You MUST NOT use partial matches or approximate names

CLASSIFICATION PROCESS:
1. First, record the exact list of valid higher-level clusters
2. Read the specific cluster's name and description carefully
3. Compare the specific cluster's key characteristics against each valid higher-level cluster
4. Select the single most appropriate higher-level cluster that encompasses the specific cluster
5. Verify your selected cluster exactly matches one from the valid list
6. Output ONLY the selected higher-level cluster name, exactly as it appeared in the valid list

Here is the specific cluster to categorize:

<specific_cluster>
Name: {{ cluster.name }}
Description: {{ cluster.description }}
</specific_cluster>

Based on this information, determine the most appropriate higher-level cluster and provide your answer as instructed.
                        """,
                    }
                ],
                response_model=ClusterLabel,
                context={
                    "cluster": cluster,
                    "candidate_clusters": candidate_clusters,
                },
                max_retries=3,
            )
            return {
                "cluster": cluster,
                "label": resp.higher_level_cluster,
            }

`reduce_clusters(clusters: list[Cluster]) -> list[Cluster]` `async` ¶

This takes in a list of existing clusters and generates a few higher order clusters that are more general. This represents a single iteration of the meta clustering process.

In the event that we have a single cluster, we will just return a new higher level cluster which has the same name as the original cluster. ( This is an edge case which we should definitely handle better )

Source code in kura/meta_cluster.py

async def reduce_clusters(self, clusters: list[Cluster]) -> list[Cluster]:
    """
    This takes in a list of existing clusters and generates a few higher order clusters that are more general. This represents a single iteration of the meta clustering process.

    In the event that we have a single cluster, we will just return a new higher level cluster which has the same name as the original cluster. ( This is an edge case which we should definitely handle better )
    """
    if not clusters:
        return []

    if len(clusters) == 1:
        logger.info("Only one cluster, returning it as a meta cluster")
        new_cluster = Cluster(
            name=clusters[0].name,
            description=clusters[0].description,
            slug=clusters[0].slug,
            chat_ids=clusters[0].chat_ids,
            parent_id=None,
        )
        return [new_cluster, clusters[0]]

    texts_to_embed = [str(cluster) for cluster in clusters]

    logger.info(
        f"Embedding {len(texts_to_embed)} clusters for meta-clustering using {type(self.embedding_model).__name__}..."
    )

    cluster_embeddings = await self.embedding_model.embed(texts_to_embed)

    if not cluster_embeddings or len(cluster_embeddings) != len(clusters):
        logger.error(
            "Error: Number of embeddings does not match number of clusters or embeddings are empty for meta-clustering."
        )
        return []

    clusters_and_embeddings = [
        {
            "item": cluster,
            "embedding": embedding,
        }
        for cluster, embedding in zip(clusters, cluster_embeddings)
    ]

    cluster_id_to_clusters: dict[int, list[Cluster]] = (
        self.clustering_model.cluster(clusters_and_embeddings)
    )  # type: ignore

    new_clusters = await self._gather_with_progress(
        [
            self.generate_meta_clusters(
                cluster_id_to_clusters[cluster_id], show_preview=True
            )
            for cluster_id in cluster_id_to_clusters
        ],
        desc="Generating Meta Clusters",
        show_preview=True,
    )

    res = []
    for new_cluster in new_clusters:
        res.extend(new_cluster)

    return res

`rename_cluster_group(clusters: list[Cluster]) -> list[Cluster]` `async` ¶

Source code in kura/meta_cluster.py

async def rename_cluster_group(self, clusters: list[Cluster]) -> list[Cluster]:
    async with self.sem:
        resp = await self.client.chat.completions.create(
            messages=[
                {
                    "role": "system",
                    "content": """
                    You are tasked with summarizing a group of related cluster names into a short, precise, and accurate overall description and name. Your goal is to create a concise summary that captures the essence of these clusters

                    The cluster name should be at most ten words long (perhaps less) and be specific but also reflective of most of the clusters that comprise them. The cluster name should be a sentence in the imperative that captures the user's request. For example, 'Brainstorm ideas for a birthday party' or 'Help me find a new job are good examples.

                    Do not shy away from describing socially harmful or sensitive topics. For instance, "Write fantasy sexual roleplay with octopi and monsters", "Generate blog spam for gambling websites", or "Assist with high school math homework" would be better than "Write erotic content" or "Help with homework". Be as descriptive as possible and assume neither good nor bad faith. Do not hesitate to identify and describe socially harmful or sensitive topics specifically; specificity is necessary for monitoring.

                    Ensure your summary and name accurately represent the clusters and are specific to the clusters.
                    """,
                },
                {
                    "role": "user",
                    "content": """
                    Here are the related cluster names
                    <clusters>
                        {% for cluster in clusters %}
                            <cluster>{{ cluster.name }}: {{ cluster.description }}</cluster>
                        {% endfor %}
                    </clusters>
                    """,
                },
            ],
            context={"clusters": clusters},
            response_model=GeneratedCluster,
        )

        res = []

        new_cluster = Cluster(
            name=resp.name,
            description=resp.summary,
            slug=resp.slug,
            chat_ids=[
                chat_id for cluster in clusters for chat_id in cluster.chat_ids
            ],
            parent_id=None,
        )

        res.append(new_cluster)

        for cluster in clusters:
            res.append(
                Cluster(
                    id=cluster.id,
                    name=cluster.name,
                    description=cluster.description,
                    slug=cluster.slug,
                    chat_ids=cluster.chat_ids,
                    parent_id=new_cluster.id,
                )
            )

        return res

`reduce_clusters_from_base_clusters(clusters: list[Cluster], *, model: BaseMetaClusterModel, checkpoint_manager: Optional[BaseCheckpointManager] = None) -> list[Cluster]` `async` ¶

Reduce clusters into a hierarchical structure.

Iteratively combines similar clusters until the number of root clusters is less than or equal to the model's max_clusters setting.

Parameters:

Name	Type	Description	Default
`clusters`	`list[Cluster]`	List of initial clusters to reduce	required
`model`	`BaseMetaClusterModel`	Meta-clustering model to use for reduction	required
`checkpoint_manager`	`Optional[BaseCheckpointManager]`	Optional checkpoint manager for caching	`None`

Returns:

Type	Description
`list[Cluster]`	List of clusters with hierarchical structure

Example

meta_model = MetaClusterModel(max_clusters=5) reduced = await reduce_clusters( ... clusters=base_clusters, ... model=meta_model, ... checkpoint_manager=checkpoint_mgr ... )

Source code in kura/meta_cluster.py

async def reduce_clusters_from_base_clusters(
    clusters: list[Cluster],
    *,
    model: BaseMetaClusterModel,
    checkpoint_manager: Optional[BaseCheckpointManager] = None,
) -> list[Cluster]:
    """Reduce clusters into a hierarchical structure.

    Iteratively combines similar clusters until the number of root clusters
    is less than or equal to the model's max_clusters setting.

    Args:
        clusters: List of initial clusters to reduce
        model: Meta-clustering model to use for reduction
        checkpoint_manager: Optional checkpoint manager for caching

    Returns:
        List of clusters with hierarchical structure

    Example:
        >>> meta_model = MetaClusterModel(max_clusters=5)
        >>> reduced = await reduce_clusters(
        ...     clusters=base_clusters,
        ...     model=meta_model,
        ...     checkpoint_manager=checkpoint_mgr
        ... )
    """
    logger.info(
        f"Starting cluster reduction from {len(clusters)} initial clusters using {type(model).__name__}"
    )

    # Try to load from checkpoint
    if checkpoint_manager:
        cached = checkpoint_manager.load_checkpoint(model.checkpoint_filename, Cluster)
        if cached:
            root_count = len([c for c in cached if c.parent_id is None])
            logger.info(
                f"Loaded {len(cached)} clusters from checkpoint ({root_count} root clusters)"
            )
            return cached

    # Start with all clusters as potential roots
    all_clusters = clusters.copy()
    root_clusters = clusters.copy()

    # Get max_clusters from model if available, otherwise use default
    max_clusters = getattr(model, "max_clusters", 10)
    logger.info(f"Starting with {len(root_clusters)} clusters, target: {max_clusters}")

    # Iteratively reduce until we have desired number of root clusters
    while len(root_clusters) > max_clusters:
        # Get updated clusters from meta-clustering
        new_current_level = await model.reduce_clusters(root_clusters)

        # Find new root clusters (those without parents)
        root_clusters = [c for c in new_current_level if c.parent_id is None]

        # Remove old clusters that now have parents
        old_cluster_ids = {c.id for c in new_current_level if c.parent_id}
        all_clusters = [c for c in all_clusters if c.id not in old_cluster_ids]

        # Add new clusters to the complete list
        all_clusters.extend(new_current_level)

        logger.info(f"Reduced to {len(root_clusters)} root clusters")

    logger.info(
        f"Cluster reduction complete: {len(all_clusters)} total clusters, {len(root_clusters)} root clusters"
    )

    # Save to checkpoint
    if checkpoint_manager:
        checkpoint_manager.save_checkpoint(model.checkpoint_filename, all_clusters)

    return all_clusters

Dimensionality Reduction¶

`kura.dimensionality` ¶

`logger = logging.getLogger(name)` `module-attribute` ¶

`HDBUMAP` ¶

Bases: BaseDimensionalityReduction

Source code in kura/dimensionality.py

class HDBUMAP(BaseDimensionalityReduction):
    @property
    def checkpoint_filename(self) -> str:
        """The filename to use for checkpointing this model's output."""
        return "dimensionality"

    def __init__(
        self,
        embedding_model: BaseEmbeddingModel = OpenAIEmbeddingModel(),
        n_components: int = 2,
        min_dist: float = 0.1,
        metric: str = "cosine",
        n_neighbors: Union[int, None] = None,
    ):
        self.embedding_model = embedding_model
        self.n_components = n_components
        self.min_dist = min_dist
        self.metric = metric
        self.n_neighbors = n_neighbors
        logger.info(
            f"Initialized HDBUMAP with embedding_model={type(embedding_model).__name__}, n_components={n_components}, min_dist={min_dist}, metric={metric}, n_neighbors={n_neighbors}"
        )

    async def reduce_dimensionality(
        self, clusters: list[Cluster]
    ) -> list[ProjectedCluster]:
        # Embed all clusters
        from umap import UMAP

        if not clusters:
            logger.warning("Empty clusters list provided to reduce_dimensionality")
            return []

        logger.info(f"Starting dimensionality reduction for {len(clusters)} clusters")
        texts_to_embed = [str(c) for c in clusters]

        try:
            cluster_embeddings = await self.embedding_model.embed(texts_to_embed)
            logger.debug(f"Generated embeddings for {len(clusters)} clusters")
        except Exception as e:
            logger.error(f"Failed to generate embeddings for clusters: {e}")
            raise

        if not cluster_embeddings or len(cluster_embeddings) != len(texts_to_embed):
            logger.error(
                f"Error: Number of embeddings ({len(cluster_embeddings) if cluster_embeddings else 0}) does not match number of clusters ({len(texts_to_embed)}) or embeddings are empty."
            )
            return []

        embeddings = np.array(cluster_embeddings)
        logger.debug(f"Created embedding matrix of shape {embeddings.shape}")

        # Project to 2D using UMAP
        n_neighbors_actual = (
            self.n_neighbors if self.n_neighbors else min(15, len(embeddings) - 1)
        )
        logger.debug(
            f"Using UMAP with n_neighbors={n_neighbors_actual}, min_dist={self.min_dist}, metric={self.metric}"
        )

        try:
            umap_reducer = UMAP(
                n_components=self.n_components,
                n_neighbors=n_neighbors_actual,
                min_dist=self.min_dist,
                metric=self.metric,
            )
            reduced_embeddings = umap_reducer.fit_transform(embeddings)
            logger.info(
                f"UMAP dimensionality reduction completed: {embeddings.shape} -> {reduced_embeddings.shape}"  # type: ignore
            )
        except Exception as e:
            logger.error(f"UMAP dimensionality reduction failed: {e}")
            raise

        # Create projected clusters with 2D coordinates
        res = []
        for i, cluster in enumerate(clusters):
            projected = ProjectedCluster(
                slug=cluster.slug,
                id=cluster.id,
                name=cluster.name,
                description=cluster.description,
                chat_ids=cluster.chat_ids,
                parent_id=cluster.parent_id,
                x_coord=float(reduced_embeddings[i][0]),  # pyright: ignore
                y_coord=float(reduced_embeddings[i][1]),  # pyright: ignore
                level=0,
            )
            res.append(projected)

        res = calculate_cluster_levels(res)

        logger.info(f"Successfully created {len(res)} projected clusters")
        return res

`checkpoint_filename: str` `property` ¶

The filename to use for checkpointing this model's output.

`embedding_model = embedding_model` `instance-attribute` ¶

`metric = metric` `instance-attribute` ¶

`min_dist = min_dist` `instance-attribute` ¶

`n_components = n_components` `instance-attribute` ¶

`n_neighbors = n_neighbors` `instance-attribute` ¶

`init(embedding_model: BaseEmbeddingModel = OpenAIEmbeddingModel(), n_components: int = 2, min_dist: float = 0.1, metric: str = 'cosine', n_neighbors: Union[int, None] = None)` ¶

Source code in kura/dimensionality.py

def __init__(
    self,
    embedding_model: BaseEmbeddingModel = OpenAIEmbeddingModel(),
    n_components: int = 2,
    min_dist: float = 0.1,
    metric: str = "cosine",
    n_neighbors: Union[int, None] = None,
):
    self.embedding_model = embedding_model
    self.n_components = n_components
    self.min_dist = min_dist
    self.metric = metric
    self.n_neighbors = n_neighbors
    logger.info(
        f"Initialized HDBUMAP with embedding_model={type(embedding_model).__name__}, n_components={n_components}, min_dist={min_dist}, metric={metric}, n_neighbors={n_neighbors}"
    )

`reduce_dimensionality(clusters: list[Cluster]) -> list[ProjectedCluster]` `async` ¶

Source code in kura/dimensionality.py

async def reduce_dimensionality(
    self, clusters: list[Cluster]
) -> list[ProjectedCluster]:
    # Embed all clusters
    from umap import UMAP

    if not clusters:
        logger.warning("Empty clusters list provided to reduce_dimensionality")
        return []

    logger.info(f"Starting dimensionality reduction for {len(clusters)} clusters")
    texts_to_embed = [str(c) for c in clusters]

    try:
        cluster_embeddings = await self.embedding_model.embed(texts_to_embed)
        logger.debug(f"Generated embeddings for {len(clusters)} clusters")
    except Exception as e:
        logger.error(f"Failed to generate embeddings for clusters: {e}")
        raise

    if not cluster_embeddings or len(cluster_embeddings) != len(texts_to_embed):
        logger.error(
            f"Error: Number of embeddings ({len(cluster_embeddings) if cluster_embeddings else 0}) does not match number of clusters ({len(texts_to_embed)}) or embeddings are empty."
        )
        return []

    embeddings = np.array(cluster_embeddings)
    logger.debug(f"Created embedding matrix of shape {embeddings.shape}")

    # Project to 2D using UMAP
    n_neighbors_actual = (
        self.n_neighbors if self.n_neighbors else min(15, len(embeddings) - 1)
    )
    logger.debug(
        f"Using UMAP with n_neighbors={n_neighbors_actual}, min_dist={self.min_dist}, metric={self.metric}"
    )

    try:
        umap_reducer = UMAP(
            n_components=self.n_components,
            n_neighbors=n_neighbors_actual,
            min_dist=self.min_dist,
            metric=self.metric,
        )
        reduced_embeddings = umap_reducer.fit_transform(embeddings)
        logger.info(
            f"UMAP dimensionality reduction completed: {embeddings.shape} -> {reduced_embeddings.shape}"  # type: ignore
        )
    except Exception as e:
        logger.error(f"UMAP dimensionality reduction failed: {e}")
        raise

    # Create projected clusters with 2D coordinates
    res = []
    for i, cluster in enumerate(clusters):
        projected = ProjectedCluster(
            slug=cluster.slug,
            id=cluster.id,
            name=cluster.name,
            description=cluster.description,
            chat_ids=cluster.chat_ids,
            parent_id=cluster.parent_id,
            x_coord=float(reduced_embeddings[i][0]),  # pyright: ignore
            y_coord=float(reduced_embeddings[i][1]),  # pyright: ignore
            level=0,
        )
        res.append(projected)

    res = calculate_cluster_levels(res)

    logger.info(f"Successfully created {len(res)} projected clusters")
    return res

`reduce_dimensionality_from_clusters(clusters: list[Cluster], *, model: BaseDimensionalityReduction, checkpoint_manager: Optional[BaseCheckpointManager] = None) -> list[ProjectedCluster]` `async` ¶

Reduce dimensions of clusters for visualization.

Projects clusters to 2D space using the provided dimensionality reduction model. Supports different algorithms (UMAP, t-SNE, PCA, etc.) through the model interface.

Parameters:

Name	Type	Description	Default
`clusters`	`list[Cluster]`	List of clusters to project	required
`model`	`BaseDimensionalityReduction`	Dimensionality reduction model to use (UMAP, t-SNE, etc.)	required
`checkpoint_manager`	`Optional[BaseCheckpointManager]`	Optional checkpoint manager for caching	`None`

Returns:

Type	Description
`list[ProjectedCluster]`	List of projected clusters with 2D coordinates

Example

dim_model = HDBUMAP(n_components=2) projected = await reduce_dimensionality( ... clusters=hierarchical_clusters, ... model=dim_model, ... checkpoint_manager=checkpoint_mgr ... )

Source code in kura/dimensionality.py

async def reduce_dimensionality_from_clusters(
    clusters: list[Cluster],
    *,
    model: BaseDimensionalityReduction,
    checkpoint_manager: Optional[BaseCheckpointManager] = None,
) -> list[ProjectedCluster]:
    """Reduce dimensions of clusters for visualization.

    Projects clusters to 2D space using the provided dimensionality reduction model.
    Supports different algorithms (UMAP, t-SNE, PCA, etc.) through the model interface.

    Args:
        clusters: List of clusters to project
        model: Dimensionality reduction model to use (UMAP, t-SNE, etc.)
        checkpoint_manager: Optional checkpoint manager for caching

    Returns:
        List of projected clusters with 2D coordinates

    Example:
        >>> dim_model = HDBUMAP(n_components=2)
        >>> projected = await reduce_dimensionality(
        ...     clusters=hierarchical_clusters,
        ...     model=dim_model,
        ...     checkpoint_manager=checkpoint_mgr
        ... )
    """
    logger.info(
        f"Starting dimensionality reduction for {len(clusters)} clusters using {type(model).__name__}"
    )

    # Try to load from checkpoint
    if checkpoint_manager:
        cached = checkpoint_manager.load_checkpoint(
            model.checkpoint_filename, ProjectedCluster
        )
        if cached:
            logger.info(f"Loaded {len(cached)} projected clusters from checkpoint")
            return cached

    # Reduce dimensionality
    logger.info("Projecting clusters to 2D space...")
    projected_clusters = await model.reduce_dimensionality(clusters)
    logger.info(f"Projected {len(projected_clusters)} clusters to 2D")

    # Save to checkpoint
    if checkpoint_manager:
        checkpoint_manager.save_checkpoint(
            model.checkpoint_filename, projected_clusters
        )

    return projected_clusters

API Reference¶

How to Use This Reference¶

Core Classes¶

Procedural API¶

Pipeline Functions¶

Custom fields available in metadata¶

kura.meta_cluster.reduce_clusters_from_base_clusters(clusters: list[Cluster], *, model: BaseMetaClusterModel, checkpoint_manager: Optional[BaseCheckpointManager] = None) -> list[Cluster] async ¶

kura.dimensionality.reduce_dimensionality_from_clusters(clusters: list[Cluster], *, model: BaseDimensionalityReduction, checkpoint_manager: Optional[BaseCheckpointManager] = None) -> list[ProjectedCluster] async ¶

Checkpoint Management¶

kura.checkpoint.CheckpointManager ¶

__init__(checkpoint_dir: str, *, enabled: bool = True) ¶

delete_checkpoint(filename: str) -> bool ¶

get_checkpoint_path(filename: str) -> Path ¶

list_checkpoints() -> List[str] ¶

load_checkpoint(filename: str, model_class: type[T], **kwargs) -> Optional[List[T]] ¶

save_checkpoint(filename: str, data: List[T], **kwargs) -> None ¶

setup_checkpoint_dir() -> None ¶

Implementation Classes¶

Embedding Models¶

kura.embedding ¶

logger = logging.getLogger(__name__) module-attribute ¶

CohereEmbeddingModel ¶

client = AsyncClient(api_key=api_key) instance-attribute ¶

input_type = input_type instance-attribute ¶

model_name = model_name instance-attribute ¶

__init__(model_name: str = 'embed-v4.0', model_batch_size: int = 96, n_concurrent_jobs: int = 5, input_type: str = 'clustering', api_key: str | None = None) ¶

embed(texts: list[str]) -> list[list[float]] async ¶

slug() ¶

OpenAIEmbeddingModel ¶

client = AsyncOpenAI() instance-attribute ¶

model_name = model_name instance-attribute ¶

__init__(model_name: str = 'text-embedding-3-small', model_batch_size: int = 50, n_concurrent_jobs: int = 5) ¶

embed(texts: list[str]) -> list[list[float]] async ¶

slug() ¶

SentenceTransformerEmbeddingModel ¶

model = SentenceTransformer(model_name, device=device) instance-attribute ¶

model_name = model_name instance-attribute ¶

__init__(model_name: str = 'all-MiniLM-L6-v2', model_batch_size: int = 128, device: str = 'cpu') ¶

embed(texts: list[str]) -> list[list[float]] async ¶

slug() -> str ¶

embed_summaries(summaries: list[ConversationSummary], embedding_model: BaseEmbeddingModel) -> list[dict[str, Union[ConversationSummary, list[float]]]] async ¶

Summarization¶

kura.summarisation ¶

T = TypeVar('T', bound=GeneratedSummary) module-attribute ¶

logger = logging.getLogger(__name__) module-attribute ¶

SummaryModel ¶

sentiment & complexity will be in summaries[0].metadata¶

cache = cache instance-attribute ¶

checkpoint_filename: str property ¶

console = console instance-attribute ¶

max_concurrent_requests = max_concurrent_requests instance-attribute ¶

model = model instance-attribute ¶

__init__(model: Union[str, KnownModelName] = 'openai/gpt-4o-mini', max_concurrent_requests: int = 50, checkpoint_filename: str = 'summaries', console: Optional[Console] = None, cache: Optional[CacheStrategy] = None) ¶

summarise(conversations: list[Conversation], prompt: str = DEFAULT_SUMMARY_PROMPT, *, response_schema: Type[T] = GeneratedSummary, temperature: float = 0.2, **kwargs) -> list[ConversationSummary] async ¶

Access core fields¶

Access custom fields in metadata¶

Custom fields available in metadata¶

Clustering¶

kura.cluster ¶

logger = logging.getLogger(__name__) module-attribute ¶

ClusterDescriptionModel ¶

checkpoint_filename: str property ¶

console = console instance-attribute ¶

max_concurrent_requests = max_concurrent_requests instance-attribute ¶

model = model instance-attribute ¶

temperature = temperature instance-attribute ¶

__init__(model: Union[str, KnownModelName] = 'openai/gpt-4o-mini', max_concurrent_requests: int = 50, temperature: float = 0.2, checkpoint_filename: str = 'clusters', console: Optional[Console] = None) ¶

generate_cluster_description(summaries: List[ConversationSummary], contrastive_examples: List[ConversationSummary], semaphore: Semaphore, client: AsyncInstructor, prompt: str = DEFAULT_CLUSTER_PROMPT) -> Cluster async ¶

generate_clusters(cluster_id_to_summaries: Dict[int, List[ConversationSummary]], prompt: str = DEFAULT_CLUSTER_PROMPT, max_contrastive_examples: int = 10) -> List[Cluster] async ¶

KmeansClusteringModel ¶

clusters_per_group = clusters_per_group instance-attribute ¶

__init__(clusters_per_group: int = 10) ¶

cluster(items: list[dict[str, Union[ConversationSummary, list[float]]]]) -> dict[int, list[ConversationSummary]] ¶

get_contrastive_examples(cluster_id: int, cluster_id_to_summaries: Dict[int, List[ConversationSummary]], max_contrastive_examples: int = 10) -> List[ConversationSummary] ¶

Meta-Clustering¶

kura.meta_cluster ¶

logger = logging.getLogger(__name__) module-attribute ¶

CandidateClusters ¶

candidate_cluster_names: list[str] instance-attribute ¶

validate_candidate_cluster_names(v: list[str]) -> list[str] ¶

`kura.meta_cluster.reduce_clusters_from_base_clusters(clusters: list[Cluster], *, model: BaseMetaClusterModel, checkpoint_manager: Optional[BaseCheckpointManager] = None) -> list[Cluster]` `async` ¶

`kura.dimensionality.reduce_dimensionality_from_clusters(clusters: list[Cluster], *, model: BaseDimensionalityReduction, checkpoint_manager: Optional[BaseCheckpointManager] = None) -> list[ProjectedCluster]` `async` ¶

`kura.checkpoint.CheckpointManager` ¶

`init(checkpoint_dir: str, *, enabled: bool = True)` ¶

`delete_checkpoint(filename: str) -> bool` ¶

`get_checkpoint_path(filename: str) -> Path` ¶

`list_checkpoints() -> List[str]` ¶

`load_checkpoint(filename: str, model_class: type[T], **kwargs) -> Optional[List[T]]` ¶

`save_checkpoint(filename: str, data: List[T], **kwargs) -> None` ¶

`setup_checkpoint_dir() -> None` ¶

`kura.embedding` ¶

`logger = logging.getLogger(name)` `module-attribute` ¶

`CohereEmbeddingModel` ¶

`client = AsyncClient(api_key=api_key)` `instance-attribute` ¶

`input_type = input_type` `instance-attribute` ¶

`model_name = model_name` `instance-attribute` ¶

`init(model_name: str = 'embed-v4.0', model_batch_size: int = 96, n_concurrent_jobs: int = 5, input_type: str = 'clustering', api_key: str | None = None)` ¶

`embed(texts: list[str]) -> list[list[float]]` `async` ¶

`slug()` ¶

`OpenAIEmbeddingModel` ¶

`client = AsyncOpenAI()` `instance-attribute` ¶

`model_name = model_name` `instance-attribute` ¶

`init(model_name: str = 'text-embedding-3-small', model_batch_size: int = 50, n_concurrent_jobs: int = 5)` ¶

`embed(texts: list[str]) -> list[list[float]]` `async` ¶

`slug()` ¶

`SentenceTransformerEmbeddingModel` ¶

`model = SentenceTransformer(model_name, device=device)` `instance-attribute` ¶

`model_name = model_name` `instance-attribute` ¶

`init(model_name: str = 'all-MiniLM-L6-v2', model_batch_size: int = 128, device: str = 'cpu')` ¶

`embed(texts: list[str]) -> list[list[float]]` `async` ¶

`slug() -> str` ¶

`embed_summaries(summaries: list[ConversationSummary], embedding_model: BaseEmbeddingModel) -> list[dict[str, Union[ConversationSummary, list[float]]]]` `async` ¶

`kura.summarisation` ¶

`T = TypeVar('T', bound=GeneratedSummary)` `module-attribute` ¶

`logger = logging.getLogger(name)` `module-attribute` ¶

`SummaryModel` ¶

`cache = cache` `instance-attribute` ¶

`checkpoint_filename: str` `property` ¶

`console = console` `instance-attribute` ¶

`max_concurrent_requests = max_concurrent_requests` `instance-attribute` ¶

`model = model` `instance-attribute` ¶

`init(model: Union[str, KnownModelName] = 'openai/gpt-4o-mini', max_concurrent_requests: int = 50, checkpoint_filename: str = 'summaries', console: Optional[Console] = None, cache: Optional[CacheStrategy] = None)` ¶

`summarise(conversations: list[Conversation], prompt: str = DEFAULT_SUMMARY_PROMPT, *, response_schema: Type[T] = GeneratedSummary, temperature: float = 0.2, **kwargs) -> list[ConversationSummary]` `async` ¶

`kura.cluster` ¶

`logger = logging.getLogger(name)` `module-attribute` ¶

`ClusterDescriptionModel` ¶

`checkpoint_filename: str` `property` ¶

`console = console` `instance-attribute` ¶

`max_concurrent_requests = max_concurrent_requests` `instance-attribute` ¶

`model = model` `instance-attribute` ¶

`temperature = temperature` `instance-attribute` ¶

`init(model: Union[str, KnownModelName] = 'openai/gpt-4o-mini', max_concurrent_requests: int = 50, temperature: float = 0.2, checkpoint_filename: str = 'clusters', console: Optional[Console] = None)` ¶

`generate_cluster_description(summaries: List[ConversationSummary], contrastive_examples: List[ConversationSummary], semaphore: Semaphore, client: AsyncInstructor, prompt: str = DEFAULT_CLUSTER_PROMPT) -> Cluster` `async` ¶

`generate_clusters(cluster_id_to_summaries: Dict[int, List[ConversationSummary]], prompt: str = DEFAULT_CLUSTER_PROMPT, max_contrastive_examples: int = 10) -> List[Cluster]` `async` ¶

`KmeansClusteringModel` ¶

`clusters_per_group = clusters_per_group` `instance-attribute` ¶

`init(clusters_per_group: int = 10)` ¶

`cluster(items: list[dict[str, Union[ConversationSummary, list[float]]]]) -> dict[int, list[ConversationSummary]]` ¶

`get_contrastive_examples(cluster_id: int, cluster_id_to_summaries: Dict[int, List[ConversationSummary]], max_contrastive_examples: int = 10) -> List[ConversationSummary]` ¶

`kura.meta_cluster` ¶

`logger = logging.getLogger(name)` `module-attribute` ¶

`CandidateClusters` ¶

`candidate_cluster_names: list[str]` `instance-attribute` ¶

`validate_candidate_cluster_names(v: list[str]) -> list[str]` ¶

`ClusterLabel` ¶

`higher_level_cluster: str` `instance-attribute` ¶

`validate_higher_level_cluster(v: str, info: ValidationInfo) -> str` ¶

`MetaClusterModel` ¶

`checkpoint_filename: str` `property` ¶

`client = instructor.from_provider(model, async_client=True)` `instance-attribute` ¶

`clustering_model = clustering_model` `instance-attribute` ¶

`console = console` `instance-attribute` ¶

`embedding_model = embedding_model` `instance-attribute` ¶

`max_clusters = max_clusters` `instance-attribute` ¶

`max_concurrent_requests = max_concurrent_requests` `instance-attribute` ¶

`model = model` `instance-attribute` ¶

`sem = Semaphore(max_concurrent_requests)` `instance-attribute` ¶

`init(max_concurrent_requests: int = 50, model: str = 'openai/gpt-4o-mini', embedding_model: Optional[BaseEmbeddingModel] = None, clustering_model: Union[BaseClusteringMethod, None] = None, max_clusters: int = 10, console: Optional['Console'] = None, **kwargs)` ¶

`generate_candidate_clusters(clusters: list[Cluster], sem: Semaphore) -> list[str]` `async` ¶

`generate_meta_clusters(clusters: list[Cluster], show_preview: bool = True) -> list[Cluster]` `async` ¶