[Bug]: PromptCachingCache extract_cacheable_prefix broken when message.content is a string?

### What happened?

`PromptCachingCache`'s `extract_cacheable_prefix` function may return an empty prefix when `message.content` is a string:

https://github.com/BerriAI/litellm/blob/b86aae02121b33e54165ad8209c634a84d811b60/litellm/router_utils/prompt_caching_cache.py#L77-L80

This appears to break API requests with bodies that look like the following, which in my testing with LiteLLM connected to AWS Bedrock allows for cache creation/reads:
```json
{
    "model": model_id,
    "stream": False,
    "max_tokens": 1024,
    "messages": [
        {"role": "system", "content": "You are an LLM named Prompt Cache Helper"},
        {
            "role": "user", 
            "content": large_message,
            "cache_control": {
                "type": "ephemeral",
                "ttl": "5m"
            }
        },
    ],
}
```

But this approach works:
```json
{
    "model": model_id,
    "stream": False,
    "max_tokens": 1024,
    "messages": [
        {
            "role": "system",
            "content": "You are an Prompt Cache Helper"
        },
        {
            "role": "user", 
            "content": [
                {
                    "type": "text",
                    "text": large_message,
                    "cache_control": {
                        "type": "ephemeral",
                        "ttl": "5m" # or 5m
                    }
                },
            ]
        },
    ],
}
```

Note that these API specs for Claude and OpenAI don't say that `cache_control` can be a sibling key of `content`, so not sure if this is actually a bug, or if LiteLLM or Bedrock is just more flexible and allows for `cache_control` to be a sibling key...  
- https://platform.claude.com/docs/en/api/messages/create#:~:text=a%20single%20request.-,content,-%3A%20string%20or
- https://platform.openai.com/docs/api-reference/messages/createMessage#messages_createmessage-content

### Steps to Reproduce

1. Config to use the prompt caching precheck:
```yaml
router_settings:
  enable_pre_call_checks: true
  optional_pre_call_checks: ["prompt_caching"]
```
2. Send an API request for caching with the `cache_control` as a sibling of a `content` that is a string:
 ```json
{
    "model": model_id,
    "stream": False,
    "max_tokens": 1024,
    "messages": [
        {"role": "system", "content": "You are an LLM named Prompt Cache Helper"},
        {
            "role": "user", 
            "content": large_message,
            "cache_control": {
                "type": "ephemeral",
                "ttl": "5m"
            }
        },
    ],
}
```
3. Send a few of those API requests to see cache write and reads. For example, in the response you might see something like:
```json
  "usage": {
    "completion_tokens": 1024,
    "prompt_tokens": 5425,
    "total_tokens": 6449,
    "prompt_tokens_details": {
      "cached_tokens": 5425,
      "cache_creation_tokens": 0
    },
    "cache_creation_input_tokens": 0,
    "cache_read_input_tokens": 5425
  }
```
4. If you debug into the code, you'll notice that the cache key prefix is None, which is incorrect because the usage results in step 3 above showed that the cache is being used:
https://github.com/BerriAI/litellm/blob/b86aae02121b33e54165ad8209c634a84d811b60/litellm/router_utils/prompt_caching_cache.py#L214-L217

### Relevant log output

```shell

```

### What part of LiteLLM is this about?

Proxy

### What LiteLLM version are you on ?

v1.80.11-stable

### Twitter / LinkedIn details

_No response_

	for msg_idx, message in enumerate(messages):
	content = message.get("content")
	if not isinstance(content, list):
	continue

	# Generate cache key using cacheable prefix
	cache_key = PromptCachingCache.get_prompt_caching_cache_key(messages, tools)
	if cache_key is None:
	return None

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bug]: PromptCachingCache extract_cacheable_prefix broken when message.content is a string? #19228

What happened?

Steps to Reproduce

Relevant log output

What part of LiteLLM is this about?

What LiteLLM version are you on ?

Twitter / LinkedIn details

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Uh oh!

[Bug]: PromptCachingCache extract_cacheable_prefix broken when message.content is a string? #19228

Description

What happened?

Steps to Reproduce

Relevant log output

What part of LiteLLM is this about?

What LiteLLM version are you on ?

Twitter / LinkedIn details

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions