
Releases: friendliai/friendli-client

Release v1.4.1 🚀

19 Jun 06:13

Updating Patch Version

This patch version introduces explicit resource management to prevent unexpected resource leaks.
By default, the library closes the underlying HTTP and gRPC connections when the client is garbage-collected. You can now also close a Friendli or AsyncFriendli client explicitly with the .close() method, or use the client as a context manager so the connection is released when the with block exits.

Usage examples

```python
import asyncio
from friendli import AsyncFriendli

client = AsyncFriendli(base_url="0.0.0.0:8000", use_grpc=True)

async def run():
    async with client:
        stream = await client.completions.create(
            prompt="Explain what gRPC is. Also give me a Python code snippet of gRPC client.",
            stream=True,
            top_k=1,
        )

        async for chunk in stream:
            print(chunk.text, end="", flush=True)

asyncio.run(run())
```
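If a context manager is not convenient, the connection can be released with an explicit .close() call instead. A minimal sketch using the synchronous Friendli client against the same local gRPC endpoint as above (this assumes the synchronous client exposes the same completions.create parameters; check the SDK reference for your version):

```python
from friendli import Friendli

# Synchronous client pointed at the same local gRPC endpoint as above.
client = Friendli(base_url="0.0.0.0:8000", use_grpc=True)

try:
    stream = client.completions.create(
        prompt="Explain what gRPC is.",
        stream=True,
        top_k=1,
    )
    for chunk in stream:
        print(chunk.text, end="", flush=True)
finally:
    # Release the underlying connection deterministically instead of
    # relying on garbage collection.
    client.close()
```

The try/finally ensures .close() runs even if the request raises, which is exactly the guarantee the context-manager form provides.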

Release v1.4.0 🚀

18 Jun 06:45
  • gRPC client support for the completions API.

Release v1.3.7 🚀

12 Jun 03:10
  • Minor: add default values for the "index" and "text" fields of completion stream chunks.

Release v1.3.6 🚀

10 Jun 06:40
  • Support Phi3 FP8 conversion.
  • Hotfix for safetensor checkpoint saver.

Release v1.3.5 🚀

25 May 04:11
  • Optimize CPU RAM usage during quantization with offloading.
  • Support FP8 conversion for DBRX, Mixtral, and Command R+.

Release v1.3.4 🚀

02 Apr 03:22
  • Hotfix for LoRA checkpoint saving error.

Release v1.3.3 🚀

01 Apr 05:35

New Features

  • FP8 Checkpoint Conversion: Added support for converting checkpoints to FP8.
  • Sharded Safetensors Checkpoint Saving: Added the ability to save sharded safetensors checkpoints.
  • LoRA Support on Mistral Model: Added support for LoRA (Low-Rank Adaptation) on the Mistral model.

Bug Fixes

  • BF16 Hotfix: Addressed an urgent issue with bf16 processing.
  • BFloat Safetensors Conversion: Fixed an issue related to bfloat conversion for safetensors.
  • Automatic Token Refresh: Resolved a bug affecting automatic token refresh.

Release v1.3.2 🚀

26 Mar 14:45
  • Add base_model_name_or_path option to friendli model convert-adapter.
  • Remove stale dependencies.

Release v1.3.1 🚀

25 Mar 10:54
  • Update protobuf schema.
  • Patch sending API requests with content type application/protobuf.

Release v1.3.0 🚀

23 Mar 16:10
  • Resources of Friendli Dedicated Endpoints can now be managed with the CLI and SDK. The available resources are endpoint, model, team, and project.
  • CLI login is now available, including SSO login.
  • Updated Multi-LoRA checkpoint conversion.