update

2025-02-09 17:17:39 +08:00 · 2025-02-09 17:17:39 +08:00 · 2421e36986
commit 2421e36986
parent 2c979215be
3 changed files with 171 additions and 232 deletions
--- a/README.md
+++ b/README.md
@ -1,234 +1,3 @@
 ## chat.guanjihuan.com

-本仓库主要记录这篇博文中的代码：https://www.guanjihuan.com/archives/38502 。
-
-这里把 https://chat.guanjihuan.com 的主要实现代码进行开源。代码参考各个开源大模型的 GitHub 或 HuggingFace 主页、大语言模型的 API 官网，以及 HuggingFace 和 Pytorch 的文档等。
-
-硬件要求：如果是本地 GPU 运行模型，还需要 Nvidia 显卡，至少 6G 显存。说明：这里只测试了几个模型，还有更多开源大模型，感兴趣的可以自行测试。通常，8G 显存的显卡可以量化地加载 7B 左右的模型（70亿参数）；16G 显存的显卡可以完整加载 7B 左右的模型（70亿参数）或量化地加载 14B 左右的模型（140亿参数）；更大参数空间的模型的运行需要更大显存的显卡。开源大模型的排行榜有：
-
-+ https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
-+ https://cevalbenchmark.com/static/leaderboard.html
-+ https://opencompass.org.cn/leaderboard-llm
-+ https://www.superclueai.com
-
-### 一、基础环境
-
-运行这里的代码需要安装 Python 环境，可以选择安装 Anaconda：https://www.anaconda.com 。
-
-Web 框架是使用 Streamlit：https://streamlit.io 、https://github.com/streamlit/streamlit 。
-
-Streamlit 的安装：
-
-```
-pip install streamlit
-```
-
-运行命令：
-
-```
-streamlit run web_demo.py
-```
-
-或
-
-```
-python -m streamlit run web_demo.py
-```
-
-如果是在公网IP下访问，并指定8501端口和黑色主题，那么运行命令为：
-
-```
-streamlit run web_demo.py --theme.base dark --server.port 8501 --server.address 0.0.0.0 
-```
-
-如果是本地运行开源大语言模型，为了防止一些不必要的报错，可以更新一下操作系统的显卡驱动并重启：
-
-```
-sudo apt-get update
-
-sudo apt-get install ubuntu-drivers-common
-
-sudo ubuntu-drivers autoinstall
-```
-
-此外，更新一下 Pytorch（ [https://pytorch.org/get-started/locally/](https://pytorch.org/get-started/locally/) ）也可以防止一些报错：
-
-```
-conda install pytorch torchvision torchaudio pytorch-cuda=11.8 -c pytorch -c nvidia
-```
-
-### 二、本地运行开源大语言模型
-
-#### 1. 开源模型 ChatGLM
-
-ChatGLM3-6B 主页：https://github.com/THUDM/ChatGLM3 。 安装该模型依赖的环境：
-
-```
-pip install -r requirements.txt
-```
-
-模型文件下载：https://huggingface.co/THUDM/chatglm3-6b-32k ，放在目录 THUDM/chatglm3-6b-32k 下。
-
-显存/内存要求：量化加载大概要 6G 显存；默认加载大概需要 13G 显存；CPU加载大概需要 25G 内存。
-
-运行命令：
-
-```
-python -m streamlit run ./ChatGLM.py --theme.base dark --server.port 8501
-```
-
-如果量化加载时 bitsandbytes 报错，那么安装该软件包：pip install bitsandbytes
-
-#### 2. 开源模型 Qwen
-
-Qwen 主页：https://github.com/QwenLM/Qwen 。 安装该模型依赖的环境：
-
-```
-pip install -r requirements.txt
-```
-
-Qwen-7B-Chat-Int4 模型文件下载：https://huggingface.co/Qwen/Qwen-7B-Chat-Int4 ，放在目录 Qwen/Qwen-7B-Chat-Int4 下。
-
-Qwen-14B-Chat-Int4 模型文件下载：https://huggingface.co/Qwen/Qwen-14B-Chat-Int4 ，放在目录 Qwen/Qwen-14B-Chat-Int4 下。
-
-显存要求：Qwen-7B-Chat-Int4 大概需要 8G 显存；Qwen-14B-Chat-Int4 大概需要 12G 显存。
-
-运行命令：
-
-```
-python -m streamlit run ./Qwen.py --theme.base dark --server.port 8501
-```
-
-此外，如果运行有报错，可能还需要安装：
-
-```
-pip install optimum
-pip install auto-gptq
-pip install --upgrade s3fs aiobotocore botocore
-```
-
-#### 3. 开源模型 InternLM
-
-InternLM 主页：https://github.com/InternLM/InternLM 。运行代码时，需要调用其中的 tools 文件夹。
-
-internlm-chat-7b 模型文件下载：https://huggingface.co/internlm/internlm-chat-7b ，放在 internlm/internlm-chat-7b 目录下。说明：提供的代码是加载 internlm-chat-7b 模型， 目前已经有 internlm2-chat-7b 模型，但个人还未测试。internlm2-chat-7b 模型文件下载：https://huggingface.co/internlm/internlm2-chat-7b
-
-显存要求：大概需要 7G 的显存。
-
-运行命令：
-
-```
-python -m streamlit run ./InternLM.py --theme.base dark --server.port 8501
-```
-
-#### 4. 开源模型 使用 Ollama 调用 llama3.2
-
-Ollama 部分参考这篇：https://www.guanjihuan.com/archives/43861
-
-这里的代码给出了具体的 Streamlit 的实现。
-
-#### 5. ……
-
-### 三、使用大语言模型 API
-
-#### 1. 智谱 - ChatGLM_Turbo
-
-智谱 API key 获取（收费，可免费试用）：https://maas.aminer.cn
-
-运行命令：
-
-```
-python -m streamlit run ./ChatGLM_Turbo.py --theme.base dark --server.port 8501
-```
-
-说明：当前代码只对 pip install zhipuai==1.0.7 有效，对最新版本不兼容。另外，早期使用的模型调用是 model="chatglm_turbo"， 官网文档最新的模型是 model="glm-3-turbo"。工单客服回复内容为：“chatglm_turbo与glm-3-turbo是不同的模型，glm-3-turbo理论上能力优于chatglm_turbo，且价格更便宜”。个人推荐的是 glm-4-flash 模型。
-
-#### 2. 阿里 - Qwen_Turbo
-
-阿里 API key 获取（有的收费，有的免费）：https://dashscope.aliyun.com
-
-运行命令：
-
-```
-python -m streamlit run ./Qwen_Turbo.py --theme.base dark --server.port 8501
-```
-
-需要安装软件包：pip install dashscope
-
-#### 3. 腾讯 - 混元大模型
-
-腾讯 API key 获取（有的收费，有的免费）：https://cloud.tencent.com/product/hunyuan
-
-运行命令：
-```
-python -m streamlit run ./Hunyuan_Lite.py --theme.base dark --server.port 8501
-```
-
-#### 4. 讯飞 - 星火大模型
-
-讯飞 API key 获取（有的收费，有的免费）：https://xinghuo.xfyun.cn
-
-运行命令：
-
-```
-python -m streamlit run ./星火大模型.py --theme.base dark --server.port 8501
-```
-
-#### 5. 百度 - ERNIE_Speed_128K
-
-百度千帆大模型平台 API key 获取（有的收费，有的免费）：https://console.bce.baidu.com/qianfan/overview
-
-运行命令：
-
-```
-python -m streamlit run ./ERNIE_Speed_128K.py --theme.base dark --server.port 8501
-```
-
-#### 6. 零一万物 - Yi_Spark
-
-零一万物大模型开放平台（有免费额度）：https://platform.lingyiwanwu.com
-
-需要安装 OpenAI 软件包：
-
-```
-pip install openai
-```
-
-运行命令：
-
-```
-python -m streamlit run ./Yi_Spark.py --theme.base dark --server.port 8501
-```
-
-#### 7. 火山引擎 - Doubao_lite_32k
-
-豆包大模型 - 火山引擎（有免费额度）：https://www.volcengine.com/product/doubao
-
-需要安装：
-
-```
-pip install volcengine-python-sdk
-```
-
-运行命令：
-```
-python -m streamlit run ./Doubao_lite_32k.py --theme.base dark --server.port 8501
-```
-
-#### 8. OpenAI - GPT_3.5_Turbo
-
-OpenAI 的 API 接口（需要海外的 IP 地址以及海外银行卡）：https://platform.openai.com
-
-需要安装 OpenAI 软件包：
-
-```
-pip install openai
-```
-
-运行命令：
-
-```
-python -m streamlit run ./GPT_3.5_Turbo.py --theme.base dark --server.port 8501
-```
-
-#### 9. ……
+本仓库记录这篇博文中的代码：https://www.guanjihuan.com/archives/38502
--- a/DeepSeek_Chat/DeepSeek_Chat.py
+++ b/DeepSeek_Chat/DeepSeek_Chat.py
@ -0,0 +1,78 @@
+"""
+This code is supported by the website: https://www.guanjihuan.com
+The newest version of this code is on the web page: https://www.guanjihuan.com/archives/38502
+"""
+
+import streamlit as st
+st.set_page_config(
+    page_title="Chat",
+    layout='wide'
+)
+
+import openai
+API_BASE = "https://api.deepseek.com"
+API_KEY = "your key"
+
+
+with st.sidebar:
+    with st.expander('参数', expanded=True):
+        top_p = st.slider('top_p', 0.01, 1.0, step=0.01, value=0.8, key='top_p_session')
+        temperature = st.slider('temperature', 0.51, 1.0, step=0.01, value=0.85, key='temperature_session') 
+        def reset_parameter():
+            st.session_state['top_p_session'] = 0.8
+            st.session_state['temperature_session'] = 0.85
+        reset_parameter_button = st.button('重置', on_click=reset_parameter)
+
+prompt = st.chat_input("在这里输入您的命令")
+
+def clear_all():
+    st.session_state.messages = []
+    st.session_state.ai_response = []
+
+if 'messages' not in st.session_state:
+    st.session_state.messages = []
+if 'ai_response' not in st.session_state:
+    st.session_state.ai_response = []
+
+for ai_response in st.session_state.ai_response:
+    with st.chat_message(ai_response["role"], avatar=ai_response.get("avatar")):
+        st.markdown(ai_response["content"])
+
+prompt_placeholder = st.chat_message("user", avatar='user')
+with st.chat_message("robot", avatar="assistant"):
+    message_placeholder = st.empty()
+
+def response_of_deepseek_chat(prompt):
+    st.session_state.messages.append({'role': 'user', 'content': prompt})
+    client = openai.OpenAI(
+        api_key=API_KEY,
+        base_url=API_BASE
+    )
+    completion = client.chat.completions.create(
+        model="deepseek-chat",
+        messages=st.session_state.messages,
+        stream=True,
+        temperature=temperature, 
+        top_p=top_p,
+    )
+    full_content = ''
+    for chunk in completion:
+        response = chunk.choices[0].delta.content or ""
+        full_content += response
+        message_placeholder.markdown(full_content)
+        if stop_button:
+            break
+    st.session_state.messages.append({'role': 'assistant',
+                        'content': full_content})
+    st.session_state.ai_response.append({"role": "robot", "content": full_content, "avatar": "assistant"})
+    return full_content
+    
+if prompt:
+    prompt_placeholder.markdown(prompt)
+    st.session_state.ai_response.append({"role": "user", "content": prompt, "avatar": 'user'})
+    stop = st.empty()
+    stop_button = stop.button('停止', key='break_response')
+    response_of_deepseek_chat(prompt)
+    stop.empty()
+button_clear = st.button("清空", on_click=clear_all, key='clear')
+  
--- a/DeepSeek_Reasoner/DeepSeek_Reasoner.py
+++ b/DeepSeek_Reasoner/DeepSeek_Reasoner.py
@ -0,0 +1,92 @@
+"""
+This code is supported by the website: https://www.guanjihuan.com
+The newest version of this code is on the web page: https://www.guanjihuan.com/archives/38502
+"""
+
+import streamlit as st
+st.set_page_config(
+    page_title="Chat",
+    layout='wide'
+)
+
+import openai
+API_BASE = "https://api.deepseek.com"
+API_KEY = "your key"
+
+
+with st.sidebar:
+    with st.expander('参数', expanded=True):
+        top_p = st.slider('top_p', 0.01, 1.0, step=0.01, value=0.8, key='top_p_session')
+        temperature = st.slider('temperature', 0.51, 1.0, step=0.01, value=0.85, key='temperature_session') 
+        def reset_parameter():
+            st.session_state['top_p_session'] = 0.8
+            st.session_state['temperature_session'] = 0.85
+        reset_parameter_button = st.button('重置', on_click=reset_parameter)
+
+prompt = st.chat_input("在这里输入您的命令")
+
+def clear_all():
+    st.session_state.messages = []
+    st.session_state.ai_response = []
+
+if 'messages' not in st.session_state:
+    st.session_state.messages = []
+if 'ai_response' not in st.session_state:
+    st.session_state.ai_response = []
+
+for ai_response in st.session_state.ai_response:
+    with st.chat_message(ai_response["role"], avatar=ai_response.get("avatar")):
+        st.markdown(ai_response["content"])
+
+prompt_placeholder = st.chat_message("user", avatar='user')
+with st.chat_message("robot", avatar="assistant"):
+    message_placeholder = st.empty()
+
+def response_of_deepseek_chat(prompt):
+    st.session_state.messages.append({'role': 'user', 'content': prompt})
+    client = openai.OpenAI(
+        api_key=API_KEY,
+        base_url=API_BASE
+    )
+    completion = client.chat.completions.create(
+        model="deepseek-reasoner",
+        messages=st.session_state.messages,
+        stream=True,
+        temperature=temperature, 
+        top_p=top_p,
+    )
+    full_content = ''
+    all_full_content = ''
+    think_or_not = 1
+    answer_or_not = 1
+    for chunk in completion:
+        response = chunk.choices[0].delta.content or ""
+        reasoning_content = chunk.choices[0].delta.reasoning_content
+        if response == None:
+            if think_or_not == 1:
+                all_full_content += '[开始思考]\n\n'
+                think_or_not = 0
+            all_full_content += reasoning_content
+        else:
+            if answer_or_not == 1:
+                all_full_content += '\n\n[结束思考]\n\n'
+                answer_or_not = 0
+            all_full_content += response
+            full_content += response
+        message_placeholder.markdown(all_full_content)
+        if stop_button:
+            break
+    st.session_state.messages.append({'role': 'assistant',
+                        'content': full_content})
+    st.session_state.ai_response.append({"role": "robot", "content": all_full_content, "avatar": "assistant"})
+    return all_full_content
+    
+if prompt:
+    prompt_placeholder.markdown(prompt)
+    st.session_state.ai_response.append({"role": "user", "content": prompt, "avatar": 'user'})
+    stop = st.empty()
+    stop_button = stop.button('停止', key='break_response')
+    response_of_deepseek_chat(prompt)
+    stop.empty()
+button_clear = st.button("清空", on_click=clear_all, key='clear')
+