【 InternLM 大模型开源社区第三期（夏季闯关）】进阶岛第6关

随笔7个月前发布安慕希酸奶

74 0 0

MindSearch 快速部署

基础任务（完成此任务即完成闯关）

按照教程，将 MindSearch 部署到 HuggingFace 美化 Gradio 的界面，并提供截图和 Hugging Face 的Space的链接。

获取硅基流动 API Key

因为要使用硅基流动的 API Key，所以接下来便是注册并获取 API Key 了。

首先，我们打开 https://account.siliconflow.cn/login 来注册硅基流动的账号（如果注册过，则直接登录即可）。

在完成注册后，打开 https://cloud.siliconflow.cn/account/ak 来准备 API Key。首先创建新 API 密钥，然后点击密钥进行复制，以备后续使用。

部署到 HuggingFace Space

最后，我们来将 MindSearch 部署到 HuggingFace Space。

我们首先打开 https://huggingface.co/spaces ，并点击 Create new Space，如下图所示。
【 InternLM 大模型开源社区第三期（夏季闯关）】进阶岛第6关

在输入 Space name 并选择 License 后，选择配置如下所示。

【 InternLM 大模型开源社区第三期（夏季闯关）】进阶岛第6关

然后，我们进入 Settings，配置硅基流动的 API Key。如下图所示。

【 InternLM 大模型开源社区第三期（夏季闯关）】进阶岛第6关

选择 New secrets，name 一栏输入 SILICON_API_KEY，value 一栏输入你的 API Key 的内容。

最后，我们先新建一个目录，准备提交到 HuggingFace Space 的全部文件。

# 创建新目录
mkdir -p /workspaces/mindsearch/mindsearch_deploy
# 准备复制文件
cd /workspaces/mindsearch
cp -r /workspaces/mindsearch/MindSearch/mindsearch /workspaces/mindsearch/mindsearch_deploy
cp /workspaces/mindsearch/MindSearch/requirements.txt /workspaces/mindsearch/mindsearch_deploy
# 创建 app.py 作为程序入口
touch /workspaces/mindsearch/mindsearch_deploy/app.py
1
2
3
4
5
6
7
8

其中，app.py 的内容如下：

import json
import os

import gradio as gr
import requests
from lagent.schema import AgentStatusCode

os.system("python -m mindsearch.app --lang cn --model_format internlm_silicon &")

PLANNER_HISTORY = []
SEARCHER_HISTORY = []


def rst_mem(history_planner: list, history_searcher: list):
    '''
    Reset the chatbot memory.
    '''
    history_planner = []
    history_searcher = []
    if PLANNER_HISTORY:
        PLANNER_HISTORY.clear()
    return history_planner, history_searcher


def format_response(gr_history, agent_return):
    if agent_return['state'] in [
            AgentStatusCode.STREAM_ING, AgentStatusCode.ANSWER_ING
    ]:
        gr_history[-1][1] = agent_return['response']
    elif agent_return['state'] == AgentStatusCode.PLUGIN_START:
        thought = gr_history[-1][1].split('```')[0]
        if agent_return['response'].startswith('```'):
            gr_history[-1][1] = thought + '
' + agent_return['response']
    elif agent_return['state'] == AgentStatusCode.PLUGIN_END:
        thought = gr_history[-1][1].split('```')[0]
        if isinstance(agent_return['response'], dict):
            gr_history[-1][
                1] = thought + '
' + f'```json
{json.dumps(agent_return["response"], ensure_ascii=False, indent=4)}
```'  # noqa: E501
    elif agent_return['state'] == AgentStatusCode.PLUGIN_RETURN:
        assert agent_return['inner_steps'][-1]['role'] == 'environment'
        item = agent_return['inner_steps'][-1]
        gr_history.append([
            None,
            f"```json
{json.dumps(item['content'], ensure_ascii=False, indent=4)}
```"
        ])
        gr_history.append([None, ''])
    return


def predict(history_planner, history_searcher):

    def streaming(raw_response):
        for chunk in raw_response.iter_lines(chunk_size=8192,
                                             decode_unicode=False,
                                             delimiter=b'
'):
            if chunk:
                decoded = chunk.decode('utf-8')
                if decoded == '
':
                    continue
                if decoded[:6] == 'data: ':
                    decoded = decoded[6:]
                elif decoded.startswith(': ping - '):
                    continue
                response = json.loads(decoded)
                yield (response['response'], response['current_node'])

    global PLANNER_HISTORY
    PLANNER_HISTORY.append(dict(role='user', content=history_planner[-1][0]))
    new_search_turn = True

    url = 'http://localhost:8002/solve'
    headers = {'Content-Type': 'application/json'}
    data = {'inputs': PLANNER_HISTORY}
    raw_response = requests.post(url,
                                 headers=headers,
                                 data=json.dumps(data),
                                 timeout=20,
                                 stream=True)

    for resp in streaming(raw_response):
        agent_return, node_name = resp
        if node_name:
            if node_name in ['root', 'response']:
                continue
            agent_return = agent_return['nodes'][node_name]['detail']
            if new_search_turn:
                history_searcher.append([agent_return['content'], ''])
                new_search_turn = False
            format_response(history_searcher, agent_return)
            if agent_return['state'] == AgentStatusCode.END:
                new_search_turn = True
            yield history_planner, history_searcher
        else:
            new_search_turn = True
            format_response(history_planner, agent_return)
            if agent_return['state'] == AgentStatusCode.END:
                PLANNER_HISTORY = agent_return['inner_steps']
            yield history_planner, history_searcher
    return history_planner, history_searcher


with gr.Blocks() as demo:
    gr.HTML("""<h1 align="center">MindSearch Gradio Demo</h1>""")
    gr.HTML("""<p style="text-align: center; font-family: Arial, sans-serif;">MindSearch is an open-source AI Search Engine Framework with Perplexity.ai Pro performance. You can deploy your own Perplexity.ai-style search engine using either closed-source LLMs (GPT, Claude) or open-source LLMs (InternLM2.5-7b-chat).</p>""")
    gr.HTML("""
    <div style="text-align: center; font-size: 16px;">
        <a href="https://github.com/InternLM/MindSearch" style="margin-right: 15px; text-decoration: none; color: #4A90E2;">🔗 GitHub</a>
        <a href="https://arxiv.org/abs/2407.20183" style="margin-right: 15px; text-decoration: none; color: #4A90E2;">📄 Arxiv</a>
        <a href="https://huggingface.co/papers/2407.20183" style="margin-right: 15px; text-decoration: none; color: #4A90E2;">📚 Hugging Face Papers</a>
        <a href="https://huggingface.co/spaces/internlm/MindSearch" style="text-decoration: none; color: #4A90E2;">🤗 Hugging Face Demo</a>
    </div>
    """)
    with gr.Row():
        with gr.Column(scale=10):
            with gr.Row():
                with gr.Column():
                    planner = gr.Chatbot(label='planner',
                                         height=700,
                                         show_label=True,
                                         show_copy_button=True,
                                         bubble_full_width=False,
                                         render_markdown=True)
                with gr.Column():
                    searcher = gr.Chatbot(label='searcher',
                                          height=700,
                                          show_label=True,
                                          show_copy_button=True,
                                          bubble_full_width=False,
                                          render_markdown=True)
            with gr.Row():
                user_input = gr.Textbox(show_label=False,
                                        placeholder='帮我搜索一下 InternLM 开源体系',
                                        lines=5,
                                        container=False)
            with gr.Row():
                with gr.Column(scale=2):
                    submitBtn = gr.Button('Submit')
                with gr.Column(scale=1, min_width=20):
                    emptyBtn = gr.Button('Clear History')

    def user(query, history):
        return '', history + [[query, '']]

    submitBtn.click(user, [user_input, planner], [user_input, planner],
                    queue=False).then(predict, [planner, searcher],
                                      [planner, searcher])
    emptyBtn.click(rst_mem, [planner, searcher], [planner, searcher],
                   queue=False)

demo.queue()
demo.launch(server_name='0.0.0.0',
            server_port=7860,
            inbrowser=True,
            share=True)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154

在最后，将 /root/mindsearch/mindsearch_deploy 目录下的文件（使用 git）提交到 HuggingFace Space 即可完成部署了。注意将代码提交到huggingface space中需要配置hugginface的token。

上传huggingface

这里主要记录一下教程中没有介绍的huggingface配置git的过程
1.打开settings
【 InternLM 大模型开源社区第三期（夏季闯关）】进阶岛第6关
2.新建一个许可为write的token

3.根据网站内容配置git
https://huggingface.co/blog/zh/password-git-deprecation

# 全部修改为自己的参数
git remote set-url origin https://<user_name>:<token>@huggingface.co/<user_name>/<repo_name>
1
2

4.后续克隆上传项目
可参考这篇博客https://blog.csdn.net/wwwzhouhui/article/details/141500252

demo截图

demo项目网址：https://huggingface.co/spaces/ylzt/mindsearch
【 InternLM 大模型开源社区第三期（夏季闯关）】进阶岛第6关

参考资料

书生大模型实战营「第3期」

随笔

特别提醒: 内容为用户自行发布,如有侵权,请联系我们管理员删除,邮箱:mail@xieniao.com ,在收到您的邮件后我们会在3个工作日内处理。

后台管理非常合理有序，还有各类产品供用户选择

随笔

7个月前

0430

流量访客亲们每天能达到多少？？ – 淘宝天猫

随笔

1年前

01490

Linux服务器之间设置ssh免密登录

随笔

7个月前

0510

使用Origin绘制柱状图(入门)

随笔

7个月前

0750

暂无评论

您必须登录才能参与评论！

立即登录

暂无评论...

【 InternLM 大模型开源社区第三期（夏季闯关）】进阶岛第6关

MindSearch 快速部署

目录

基础任务（完成此任务即完成闯关）

获取硅基流动 API Key

部署到 HuggingFace Space

上传huggingface

demo截图

参考资料

10个Python保护代码和数据的方法

华为开源自研AI框架昇思MindSpore应用案例：ICT实现图像修复

相关文章

后台管理非常合理有序，还有各类产品供用户选择

流量访客亲们每天能达到多少？？ – 淘宝天猫

Linux服务器之间设置ssh免密登录

使用Origin绘制柱状图(入门)

暂无评论

热门网站

爱奇艺

Wicked Backgrounds

好看的韩国漫画_韩漫在线免费阅读-汗汗漫画网

NBA直播

YY影院

吃瓜网 – 吃瓜群众的吃瓜网站

热门文章

Vue+Express全栈开发项目实战技能：‌从0到1打造完整电商项目

[端游] 传奇单机版C#韩版五职业带英雄游戏服务端

盐酸奥洛他定滴眼液（齐必妥）

淘宝店铺可以对外出租吗？有什么手续？ – 淘宝天猫

有谁知道知识产权的图标具体是什么样的？ – 淘宝天猫

立思丁（夫西地酸乳膏）

复方氨酚苯海拉明片（白城亿正）

一个手机号能否注册两个小红书账号？深度解析与操作指南

【 InternLM 大模型开源社区 第三期（夏季闯关）】进阶岛第6关

MindSearch 快速部署

目录

基础任务（完成此任务即完成闯关）

获取硅基流动 API Key

部署到 HuggingFace Space

上传huggingface

demo截图

参考资料

10个Python保护代码和数据的方法

华为开源自研AI框架昇思MindSpore应用案例：ICT实现图像修复

相关文章

热门网站

爱奇艺

Wicked Backgrounds

好看的韩国漫画_韩漫在线免费阅读-汗汗漫画网

NBA直播

YY影院

吃瓜网 – 吃瓜群众的吃瓜网站

热门文章

标签云

【 InternLM 大模型开源社区第三期（夏季闯关）】进阶岛第6关