Integrating the Free EdgeTTS into Spring Boot for Text-to-Speech

Updated: 2025-10-20 08:27:21 Author: 程序猿DD

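Introduction

Many applications need text-to-speech (TTS): voice assistants, voice notifications, content narration, and so on. The Java ecosystem lacks an Edge TTS client library comparable to the one available for Python. That gap can now be closed by calling the free EdgeTTS capability through the API provided by UnifiedTTS. UnifiedTTS also supports other models such as Azure TTS, MiniMax TTS, and Elevenlabs TTS; because it wraps them behind one abstract request interface, switching between models and voices is straightforward.

With the free EdgeTTS as the target, the rest of this article builds a Spring Boot application that includes a text-to-speech feature.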

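Hands-on

1. Build a Spring Boot application

Create a basic Spring Boot project with start.spring.io or any other tool, and add whatever dependencies your application needs. For example, if you plan to expose the feature through an HTTP endpoint, add the web module: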

<dependencies>
    <dependency>
        <groupId>org.springframework.boot</groupId>
        <artifactId>spring-boot-starter-web</artifactId>
    </dependency>
</dependencies>

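2. Register with UnifiedTTS and get an API key

Go to the UnifiedTTS website and register an account (signing in with GitHub is enough).
Open the "API Keys" page from the left-hand menu and create an API key.
Store the API key somewhere safe; it is needed in the following steps.

3. Integrate the UnifiedTTS API

Based on the API documentation at https://unifiedtts.com/zh/api-docs/tts-sync, the following is a runnable reference implementation consisting of a configuration file, request and response models, a service class, and a unit test.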

3.1 Configuration file (application.properties)

unified-tts.host=https://unifiedtts.com
unified-tts.api-key=your-api-key-here

Remember to replace the value of unified-tts.api-key with the API key you created earlier.
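A forgotten placeholder only shows up as a failed request at call time, so it can help to reject it early. The following is a minimal JDK-only sketch of such a check, not part of the original project: the class name ApiKeyCheck and its methods are hypothetical, and the properties text is parsed from a string to stand in for application.properties.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.Properties;

public class ApiKeyCheck {

    // The placeholder value used in the sample application.properties.
    public static final String PLACEHOLDER = "your-api-key-here";

    /** Returns the configured key, or throws if it is missing, blank, or still the placeholder. */
    public static String requireApiKey(Properties props) {
        String key = props.getProperty("unified-tts.api-key");
        if (key == null || key.isBlank() || PLACEHOLDER.equals(key.trim())) {
            throw new IllegalStateException("unified-tts.api-key is not configured");
        }
        return key.trim();
    }

    /** Parses properties text; stands in for reading application.properties. */
    public static Properties parse(String text) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(text));
        } catch (IOException e) {
            throw new IllegalStateException(e);
        }
        return props;
    }

    public static void main(String[] args) {
        Properties props = parse(
                "unified-tts.host=https://unifiedtts.com\n"
                        + "unified-tts.api-key=abc123\n"); // abc123 is a made-up key
        System.out.println(requireApiKey(props));
    }
}
```

In a Spring Boot application the same idea could live in a startup check on the bound properties object; that wiring is left out here.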

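3.2 Configuration properties class

This class and the model classes below use Lombok's @Data, so the project also needs the Lombok dependency.

import lombok.Data;
import org.springframework.boot.context.properties.ConfigurationProperties;

// Binds unified-tts.host and unified-tts.api-key from application.properties.
// The class must be registered, for example with @ConfigurationPropertiesScan or
// @EnableConfigurationProperties(UnifiedTtsProperties.class) on the application class.
@Data
@ConfigurationProperties(prefix = "unified-tts")
public class UnifiedTtsProperties {

    private String host;
    private String apiKey;

}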

3.3 Request and response models

@Data
@AllArgsConstructor
@NoArgsConstructor
public class UnifiedTtsRequest {
    
    private String model;
    private String voice;
    private String text;
    private Double speed;
    private Double pitch;
    private Double volume;
    private String format;

}

@Data
@AllArgsConstructor
@NoArgsConstructor
public class UnifiedTtsResponse {

    private boolean success;
    private String message;
    private long timestamp;
    private UnifiedTtsResponseData data;

    @Data
    @AllArgsConstructor
    @NoArgsConstructor
    public static class UnifiedTtsResponseData {
        @JsonProperty("request_id")
        private String requestId;

        @JsonProperty("audio_url")
        private String audioUrl;

        @JsonProperty("file_size")
        private long fileSize;
    }
}

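UnifiedTTS abstracts the requests of the different models, so one set of request parameters can drive any supported TTS model. This noticeably simplifies client code, which is why UnifiedTTS is recommended here.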
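To make the "one set of parameters" idea concrete, here is a small JDK-only sketch that mirrors the fields of UnifiedTtsRequest and prints the JSON body for two voices. It is an illustration under assumptions: the class TtsPayloadSketch is hypothetical, the JSON is hand-rolled only so the example has no dependencies (the Spring service lets Jackson serialize the request), and zh-CN-XiaoxiaoNeural is assumed to be an available voice name rather than taken from the article.

```java
import java.util.Locale;

public class TtsPayloadSketch {

    /** Mirrors the fields of UnifiedTtsRequest from the article. */
    public record Payload(String model, String voice, String text,
                          double speed, double pitch, double volume, String format) {

        /** Returns a copy that differs only in the voice. */
        public Payload withVoice(String newVoice) {
            return new Payload(model, newVoice, text, speed, pitch, volume, format);
        }

        /** Hand-rolled JSON for illustration only. */
        public String toJson() {
            return String.format(Locale.ROOT,
                    "{\"model\":\"%s\",\"voice\":\"%s\",\"text\":\"%s\","
                            + "\"speed\":%.1f,\"pitch\":%.1f,\"volume\":%.1f,\"format\":\"%s\"}",
                    escape(model), escape(voice), escape(text),
                    speed, pitch, volume, escape(format));
        }
    }

    /** Escapes backslashes and double quotes; control characters are not handled. */
    public static String escape(String s) {
        return s.replace("\\", "\\\\").replace("\"", "\\\"");
    }

    public static void main(String[] args) {
        Payload english = new Payload("edge-tts", "en-US-JennyNeural", "Hello", 1.0, 1.0, 1.0, "mp3");
        // Only the voice changes; every other field is reused as-is.
        Payload chinese = english.withVoice("zh-CN-XiaoxiaoNeural"); // assumed voice name
        System.out.println(english.toJson());
        System.out.println(chinese.toJson());
    }
}
```

Switching to a different model would, under the same assumption, only mean changing the model and voice fields while the rest of the client code stays untouched.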

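3.4 Service implementation (calling UnifiedTTS)

The service class is built on RestClient, the HTTP client that ships with Spring Boot 3.2 and later, and offers two methods:

synthesize: receives the audio bytes and returns them.
synthesizeToFile: calls synthesize and writes the audio to a file.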

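import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

import org.springframework.http.MediaType;
import org.springframework.http.ResponseEntity;
import org.springframework.stereotype.Service;
import org.springframework.web.client.RestClient;

@Service
public class UnifiedTtsService {

    private final RestClient restClient;
    private final UnifiedTtsProperties properties;

    // Spring Boot auto-configures a RestClient.Builder rather than a RestClient,
    // so the client is built here with the configured host as its base URL.
    public UnifiedTtsService(RestClient.Builder restClientBuilder, UnifiedTtsProperties properties) {
        this.restClient = restClientBuilder.baseUrl(properties.getHost()).build();
        this.properties = properties;
    }

    /**
     * Calls the UnifiedTTS synchronous TTS endpoint and returns the audio bytes.
     *
     * <p>Request headers:
     * <ul>
     *   <li>Content-Type: application/json</li>
     *   <li>X-API-Key: the API key from the configuration</li>
     *   <li>Accept: a binary stream or the common mp3/mpeg audio types</li>
     * </ul>
     *
     * @param request model, voice, text, speed/pitch/volume, output format, and so on
     * @return the audio as raw bytes (for example mp3)
     * @throws IllegalStateException if the response has no body or a non-2xx status that
     *         RestClient did not already reject (by default retrieve() throws
     *         RestClientResponseException for 4xx and 5xx responses)
     */
    public byte[] synthesize(UnifiedTtsRequest request) {
        ResponseEntity<byte[]> response = restClient
                .post()
                .uri("/api/v1/common/tts-sync")
                .contentType(MediaType.APPLICATION_JSON)
                .accept(MediaType.APPLICATION_OCTET_STREAM, MediaType.valueOf("audio/mpeg"), MediaType.valueOf("audio/mp3"))
                .header("X-API-Key", properties.getApiKey())
                .body(request)
                .retrieve()
                .toEntity(byte[].class);

        if (response.getStatusCode().is2xxSuccessful() && response.getBody() != null) {
            return response.getBody();
        }
        throw new IllegalStateException("UnifiedTTS synthesize failed: " + response.getStatusCode());
    }

    /**
     * Synthesizes the audio and writes it to the given file.
     *
     * <p>If the parent directory of the output path does not exist, it is created;
     * a runtime exception is thrown on failure.
     *
     * @param request TTS request parameters
     * @param outputPath target file path (for example output.mp3)
     * @return the path that was actually written
     */
    public Path synthesizeToFile(UnifiedTtsRequest request, Path outputPath) {
        byte[] data = synthesize(request);
        try {
            if (outputPath.getParent() != null) {
                Files.createDirectories(outputPath.getParent());
            }
            Files.write(outputPath, data);
            return outputPath;
        } catch (IOException e) {
            throw new RuntimeException("Failed to write TTS output to file: " + outputPath, e);
        }
    }
}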
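For readers who want to see the shape of the HTTP request without Spring, the sketch below builds the equivalent request with the JDK's java.net.http API. It only constructs the request and does not send it. The class name TtsHttpRequestSketch and the 30-second timeout are assumptions for illustration; the path and header names are the ones used by the service.

```java
import java.net.URI;
import java.net.http.HttpRequest;
import java.nio.charset.StandardCharsets;
import java.time.Duration;

public class TtsHttpRequestSketch {

    /** Builds (but does not send) a POST request equivalent to the one the service issues. */
    public static HttpRequest build(String host, String apiKey, String jsonBody) {
        return HttpRequest.newBuilder()
                .uri(URI.create(host + "/api/v1/common/tts-sync"))
                .timeout(Duration.ofSeconds(30)) // assumed timeout, not from the article
                .header("Content-Type", "application/json")
                .header("Accept", "application/octet-stream, audio/mpeg, audio/mp3")
                .header("X-API-Key", apiKey)
                .POST(HttpRequest.BodyPublishers.ofString(jsonBody, StandardCharsets.UTF_8))
                .build();
    }

    public static void main(String[] args) {
        HttpRequest request = build("https://unifiedtts.com", "your-api-key-here", "{}");
        System.out.println(request.method() + " " + request.uri());
        // To send it, something like the following could be used:
        // HttpClient.newHttpClient().send(request, HttpResponse.BodyHandlers.ofByteArray());
    }
}
```

Keeping request construction separate from sending makes the headers and URL easy to check in a unit test that never touches the network.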

3.5 Unit test

import static org.junit.jupiter.api.Assertions.assertNotNull;
import static org.junit.jupiter.api.Assertions.assertTrue;

import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

import org.junit.jupiter.api.Test;
import org.springframework.beans.factory.annotation.Autowired;
import org.springframework.boot.test.context.SpringBootTest;

@SpringBootTest
class UnifiedTtsServiceTest {

    @Autowired
    private UnifiedTtsService unifiedTtsService;

    @Test
    void testRealSynthesizeAndDownloadToFile() throws Exception {
        UnifiedTtsRequest req = new UnifiedTtsRequest(
            "edge-tts",
            "en-US-JennyNeural",
            "Hello, this is a test of text to speech synthesis.",
            1.0,
            1.0,
            1.0,
            "mp3"
        );

        // Call the real API. synthesize returns the raw audio bytes (byte[]),
        // so the assertions check the bytes rather than a UnifiedTtsResponse.
        byte[] audio = unifiedTtsService.synthesize(req);
        assertNotNull(audio, "Audio bytes should not be null");
        assertTrue(audio.length > 0, "Audio bytes should not be empty");

        // Create a test-result directory under the project directory and write the file there.
        Path projectDir = Paths.get(System.getProperty("user.dir"));
        Path resultDir = projectDir.resolve("test-result");
        Files.createDirectories(resultDir);
        Path out = resultDir.resolve(System.currentTimeMillis() + ".mp3");
        Path written = unifiedTtsService.synthesizeToFile(req, out);
        System.out.println("UnifiedTTS test output: " + written.toAbsolutePath());
        assertTrue(Files.exists(written), "Output file should exist");
        assertTrue(Files.size(written) > 0, "Output file size should be > 0");
    }
}
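A non-empty file is a weak check: an error payload saved with an .mp3 extension would also pass. As a possible extra assertion, the sketch below tests whether the bytes start like an MP3 stream, either with an ID3v2 tag or with an MPEG audio frame sync. The class Mp3Sniffer is hypothetical and the check is a heuristic on the first bytes only, not a full validation of the file.

```java
import java.nio.charset.StandardCharsets;

public class Mp3Sniffer {

    /** True if the data starts with an ID3v2 tag or an MPEG audio frame sync. */
    public static boolean looksLikeMp3(byte[] data) {
        if (data == null || data.length < 3) {
            return false;
        }
        // ID3v2 tags begin with the ASCII bytes "ID3".
        if (data[0] == 'I' && data[1] == 'D' && data[2] == '3') {
            return true;
        }
        // An MPEG audio frame header begins with 11 set bits:
        // a 0xFF byte followed by a byte whose top three bits are set.
        return (data[0] & 0xFF) == 0xFF && (data[1] & 0xE0) == 0xE0;
    }

    public static void main(String[] args) {
        // A JSON error body is not mistaken for audio.
        byte[] json = "{\"success\":false}".getBytes(StandardCharsets.US_ASCII);
        System.out.println(looksLikeMp3(json));
    }
}
```

In the unit test this could be used as assertTrue(Mp3Sniffer.looksLikeMp3(audio)) right after the synthesize call, assuming the requested format is mp3.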

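4. Run and verify

After running the unit test, the generated audio file can be found in the test-result directory under the project root.

5. Common parameters and voice selection

The commonly used parameters are the fields of UnifiedTtsRequest shown earlier: model, voice, text, speed, pitch, volume, and format. Refer to the API documentation for the values each one accepts.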

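Summary

This article showed how to integrate the EdgeTTS capability of UnifiedTTS into Spring Boot to convert text to speech and write it out as mp3. UnifiedTTS hides the differences between TTS models behind one API, so you can switch between models to balance cost and quality without maintaining several SDKs. Depending on your requirements, you can go on to harden exception handling and add caching and concurrency control to build a more reliable, production-grade TTS service.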
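As one example of the caching mentioned above, the sketch below keeps synthesized audio in memory, keyed by a hash of the request parameters, so identical requests reach the TTS backend only once. Everything here is hypothetical and simplified: the class TtsCacheSketch, the Synthesizer interface standing in for the real service call, and the choice of model, voice, and text as the cache key. The map is unbounded, so a real service would need eviction.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.HexFormat;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;

public class TtsCacheSketch {

    /** Stand-in for the real synthesis call. */
    public interface Synthesizer {
        byte[] synthesize(String model, String voice, String text);
    }

    private final Map<String, byte[]> cache = new ConcurrentHashMap<>();
    private final Synthesizer synthesizer;

    public TtsCacheSketch(Synthesizer synthesizer) {
        this.synthesizer = synthesizer;
    }

    /** Returns cached audio for identical parameters; the backend is called once per key. */
    public byte[] get(String model, String voice, String text) {
        // computeIfAbsent runs the mapping function at most once per key,
        // so concurrent identical requests also share a single backend call.
        return cache.computeIfAbsent(key(model, voice, text),
                k -> synthesizer.synthesize(model, voice, text));
    }

    /** SHA-256 over the parts, with a zero byte after each so ("ab","c") and ("a","bc") differ. */
    public static String key(String... parts) {
        try {
            MessageDigest digest = MessageDigest.getInstance("SHA-256");
            for (String part : parts) {
                digest.update(part.getBytes(StandardCharsets.UTF_8));
                digest.update((byte) 0);
            }
            return HexFormat.of().formatHex(digest.digest());
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("SHA-256 is not available", e);
        }
    }

    public static void main(String[] args) {
        AtomicInteger backendCalls = new AtomicInteger();
        TtsCacheSketch cache = new TtsCacheSketch((model, voice, text) -> {
            backendCalls.incrementAndGet();
            return text.getBytes(StandardCharsets.UTF_8); // fake audio for the demo
        });
        cache.get("edge-tts", "en-US-JennyNeural", "Hello");
        cache.get("edge-tts", "en-US-JennyNeural", "Hello");
        System.out.println("backend calls: " + backendCalls.get());
    }
}
```

The returned array is the cached instance itself, so callers should treat it as read-only or copy it before modifying it.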
