x86 - 通过 RETF 从 32 位切换到 64 位

Question

Pato

Asked: 2025-03-04 19:16:06 +0800 CST2025-03-04 19:16:06 +0800 CST 2025-03-04 19:16:06 +0800 CST

68000 汇编 – 通过堆栈传递字符串连接参数

772

我正在开发一个Motorola 68000汇编程序，该程序使用子例程连接两个字符串。挑战在于通过堆栈实现输入和输出的参数传递，因此我专注于正确设置和恢复堆栈。

我在Sep Roland和Erik Eidt的帮助下开发了程序逻辑。之后，我研究了如何使用堆栈传递参数，这就是为什么我的代码有大量注释的原因。

任务要求：

用68000 汇编语言实现一个使用堆栈传递参数的子程序。
该子程序接受两个输入字符串：
- A ="Hello"
- B ="World"
它将它们连接成输出字符串C，结果为：
- C ="HelloWorld"
主程序应该：
1. 通过推送参数来准备堆栈。
2. 调用子程序。
3. 函数返回后正确恢复堆栈。

我的实现：

          ORG $8000
   
;DATA
StringA DC.B 'Hello',0    ; First string with a null terminator
StringB DC.B 'World',0    ; Second string with a null terminator
StringC DS.B 256          ; Buffer for the concatenated string 

START: 

; The stack pointer (A7) starts at address $8000. 
; In the 68000 architecture, A7 always points to the memory address where 
; the next value will be saved (push operation).

      pea.l StringC ; Equivalent to [move.l #StringC, -(a7)]
                    ; The stack pointer (A7) is decremented by 4 (pushing a longword = 4 bytes)
                    ; Initial A7 = $8000, now A7 = $7FFC
      
      pea.l StringB  ; A7 = $7FF8
      pea.l StringA  ; A7 = $7FF4

; Therefore, the stack (from lowest to highest address) contains:
; A7 = $7FF4  |StringA address| 
; A7 = $7FF8  |StringB address| 
; A7 = $7FFC  |StringC address| 
; A7 = $8000 (original SP value before the push operations)

      bsr.s CopyStrings     ; Call the first subroutine, saving the PC (Program Counter)
                            ; onto the stack
                                
; When executing bsr.s, the processor:
; - Saves the return address (PC) on the stack (another 4 bytes subtracted from A7).
; - Then branches to CopyStrings.

; Upon returning from the subroutine (rts), the stack pointer A7 will remain 
; where the subroutine left it. However, we need to clean up the three parameters 
; (StringA, StringB, StringC) that we previously pushed.

      addq.l #8,a7  ; Restore 8 bytes of the stack
      addq.l #4,a7  ; Restore the remaining 4 bytes (total 12 bytes)

      SIMHALT 

CopyStrings:
      ; At the entry of the subroutine, the stack looks like this:
      ; A7    |Return Address | 
      ; A7+4  |StringA Address| 
      ; A7+8  |StringB Address| 
      ; A7+12 |StringC Address|
      
      move.l 4(a7),a0  ; Retrieve the address of StringA 
      move.l 8(a7),a1  ; Retrieve the address of StringB
      move.l 12(a7),a2 ; Retrieve the address of StringC 
      
CopyA: 
      move.b (a0)+,(a2)+  ; Load a character from StringA into StringC
                          
      bne.s CopyA         ; If the character is not null, continue copying
      subq.l #1,a2        ; Move back 1 byte to overwrite the null terminator

CopyB:
      move.b (a1)+,(a2)+  ; Load a character from StringB into StringC
      bne.s CopyB         ; If the character is not null, continue copying
      rts                 ; Return from subroutine
    
     END START

问题：

我通过堆栈传递参数的方法正确吗？
我应该考虑哪些优化或最佳实践？

任何反馈都将不胜感激！

2 个回答

Voted

Erik Eidt · Answer 1 · 2025-03-04T23:17:22+08:00

对于在堆栈上传递参数的 C 风格调用，您的方法是正确的。C 风格（尤其是较旧的 C）以反向传递参数，以便它们在堆栈上按正向顺序出现。这对于可变函数（例如printf）特别有用。此外，调用者会清理推送的参数，这对可变函数也特别有用。较旧的 C 编译器将所有函数视为潜在的可变函数，因为早期函数原型并不是真正需要的。这意味着您可以省略参数（例如可选参数），或传递额外的参数，并且由于调用者知道它推送了什么，因此它负责弹出。

另一方面，Pascal 不支持可变参数函数，因此会按正向顺序传递参数，并在返回时由被调用方从堆栈中删除参数。由于返回地址实际上妨碍了传递的参数，因此芯片设计人员制作了一个特殊的返回和释放指令，rtd该指令支持函数返回到堆栈顶部的地址，但也支持在通过弹出获得返回地址后弹出参数。（如果没有该指令，被调用方清理结尾必须将返回地址弹出到寄存器中，从堆栈中弹出参数，然后使用寄存器中的返回地址进行间接跳转）。

我认为较新的 C 能够在声明（即，在生成返回/结尾的代码时）和使用（即，在调用时、在调用站点）时清楚地区分可变参数函数和采用固定参数的函数，因此，虽然可能还会向后传递参数，但能够将其用于rtd非可变参数函数。

此外，68k 的现代调用约定可能会在寄存器中传递至少 6 个项，在 d0-d2 中传递 3 个项，在 a0-a2 中传递 3 个项，具体取决于类型（无论是指针还是整数）。溢出参数将进入堆栈（而可变函数可能会在堆栈上传递所有参数）。

您的函数没有输出/返回值。如果有，并且您希望通过堆栈传递，则调用者可以在传递参数之前将零或未初始化的字/长推入堆栈，以便被调用者仍可以使用rtd返回并释放除返回值之外的所有内容。

还有一个问题是，使用两个 16 位addq指令弹出是否比使用单个较长的（32 位）addi指令更好。我会选择较长的指令，以减少指令数量，虽然我不知道 68k 系列各个型号的确切时间，但我怀疑这可能是相同或更快的。

Sep Roland · Answer 2 · 2025-03-05T02:53:02+08:00

我通过堆栈传递参数的方法正确吗？

没问题，但您可以将这两条addq.l #.., a7指令合并为一条adda.l #12, a7。在 68000 上，这将在 14 个时钟内运行，比您编写的少 2 个时钟。

我应该考虑哪些优化或最佳实践？

现在数组的地址已在堆栈中，您有绝佳的机会在CopyStrings子例程中少破坏一个地址寄存器。这在较大的程序中非常有用。请注意，您不能使用简单的加载地址寄存器。指令集有一条专门用于此的指令。movemovea

CopyStrings:
      ; At the entry of the subroutine, the stack looks like this:
      ; A7    |Return Address | 
      ; A7+4  |StringA Address| 
      ; A7+8  |StringB Address| 
      ; A7+12 |StringC Address|
      
      movea.l 4(a7), a0  ; Retrieve the address of StringA
      movea.l 12(a7), a1 ; Retrieve the address of StringC
CopyA:
      move.b  (a0)+, (a1)+
      bne.s   CopyA
      subq.l  #1, a1     ; Move back 1 byte to overwrite the null terminator

      movea.l 8(a7), a0  ; Retrieve the address of StringB
CopyB:
      move.b  (a0)+, (a1)+
      bne.s   CopyB
      rts

68000 汇编 – 通过堆栈传递字符串连接参数

任务要求：

我的实现：

问题：

重新格式化数字，在固定位置插入分隔符

为什么 C++20 概念会导致循环约束错误，而老式的 SFINAE 不会？

VScode 自动卸载扩展的问题（Material 主题）

Vue 3：创建时出错“预期标识符但发现‘导入’”[重复]

具有指定基础类型但没有枚举器的“枚举类”的用途是什么？

如何修复未手动导入的模块的 MODULE_NOT_FOUND 错误？

`(表达式，左值) = 右值` 在 C 或 C++ 中是有效的赋值吗？为什么有些编译器会接受/拒绝它？

在 C++ 中，一个不执行任何操作的空程序需要 204KB 的堆，但在 C 中则不需要

PowerBI 目前与 BigQuery 不兼容：Simba 驱动程序与 Windows 更新有关

AdMob：MobileAds.initialize() - 对于某些设备，“java.lang.Integer 无法转换为 java.lang.String”

68000 汇编 – 通过堆栈传递字符串连接参数

任务要求：

我的实现：

问题：

2 个回答

相关问题