Added the IBM 5161 ISA expansion for PC and XT;
Cleaned up the parallel port emulation, added IRQ support, and made enabling/disabling per port;
Added the Award 430NX and the Intel Classic/PCI (Alfredo, 420TX);
Finished the 586MC1;
Added 8087 emulation;
Moved Cyrix 6x86'es to the Dev branch;
Sanitized/cleaned up memregs.c/h and intel.c/h;
Split the chipsets from machines and sanitized Port 92 emulation;
Added support for the 15bpp mode to the Compaq ATI 28800;
Moved the MR 386DX and 486 machines to the Dev branch;
Ported the new dynamic recompiler from PCem, but it remains in Dev branch until after v2.00;
Ported the new timer code from PCem;
Cleaned up the CPU table of unused stuff and better optimized its structure;
Ported the Open-XT and Open-AT from VARCem, the Open-AT is in the Dev branch;
Ported the XT MFM controller rewrite and the addition of more controllers (incl. two RLL ones) from VARCem;
Added the AHA-1540A and the BusTek BT-542B;
Moved the Sumo SCSI-AT to the Dev branch;
Minor IDE, FDC, and floppy drive code clean-ups;
Made NCR 5380/53C400-based cards' BIOS address configurable;
Got rid of the legacy romset variable;
Unified (video) buffer and buffer32 into one and made the unified buffer 32-bit;
Added the Amstrad PPC512 per PCem patch by John Elliott;
Switched memory mapping granularity from 16k to 4k (less than 4k not possible due to internal pages);
Rewrote the CL-GD 54xx blitter, fixes Win-OS/2 on the 54x6 among other things;
Added the Image Manager 1024 and Professional Graphics Controller per PCem patch by John Elliott and work done on VARCem;
Added Headland HT-216, GC-205 and Video 7 VGA 1024i emulation based on PCem commit;
Implemented the function keys for the Toshiba T1000/T1200/T3100 enhancement;
Amstrad MegaPC now works correctly with non-internal graphics cards;
The SLiRP code no longer casts a packed struct type to a non-packed struct type;
The Xi8088 and PB410a no longer hang on 86Box when PS/2 mouse is not present;
The S3 Virge on BeOS is no longer broken (was broken by build #1591);
OS/2 2.0 build 6.167 now sees key presses again;
Xi8088 now works on CGA again;
86F images converted from either the old or new variants of the HxC MFM format now work correctly;
Hardware interrupts with a vector of 0xFF are now handled correctly;
OPTi 495SX boards no longer incorrectly have 64 MB maximum RAM when 32 MB is correct;
Fixed VNC keyboard input bugs;
Fixed AT RTC periodic interrupt - Chicago 58s / 73f / 73g / 81 MIDI play no longer hangs with the build's own VTD driver;
Fixed mouse polling with internal mice - Amstrad and Olivetti mice now work correctly;
Triones ATAPI DMA driver now correctly reads a file at the end of a CD image with a sector count not divisible by 4;
Compaq Portable now works with all graphics cards;
Fixed various MDSI Genius bugs;
Added segment limit checks and improved page fault checks for several CPU instructions - Memphis 15xx WINSETUP and Chicago 58s WINDISK.CPL no longer issue a GPF, and some S3 drivers that used to have glitches, now work correctly;
Further improved the 808x emulation, also fixes the noticeably choppy sound when using 808x CPUs, also fixes #355;
OS/2 installer no longer locks up on splash screen on PS/2 Model 70 and 80, fixes #400.
Fixed several Amstrad bugs, GEM no longer crashes on the Amstrad 1640, fixes #391.
Ported John Elliott's Amstrad fixes and improvements from PCem, and fixed the default language so it's correctly English, fixes #278, fixes #389.
Fixed a minor IDE timing bug, fixes #388.
Fixed Toshiba T1000 RAM issues, fixes #379.
Fixed EGA/(S)VGA overscan border handling, fixes #378;
Got rid of the now long useless IDE channel 2 auto-removal, fixes #370;
Fixed the BIOS files used by the AMSTRAD PC1512, fixes #366;
Ported the Unicode CD image file name fix from VARCem, fixes #365;
Fixed high density floppy disks on the Xi8088, fixes #359;
Fixed some bugs in the Hercules emulation, fixes #346, fixes #358;
Fixed the SCSI hard disk mode sense pages, fixes #356;
Removed the AMI Unknown 386SX because of impossibility to identify the chipset, closes #349;
Fixed bugs in the serial mouse emulation, fixes #344;
Compiled 86Box binaries now include all the required .DLL's, fixes #341;
Made some combo boxes in the Settings dialog slightly wider, fixes #276.
2019-09-20 14:02:30 +02:00
#ifdef __aarch64__

#include <stdlib.h>
#include <stdint.h>
#include <string.h> /* memset() */
2020-03-29 14:24:42 +02:00
#include <86box/86box.h>
#include "cpu.h"
#include <86box/mem.h>
#include "codegen.h"
#include "codegen_allocator.h"
#include "codegen_backend.h"
#include "codegen_backend_arm64_defs.h"
#include "codegen_backend_arm64_ops.h"
#include "codegen_reg.h"
#include "x86.h"
#include "x87.h"

#if defined(__linux__) || defined(__APPLE__)
#include <sys/mman.h>
#include <unistd.h>
#endif
#if defined WIN32 || defined _WIN32
#include <windows.h>
#endif

void *codegen_mem_load_byte;
void *codegen_mem_load_word;
void *codegen_mem_load_long;
void *codegen_mem_load_quad;
void *codegen_mem_load_single;
void *codegen_mem_load_double;

void *codegen_mem_store_byte;
void *codegen_mem_store_word;
void *codegen_mem_store_long;
void *codegen_mem_store_quad;
void *codegen_mem_store_single;
void *codegen_mem_store_double;

void *codegen_fp_round;
void *codegen_fp_round_quad;

void *codegen_gpf_rout;
void *codegen_exit_rout;

host_reg_def_t codegen_host_reg_list[CODEGEN_HOST_REGS] =
{
        {REG_X19, 0},
        {REG_X20, 0},
        {REG_X21, 0},
        {REG_X22, 0},
        {REG_X23, 0},
        {REG_X24, 0},
        {REG_X25, 0},
        {REG_X26, 0},
        {REG_X27, 0},
        {REG_X28, 0}
};

host_reg_def_t codegen_host_fp_reg_list[CODEGEN_HOST_FP_REGS] =
{
        {REG_V8, 0},
        {REG_V9, 0},
        {REG_V10, 0},
        {REG_V11, 0},
        {REG_V12, 0},
        {REG_V13, 0},
        {REG_V14, 0},
        {REG_V15, 0}
};

static void build_load_routine(codeblock_t *block, int size, int is_float)
{
        uint32_t *branch_offset;
        uint32_t *misaligned_offset;
        int offset;

        /*In - W0 = address
          Out - W0 = data, W1 = abrt*/
        /*Byte case shown; larger sizes also branch to the slow path (*)
          when the address is misaligned.
          MOV W1, W0, LSR #12
          MOV X2, #readlookup2
          LDR X1, [X2, X1, LSL #3]
          CMP X1, #-1
          BEQ +
          LDRB W0, [X1, X0]
          MOV W1, #0
          RET
        * STP X29, X30, [SP, #-16]!
          BL readmembl
          LDRB W1, cpu_state.abrt
          LDP X29, X30, [SP], #16
          RET
        */
        codegen_alloc(block, 80);
        host_arm64_MOV_REG_LSR(block, REG_W1, REG_W0, 12);
        host_arm64_MOVX_IMM(block, REG_X2, (uint64_t)readlookup2);
        host_arm64_LDRX_REG_LSL3(block, REG_X1, REG_X2, REG_X1);
        if (size != 1)
        {
                host_arm64_TST_IMM(block, REG_W0, size-1);
                misaligned_offset = host_arm64_BNE_(block);
        }
        host_arm64_CMPX_IMM(block, REG_X1, -1);
        branch_offset = host_arm64_BEQ_(block);
        if (size == 1 && !is_float)
                host_arm64_LDRB_REG(block, REG_W0, REG_W1, REG_W0);
        else if (size == 2 && !is_float)
                host_arm64_LDRH_REG(block, REG_W0, REG_W1, REG_W0);
        else if (size == 4 && !is_float)
                host_arm64_LDR_REG(block, REG_W0, REG_W1, REG_W0);
        else if (size == 4 && is_float)
                host_arm64_LDR_REG_F32(block, REG_V_TEMP, REG_W1, REG_W0);
        else if (size == 8)
                host_arm64_LDR_REG_F64(block, REG_V_TEMP, REG_W1, REG_W0);
        host_arm64_MOVZ_IMM(block, REG_W1, 0);
        host_arm64_RET(block, REG_X30);

        /*Slow path: page not in the lookup table, or misaligned access*/
        host_arm64_branch_set_offset(branch_offset, &block_write_data[block_pos]);
        if (size != 1)
                host_arm64_branch_set_offset(misaligned_offset, &block_write_data[block_pos]);
        host_arm64_STP_PREIDX_X(block, REG_X29, REG_X30, REG_XSP, -16);
        if (size == 1)
                host_arm64_call(block, (void *)readmembl);
        else if (size == 2)
                host_arm64_call(block, (void *)readmemwl);
        else if (size == 4)
                host_arm64_call(block, (void *)readmemll);
        else if (size == 8)
                host_arm64_call(block, (void *)readmemql);
        else
                fatal("build_load_routine - unknown size %i\n", size);
        codegen_direct_read_8(block, REG_W1, &cpu_state.abrt);
        if (size == 4 && is_float)
                host_arm64_FMOV_S_W(block, REG_V_TEMP, REG_W0);
        else if (size == 8)
                host_arm64_FMOV_D_Q(block, REG_V_TEMP, REG_X0);
        host_arm64_LDP_POSTIDX_X(block, REG_X29, REG_X30, REG_XSP, 16);
        host_arm64_RET(block, REG_X30);
}

static void build_store_routine(codeblock_t *block, int size, int is_float)
{
        uint32_t *branch_offset;
        uint32_t *misaligned_offset;
        int offset;

        /*In - W0 = address, W1 = data
          Out - W1 = abrt*/
        /*Byte case shown; larger sizes also branch to the slow path (*)
          when the address is misaligned.
          MOV W2, W0, LSR #12
          MOV X3, #writelookup2
          LDR X2, [X3, X2, LSL #3]
          CMP X2, #-1
          BEQ +
          STRB W1, [X2, X0]
          MOV W1, #0
          RET
        * STP X29, X30, [SP, #-16]!
          BL writemembl
          LDRB W1, cpu_state.abrt
          LDP X29, X30, [SP], #16
          RET
        */
        codegen_alloc(block, 80);
        host_arm64_MOV_REG_LSR(block, REG_W2, REG_W0, 12);
        host_arm64_MOVX_IMM(block, REG_X3, (uint64_t)writelookup2);
        host_arm64_LDRX_REG_LSL3(block, REG_X2, REG_X3, REG_X2);
        if (size != 1)
        {
                host_arm64_TST_IMM(block, REG_W0, size-1);
                misaligned_offset = host_arm64_BNE_(block);
        }
        host_arm64_CMPX_IMM(block, REG_X2, -1);
        branch_offset = host_arm64_BEQ_(block);
        if (size == 1 && !is_float)
                host_arm64_STRB_REG(block, REG_X1, REG_X2, REG_X0);
        else if (size == 2 && !is_float)
                host_arm64_STRH_REG(block, REG_X1, REG_X2, REG_X0);
        else if (size == 4 && !is_float)
                host_arm64_STR_REG(block, REG_X1, REG_X2, REG_X0);
        else if (size == 4 && is_float)
                host_arm64_STR_REG_F32(block, REG_V_TEMP, REG_X2, REG_X0);
        else if (size == 8)
                host_arm64_STR_REG_F64(block, REG_V_TEMP, REG_X2, REG_X0);
        host_arm64_MOVZ_IMM(block, REG_X1, 0);
        host_arm64_RET(block, REG_X30);

        /*Slow path: page not in the lookup table, or misaligned access*/
        host_arm64_branch_set_offset(branch_offset, &block_write_data[block_pos]);
        if (size != 1)
                host_arm64_branch_set_offset(misaligned_offset, &block_write_data[block_pos]);
        host_arm64_STP_PREIDX_X(block, REG_X29, REG_X30, REG_XSP, -16);
        if (size == 4 && is_float)
                host_arm64_FMOV_W_S(block, REG_W1, REG_V_TEMP);
        else if (size == 8)
                host_arm64_FMOV_Q_D(block, REG_X1, REG_V_TEMP);
        if (size == 1)
                host_arm64_call(block, (void *)writemembl);
        else if (size == 2)
                host_arm64_call(block, (void *)writememwl);
        else if (size == 4)
                host_arm64_call(block, (void *)writememll);
        else if (size == 8)
                host_arm64_call(block, (void *)writememql);
        else
                fatal("build_store_routine - unknown size %i\n", size);
        codegen_direct_read_8(block, REG_W1, &cpu_state.abrt);
        host_arm64_LDP_POSTIDX_X(block, REG_X29, REG_X30, REG_XSP, 16);
        host_arm64_RET(block, REG_X30);
}

static void build_loadstore_routines(codeblock_t *block)
{
        codegen_mem_load_byte = &block_write_data[block_pos];
        build_load_routine(block, 1, 0);
        codegen_mem_load_word = &block_write_data[block_pos];
        build_load_routine(block, 2, 0);
        codegen_mem_load_long = &block_write_data[block_pos];
        build_load_routine(block, 4, 0);
        codegen_mem_load_quad = &block_write_data[block_pos];
        build_load_routine(block, 8, 0);
        codegen_mem_load_single = &block_write_data[block_pos];
        build_load_routine(block, 4, 1);
        codegen_mem_load_double = &block_write_data[block_pos];
        build_load_routine(block, 8, 1);

        codegen_mem_store_byte = &block_write_data[block_pos];
        build_store_routine(block, 1, 0);
        codegen_mem_store_word = &block_write_data[block_pos];
        build_store_routine(block, 2, 0);
        codegen_mem_store_long = &block_write_data[block_pos];
        build_store_routine(block, 4, 0);
        codegen_mem_store_quad = &block_write_data[block_pos];
        build_store_routine(block, 8, 0);
        codegen_mem_store_single = &block_write_data[block_pos];
        build_store_routine(block, 4, 1);
        codegen_mem_store_double = &block_write_data[block_pos];
        build_store_routine(block, 8, 1);
}

static void build_fp_round_routine(codeblock_t *block, int is_quad)
{
        uint64_t *jump_table;

        codegen_alloc(block, 80);
        host_arm64_LDR_IMM_W(block, REG_TEMP, REG_CPUSTATE, (uintptr_t)&cpu_state.new_fp_control - (uintptr_t)&cpu_state);
        /*ADR +12 skips the ADR, LDR and BR instructions, so REG_TEMP2
          points at the jump table emitted directly after them*/
        host_arm64_ADR(block, REG_TEMP2, 12);
        host_arm64_LDR_REG_X(block, REG_TEMP2, REG_TEMP2, REG_TEMP);
        host_arm64_BR(block, REG_TEMP2);

        /*Four 64-bit entries, one per x87 rounding mode*/
        jump_table = (uint64_t *)&block_write_data[block_pos];
        block_pos += 4*8;

        jump_table[X87_ROUNDING_NEAREST] = (uint64_t)(uintptr_t)&block_write_data[block_pos]; //tie even
        if (is_quad)
                host_arm64_FCVTNS_X_D(block, REG_TEMP, REG_V_TEMP);
        else
                host_arm64_FCVTNS_W_D(block, REG_TEMP, REG_V_TEMP);
        host_arm64_RET(block, REG_X30);

        jump_table[X87_ROUNDING_UP] = (uint64_t)(uintptr_t)&block_write_data[block_pos]; //pos inf
        if (is_quad)
                host_arm64_FCVTPS_X_D(block, REG_TEMP, REG_V_TEMP);
        else
                host_arm64_FCVTPS_W_D(block, REG_TEMP, REG_V_TEMP);
        host_arm64_RET(block, REG_X30);

        jump_table[X87_ROUNDING_DOWN] = (uint64_t)(uintptr_t)&block_write_data[block_pos]; //neg inf
        if (is_quad)
                host_arm64_FCVTMS_X_D(block, REG_TEMP, REG_V_TEMP);
        else
                host_arm64_FCVTMS_W_D(block, REG_TEMP, REG_V_TEMP);
        host_arm64_RET(block, REG_X30);

        jump_table[X87_ROUNDING_CHOP] = (uint64_t)(uintptr_t)&block_write_data[block_pos]; //zero
        if (is_quad)
                host_arm64_FCVTZS_X_D(block, REG_TEMP, REG_V_TEMP);
        else
                host_arm64_FCVTZS_W_D(block, REG_TEMP, REG_V_TEMP);
        host_arm64_RET(block, REG_X30);
}

void codegen_backend_init()
{
        codeblock_t *block;
        int c;
#if defined(__linux__) || defined(__APPLE__)
        void *start;
        size_t len;
        long pagesize = sysconf(_SC_PAGESIZE);
        long pagemask = ~(pagesize - 1);
#endif

        codeblock = malloc(BLOCK_SIZE * sizeof(codeblock_t));
        codeblock_hash = malloc(HASH_SIZE * sizeof(codeblock_t *));

        memset(codeblock, 0, BLOCK_SIZE * sizeof(codeblock_t));
        memset(codeblock_hash, 0, HASH_SIZE * sizeof(codeblock_t *));

        for (c = 0; c < BLOCK_SIZE; c++)
        {
                codeblock[c].pc = BLOCK_PC_INVALID;
        }

        block_current = 0;
        block_pos = 0;
        block = &codeblock[block_current];
        block->head_mem_block = codegen_allocator_allocate(NULL, block_current);
        block->data = codeblock_allocator_get_ptr(block->head_mem_block);
        block_write_data = block->data;
        build_loadstore_routines(block);

        codegen_fp_round = &block_write_data[block_pos];
        build_fp_round_routine(block, 0);
        codegen_fp_round_quad = &block_write_data[block_pos];
        build_fp_round_routine(block, 1);

        codegen_alloc(block, 80);
        codegen_gpf_rout = &block_write_data[block_pos];
        host_arm64_mov_imm(block, REG_ARG0, 0);
        host_arm64_mov_imm(block, REG_ARG1, 0);
        host_arm64_call(block, (void *)x86gpf);

        codegen_exit_rout = &block_write_data[block_pos];
        host_arm64_LDP_POSTIDX_X(block, REG_X19, REG_X20, REG_XSP, 64);
        host_arm64_LDP_POSTIDX_X(block, REG_X21, REG_X22, REG_XSP, 16);
        host_arm64_LDP_POSTIDX_X(block, REG_X23, REG_X24, REG_XSP, 16);
        host_arm64_LDP_POSTIDX_X(block, REG_X25, REG_X26, REG_XSP, 16);
        host_arm64_LDP_POSTIDX_X(block, REG_X27, REG_X28, REG_XSP, 16);
        host_arm64_LDP_POSTIDX_X(block, REG_X29, REG_X30, REG_XSP, 16);
        host_arm64_RET(block, REG_X30);

        block_write_data = NULL;

        codegen_allocator_clean_blocks(block->head_mem_block);

        asm("mrs %0, fpcr\n"
                : "=r" (cpu_state.old_fp_control)
        );
}

void codegen_set_rounding_mode(int mode)
{
        if (mode < 0 || mode > 3)
                fatal("codegen_set_rounding_mode - invalid mode\n");
        /*Stored pre-multiplied by 8 so that build_fp_round_routine() can use
          it directly as a byte offset into its table of 64-bit entries*/
        cpu_state.new_fp_control = mode << 3;
}

/*REG_CPUSTATE - cpu_state*/
void codegen_backend_prologue(codeblock_t *block)
{
        block_pos = BLOCK_START;

        /*Entry code*/

        host_arm64_STP_PREIDX_X(block, REG_X29, REG_X30, REG_XSP, -16);
        host_arm64_STP_PREIDX_X(block, REG_X27, REG_X28, REG_XSP, -16);
        host_arm64_STP_PREIDX_X(block, REG_X25, REG_X26, REG_XSP, -16);
        host_arm64_STP_PREIDX_X(block, REG_X23, REG_X24, REG_XSP, -16);
        host_arm64_STP_PREIDX_X(block, REG_X21, REG_X22, REG_XSP, -16);
        host_arm64_STP_PREIDX_X(block, REG_X19, REG_X20, REG_XSP, -64);

        host_arm64_MOVX_IMM(block, REG_CPUSTATE, (uint64_t)&cpu_state);

        if (block->flags & CODEBLOCK_HAS_FPU)
        {
                host_arm64_LDR_IMM_W(block, REG_TEMP, REG_CPUSTATE, (uintptr_t)&cpu_state.TOP - (uintptr_t)&cpu_state);
                host_arm64_SUB_IMM(block, REG_TEMP, REG_TEMP, block->TOP);
                host_arm64_STR_IMM_W(block, REG_TEMP, REG_XSP, IREG_TOP_diff_stack_offset);
        }
}

void codegen_backend_epilogue(codeblock_t *block)
{
        host_arm64_LDP_POSTIDX_X(block, REG_X19, REG_X20, REG_XSP, 64);
        host_arm64_LDP_POSTIDX_X(block, REG_X21, REG_X22, REG_XSP, 16);
        host_arm64_LDP_POSTIDX_X(block, REG_X23, REG_X24, REG_XSP, 16);
        host_arm64_LDP_POSTIDX_X(block, REG_X25, REG_X26, REG_XSP, 16);
        host_arm64_LDP_POSTIDX_X(block, REG_X27, REG_X28, REG_XSP, 16);
        host_arm64_LDP_POSTIDX_X(block, REG_X29, REG_X30, REG_XSP, 16);
        host_arm64_RET(block, REG_X30);

        codegen_allocator_clean_blocks(block->head_mem_block);
}

#endif