yaxpeax-x86 - yaxpeax x86 decoder

Age	Commit message (Collapse)	Author
2021-07-03	clean up x86_32 and make interfaces match x86_64	iximeow

2021-07-03	prefixes on 0f01-series opcodes are more strict	iximeow

2021-07-03	add hreset	iximeow

2021-07-03	port over x86_64 improvements to x86_32	iximeow

2021-07-03	support pconfig/tme	iximeow

2021-07-03	reject instructions when their opcode is `Invalid`	iximeow
	the evex route would allow "valid" instructions that have the opcode `invalid`. this is.. not correct.
2021-07-03	fix incorrect rex prefix selection	iximeow

2021-07-02	adjust decode logic for better pipelining	iximeow
	at least on my zen2. when reading prefixes, optimize for the likely case of reading an instruction rather than an invalid run of prefixes. checking if we've exceeded the x86 length bound immediately after reading the byte is only a benefit if we'd otherwise read an impossibly-long instruction; in this case we can exit exactly at prefix byte 15 rather than potentially later at byte 16 (assuming a one-byte instruction like `c3`), or byte ~24 (a more complex store with immediate and displacement). these casese are extremely unlikely in practice. more likely is that reading a prefix byte is one of the first two or three bytes in an instruction, and we will never benefit from checking the x86 length bound at this point. instead, only check length bounds after decoding the entire instruction. this penalizes the slowest path through the decoder but speeds up the likely path about 5% on my zen2 processor. additionally, begin reading instruction bytes as soon as we enter the decoder, and before initial clearing of instruction data. again, this is for zen2 pipeline reasons. reading the first byte and corresponding `OPCODES` entry improves the odds that this data is available by the time we check for `Interpretation::Prefix` in the opcode scanning loop. then, if we did not load an instruction, we immediately know another byte must be read; begin reading this byte before applying `rex` prefixes, and as soon as a prefix is known to not be one of the escape-code prefix byte (c5, c4, 62, 0f). this clocked in at another ~5% in total. i've found that `read_volatile` is necessary to force rust to begin the loadwhere it's written, rather than reordering it over other data. i'm not committed to this being a guaranteed truth. also, don't bother checking for `Invalid`. again, `Opcode::Invalid` is a relatively unlikely path through the decoder and `Nothing` is already optiimized for `None` cases. this appears to be another small improvement in throughput but i wouldn't want to give it a number - it was relatively small and may not be attributable to this effect.
2021-07-02	intel keylocker instructions that access memory have memory access sizes	iximeow

2021-07-02	fix several strict rejection for several	iximeow

2021-07-02	consolidate length/extension checks to help compilers DCE	iximeow

2021-07-01	fix warnings	iximeow

2021-07-01	reorder prefix checks	iximeow
	this measures a bit faster. it doesn't seem like it should be. the rex prefix checks compile identically but move a lea for a later expression up and pipelines better?
2021-07-01	reallocate OperandCode, convert disparate registers to array	iximeow
	also remove redundant assignments of operand_count and some OperandSpec, bulk-assign all registers and operands on entry to `read_instr`. this all, taken together, shaves off about 7 cycles per decode.
2021-07-01	making opcode u32 reduces a stall?	iximeow

2021-07-01	complete yaxpeax-arch 0.1.0 adaptation, shore up .mem_size()	iximeow

2021-07-01	update yaxpeax-x86 to yaxpeax-arch 0.1.0 interfaces	iximeow

2021-06-29	fix several lingering mem_size discrepancies	iximeow

2021-06-28	remove old movsx/movzx-related memory size hacks	iximeow

2021-06-28	protected mode memory sizes	iximeow
	also some long-mode cleanup in corresponding areas
2021-06-27	protected-mode avx512	iximeow

2021-06-27	remove support for nonexistent prefixes	iximeow

2021-06-27	PartialEq impls for data in instructiosn, and Instruction itself	iximeow

2021-06-27	all tests now passing for long mode	iximeow

2021-06-27	report memory sizes for all long-mode instructions	iximeow

2021-06-26	awkward	iximeow
	i really didnt know rust could do this
2021-06-26	add long-mode avx512 support, except for compressed displacements	iximeow

2021-06-11	add extensive avx and initial avx2 tests, fix several bugs and missing ↵	iximeow
	instructions
2021-05-07	remove dead OperandSpec variants	iximeow

2021-03-21	remove some forgotten println comments	iximeow

2021-03-21	make Opcode, Operand, and DecodeError non_exhaustive	iximeow
	in the future these can and will change (new operands, new instructions) and i would prefer they not be major breaking changes. applications can ignore them and probably do undesired variants anyway. if you want to write a 1120-variant match, are you me? why would you do this
2021-03-21	in real programs, having read_operands inlined hurts performance!	iximeow
	the in-repo benchmark got better with this inlined but it's probably better to leave it up to the compiler when finally stitching stuff together. i suspect that having read_operands inlined resulted in just too many live values, and the compiler was inspired to play hijinks that pipelined poorly. disas-bench shows a ~15% improvement from this change.
2021-03-21	fuzzing shows resetting operands is not beneficial	iximeow

2021-03-21	fix potential successful decodes with Opcode::Invalid	iximeow
	vmov* are.. somehow messed up too
2021-03-21	add tsxldtrk	iximeow
	does intel know no bounds
2021-03-21	xed says setssbsy and saveprevssp are more permissive	iximeow

2021-03-21	add missing vpmaxuw, remove nonsense avx mov	iximeow

2021-03-21	complete CET support, add UINTR, add missing VORP{S,D}, other cleanup	iximeow

2021-03-21	add waitpkg, clean up unused values, old comments	iximeow

2021-03-21	add tdx	iximeow
	decoder flag to come
2021-03-21	rewrite 0f-based instruction handling	iximeow
	this is... a more significant rewrite than i expected yaxpeax-x86 to ever need. it turns out that capstone is extremely permissive about duplicative 66/f2/f3 prefixes to the point that the implemented prefex handling was unsalvageable. while this replaces the 0f opcode tables, i haven't profiled these changes. it's possible this is a net improvement for single-byte opcodes, it could be a net loss. code size may be severely impacted. there is still work to do. but this in total gets very close to iced/xed/zydis parity, far more than before. also adds several small extensions, gfni, 3dnow, enqcmd, invpcid, some of cet, and a few missing avx instructions.
2021-03-17	support several new extensions, 3dnow, and nuance in invalid operands	iximeow

2021-03-14	alternate display mode for c-style expressions	iximeow

2021-03-13	split ffi crate to support distinct 16, 32, and 64-bit builds	iximeow
	initial work to optionally discard any instruction printing support when using `-Z build-std` to fully remove .eh_frame, a stripped long_mode_no_fmt .so is 61kb!
2021-01-15	support xchg AX/reg0.1.5	iximeow

2021-01-15	small perf tweaks	iximeow
	clearing reg_rrr and reg_mmm more efficiently is an extremely small win, but a win read_imm_signed generally should inline well and runs afoul of some heuristic. inlining gets about 8% improved throughput on the (unrealistic) in-repo benchmark it would be great to be able to avoid bounds checks somehow; it looks like they alone are another ~10% of decode time. i'm not sure how to pull that off while retaining the generic iterator parameter. might just not be possible.
2021-01-15	fix several missing or invalid decodings among 0f01 opcodes	iximeow
	* `mwaitx`, `monitorx`, `rdpru`, and `clzero` are now supported * swapgs is no longer decoded in protected mode * rdpkru and wrpkru are no longer decoded if mod bits != 11
2020-11-19	fix decoding of rex-prefixed modrm+sib operands selecting index 0b100 and ↵0.1.4	iximeow
	base 0b101 for memory operands with a base, index, and displacement either the wrong base would be selected (register number ignored, so only `ax` or `r8` would be reported), or yaxpeax-x86 would report a base register is present when it is not (`RegIndexBaseScaleDisp` when the operand is actually `RegScaleDisp`) thank you to Evan Johnson for catching and reporting this bug! also bump crate version to 0.1.4 as this will be immediately tagged and released.
2020-10-27	fix misdecode of instructions in opcode 0x800.1.3	iximeow

2020-08-15	add RegSpec constructors, consts, and const fns0.1.2	iximeow